Include number of executors per node in cluster information #1119

Merged
5 commits merged into NVIDIA:dev from spark-rapids-tools-1117 on Jun 17, 2024

Conversation

@parthosa (Collaborator) commented Jun 13, 2024

Fixes #1117. Currently, for cluster information, we calculate the number of nodes correctly but do not track the number of executors per node. This can generate incorrect GPU cluster recommendations because a node may host multiple executors.

This PR adds numExecsPerNode to the cluster information output file.

Changes:

Core/Java:

  • Calculate numExecsPerNode as the maximum number of executors on any host (a minimal sketch of this follows the list).
  • Update ClusterInfo and related methods.
  • Add Num Executor Per Node as a new field in the cluster information output CSV file.
  • Update unit tests to include a case with multiple executors on a single node.
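
A minimal sketch of the per-host computation, not the exact tool code (the helper name is made up for illustration):

// Assumes the host name of every executor is available as a sequence of strings.
// The first value is the maximum number of executors observed on any single host
// (numExecsPerNode); the second is the number of distinct hosts.
def summarizeExecutorHosts(executorHosts: Seq[String]): (Int, Int) = {
  if (executorHosts.isEmpty) {
    (0, 0)
  } else {
    val execsByHost = executorHosts.groupBy(identity)
    (execsByHost.values.map(_.size).max, execsByHost.size)
  }
}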

Output

Cluster Information Generated from Core:

File: rapids_4_spark_qualification_output_cluster_information.json

Previously:

[ {
  "appName" : "NDS - Power Run",
  "appId" : "application_169685947xxxxx",
  "eventLogPath" : "file:/Users/psarthi/Work/event-logs/xxxxxx",
  "clusterInfo" : {
    "vendor" : "dataproc",
    "coresPerExecutor" : 16,
    "numExecutorNodes" : 4,
    "driverHost" : "xxxx-dataproc-cpu-m.c.xxxx.internal"
  }
} ]

After this fix:

[ {
  "appName" : "NDS - Power Run",
  "appId" : "application_169685947xxxxx",
  "eventLogPath" : "file:/Users/psarthi/Work/event-logs/xxxxxx",
  "clusterInfo" : {
    "vendor" : "dataproc",
    "coresPerExecutor" : 16,
    "numExecsPerNode" : 6,
    "numExecutorNodes" : 4,
    "driverHost" : "xxxx-dataproc-cpu-m.c.xxxx.internal"
  }
} ]

Follow Up

Signed-off-by: Partho Sarthi <psarthi@nvidia.com>
@parthosa added the bug, user_tools, and core_tools labels Jun 13, 2024
@parthosa self-assigned this Jun 13, 2024
Signed-off-by: Partho Sarthi <psarthi@nvidia.com>
@parthosa changed the title from "Include number of executors per node in cluster inference" to "Include number of executors per node in cluster information" Jun 13, 2024
@parthosa removed the user_tools label Jun 14, 2024
@parthosa marked this pull request as ready for review June 14, 2024 00:08
@@ -865,9 +865,13 @@ class QualificationAppInfo(
logWarning(s"Application $appId: Cluster with variable executor cores detected. " +
s"Using maximum value.")
}
// Group by host name, find max executors per host, and number of unique hosts
val groupedHosts = activeHosts.groupBy(identity)
tgravescs (Collaborator) commented:

So we don't want to just use activeHosts; we want to use all hosts to determine execs per node. That is more likely to be right because, as an application finishes, if it has dynamic allocation on it could release executors, and thus the final active count could be inaccurate.

Put the number of executor nodes back to the active hosts for now. Really we should try to do some timelining of this to see what the max number in use at any time was, but filed #1121 to follow up on that.

tgravescs (Collaborator) commented:

Also, it would be good to get an event log where dynamic allocation is disabled and the number of execs/hosts changes over time. This is relatively easy in interactive mode, where you just run something, wait for the execs to idle timeout, then maybe run something again, maybe smaller.

parthosa (Collaborator, Author) commented:

"we don't want to just use activeHosts, we want to use all hosts to determine execs per node."

Updated the code to use all hosts.

"Put the number of executor nodes back to the active hosts for now"

Reverted to the original logic.
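
In other words, the two metrics are now derived from different host sets. A rough sketch of the idea, with illustrative names rather than the actual tool internals:

// Assumes we track every host that ever ran an executor (allHosts) separately
// from the hosts with executors still active at the end (activeHosts).
// Execs per node is derived from all hosts; the node count stays based on the
// active hosts, as discussed above.
case class HostCounts(numExecsPerNode: Int, numExecutorNodes: Int)

def computeHostCounts(allHosts: Seq[String], activeHosts: Seq[String]): HostCounts = {
  val maxExecsPerNode =
    if (allHosts.isEmpty) 0
    else allHosts.groupBy(identity).values.map(_.size).max
  HostCounts(maxExecsPerNode, activeHosts.distinct.size)
}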

@parthosa (Collaborator, Author) commented Jun 14, 2024:

Created a scenario for this (I think this meant dynamic allocation to be enabled):

"get a event log where dynamic allocation is disabled and the number of execs/hosts change over time"

[screenshot omitted]

Steps:

  1. Create a Dataproc cluster with 4 n1-standard-16 worker nodes.
  2. Start a spark shell with --conf spark.dynamicAllocation.enabled=true --conf spark.executor.instances=8 --conf spark.executor.cores=8
  3. Run a large spark application. UI shows all 8 executors are active (2 per worker node).
  4. Wait for 30 min; 7 executors are dead.
  5. Run a small spark application. UI shows 1 executor is active.

Now, since we are calculating execs from all hosts, we get the correct number of execs per host (2) instead of 1.
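
For reference, the workloads in steps 3 and 5 could look like the following spark-shell input; the PR does not include the exact jobs, so these are placeholders:

// Hypothetical spark-shell commands; the actual jobs used are not shown in the PR.
// Step 3: a large job that keeps all 8 executors busy.
spark.range(0L, 50000000000L, 1L, 512).selectExpr("sum(id)").show()
// Step 4: let the shell sit idle until dynamic allocation removes idle executors.
// Step 5: a small job that needs only a single executor/task.
spark.range(0L, 1000L, 1L, 1).count()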

Signed-off-by: Partho Sarthi <psarthi@nvidia.com>
@parthosa requested a review from tgravescs June 14, 2024 19:51
Signed-off-by: Partho Sarthi <psarthi@nvidia.com>
tgravescs previously approved these changes Jun 14, 2024

@tgravescs (Collaborator) left a comment:

do we have any tests with the dynamic allocation test you had in the description?

Signed-off-by: Partho Sarthi <psarthi@nvidia.com>
@parthosa (Collaborator, Author) commented:

"do we have any tests with the dynamic allocation test you had in the description?"

Added a unit test for dynamic allocation with a comment.

@tgravescs (Collaborator) commented:

"We cannot perform coresPerNode = numExecsPerNode * coresPerExecutor since a node may be oversubscribed."

There is no way for us to know this on some platforms like YARN, where we don't know what it schedules by. For now we will just have to assume the node is not oversubscribed, but document that and warn the user.
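
Under that assumption, the node size can only be estimated from the executor layout. A small sketch of the calculation, with illustrative names rather than the actual tool code:

// If executors do not oversubscribe the node, numExecsPerNode * coresPerExecutor
// is a lower bound on the node's core count. On YARN this cannot be verified from
// the event log, so the result should be documented as an estimate and a warning
// emitted to the user.
def estimatedCoresPerNode(numExecsPerNode: Int, coresPerExecutor: Int): Int =
  numExecsPerNode * coresPerExecutor

// Using the reproduction above (2 executors per node, 8 cores each), this yields
// 16 cores per node, matching the n1-standard-16 workers.
val cores = estimatedCoresPerNode(numExecsPerNode = 2, coresPerExecutor = 8)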

@tgravescs merged commit 18e8e7f into NVIDIA:dev Jun 17, 2024
17 checks passed
amahussein added a commit to amahussein/spark-rapids-tools that referenced this pull request Jun 25, 2024:

Signed-off-by: Ahmed Hussein (amahussein) <a@ahussein.me>

This is a follow-up to NVIDIA#1119

This fix only works for Dataproc when the cluster argument is not in the CLI.
The instance type will be set after multiplying `numExecutorCores` by `numExecutorsPerNode`.
@parthosa deleted the spark-rapids-tools-1117 branch October 9, 2024 17:11
Labels: bug (Something isn't working), core_tools (Scope the core module (scala))
Projects: None yet
Development: Successfully merging this pull request may close these issues:
[BUG] Cluster Information: Include number of executors per node
2 participants