
Any Performance Results ? #197

Open
luweizheng opened this issue Sep 18, 2024 · 1 comment
luweizheng commented Sep 18, 2024

I've checked the NDS-H benchmark in this repository, and it's quite similar to TPC-H. I tested Spark RAPIDS with TPC-H at SF100 on my server with 8 NVLink-connected NVIDIA A100 GPUs and found that the run with 8 GPU instances is not as fast as using CPUs. I also applied optimizations such as setting spark.sql.files.maxPartitionBytes=2gb and spark.sql.adaptive.enabled=true.

I am using both the Pandas API on Spark and Spark SQL. Spark SQL is faster, but some queries are still not as fast as running on the same server with CPUs only (no GPU).

Is this result expected?

Or does Spark RAPIDS only speed up certain data and query patterns, such as some of the NDS (TPC-DS) queries?

@gerashegalov gerashegalov added the "? - Needs Triage" (Need team to review and classify) label Nov 26, 2024
@mattahrens mattahrens self-assigned this Nov 26, 2024
@mattahrens mattahrens removed the "? - Needs Triage" (Need team to review and classify) label Nov 26, 2024
mattahrens (Collaborator) commented

Can you share the full set of Spark configuration settings that you used for your run? We have benchmarked NDS-H internally and all queries run faster on GPU, though we normally benchmark at a larger scale factor such as SF3000.

Here is a set of configs that we have used in our benchmarks:

                   "--conf" "spark.sql.adaptive.enabled=true"
                   "--conf" "spark.sql.files.maxPartitionBytes=2gb"
                   "--conf" "spark.driver.maxResultSize=2GB"
                   "--conf" "spark.driver.memory=50G"
                   "--conf" "spark.executor.cores=16"
                   "--conf" "spark.executor.memory=16G"
                   "--conf" "spark.executor.resource.gpu.amount=1"
                   "--conf" "spark.task.resource.gpu.amount=0.0625"
                   "--conf" "spark.rapids.memory.host.spillStorageSize=32G"
                   "--conf" "spark.rapids.memory.pinnedPool.size=8g"
                   "--conf" "spark.rapids.sql.concurrentGpuTasks=4"

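For context, configs like these are normally passed to spark-submit. A minimal sketch of how such a run might be launched is below; the master URL, jar path, and application script name are illustrative assumptions, not details from this thread (the plugin class com.nvidia.spark.SQLPlugin is the standard entry point for the RAPIDS Accelerator):

```shell
# Hypothetical spark-submit invocation using the configs above.
# Placeholder values: master URL, plugin jar path, and the benchmark script.
# Note: spark.task.resource.gpu.amount=0.0625 is 1/16, matching
# spark.executor.cores=16 so every core can schedule a task on the one GPU,
# while spark.rapids.sql.concurrentGpuTasks=4 caps how many of those tasks
# run GPU kernels concurrently.
spark-submit \
  --master yarn \
  --jars /opt/sparkRapidsPlugin/rapids-4-spark.jar \
  --conf spark.plugins=com.nvidia.spark.SQLPlugin \
  --conf spark.sql.adaptive.enabled=true \
  --conf spark.sql.files.maxPartitionBytes=2gb \
  --conf spark.driver.maxResultSize=2GB \
  --conf spark.driver.memory=50G \
  --conf spark.executor.cores=16 \
  --conf spark.executor.memory=16G \
  --conf spark.executor.resource.gpu.amount=1 \
  --conf spark.task.resource.gpu.amount=0.0625 \
  --conf spark.rapids.memory.host.spillStorageSize=32G \
  --conf spark.rapids.memory.pinnedPool.size=8g \
  --conf spark.rapids.sql.concurrentGpuTasks=4 \
  your_benchmark_script.py
```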