How to improve the performance of tpcds with spark3.3.3？ #408

MrFireChow · 2024-03-12T06:33:34Z

MrFireChow
Mar 12, 2024

Hello！I build the environment of spark3.3.3 and blaze2.0.8， then i do some tests based on 100G tpcds data，however，I did not receive any benefits compared to not using blaze，this is my spark-sql command：
spark-sql --master spark://xxxx:xxxx --conf spark.sql.extensions=org.apache.spark.sql.blaze.BlazeSparkSessionExtension --conf spark.shuffle.manager=org.apache.spark.sql.execution.blaze.shuffle.BlazeShuffleManager --conf spark.blaze.enable.smjInequalityJoin=true
I want to know if I need to add any other parameters. Looking forward to your reply！

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to improve the performance of tpcds with spark3.3.3？ #408

{{title}}

Replies: 0 comments

Select a reply

How to improve the performance of tpcds with spark3.3.3？ #408

MrFireChow Mar 12, 2024

Replies: 0 comments

MrFireChow
Mar 12, 2024