Don't acquire the semaphore for empty input while scanning #4476

abellina · 2022-01-07T22:45:27Z

Signed-off-by: Alessandro Bellina <abellina@nvidia.com>

This PR removes the places where the scan code acquired the GpuSemaphore even though there was no input to process, specifically in the scans that are PartitionReaders. Given how the PartitionReader is used, nothing from these cases is consumed downstream, so the acquire is unnecessary.

PartitionReader provides a next() method that returns false when there is no work left. As far as I can tell, the PartitionReaderIterator wraps these readers and exposes the iterator interface (hasNext() returns false if the PartitionReader returned false from next()). This means next() no longer needs to acquire the semaphore when there is no work left in the reader.
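
The pattern described above can be sketched roughly as follows. This is an illustration only, not the plugin's code: `CountingSemaphore`, `SimpleReader`, `BatchReader`, and `ReaderIterator` are hypothetical stand-ins for GpuSemaphore, PartitionReader, a concrete scan reader, and PartitionReaderIterator. The point it shows is that the acquire happens only after the reader knows it has a batch to return.

```scala
// Hypothetical stand-in for GpuSemaphore: it only counts acquisitions.
final class CountingSemaphore {
  private var count = 0
  def acquireIfNecessary(): Unit = count += 1
  def acquisitions: Int = count
}

// Hypothetical stand-in for a PartitionReader-style interface:
// next() advances and reports whether a value is available, get() returns it.
trait SimpleReader[T] {
  def next(): Boolean
  def get(): T
}

// Reader over in-memory batches. It acquires the semaphore only when
// there is actually a batch to hand back.
final class BatchReader(batches: Seq[Seq[Int]], sem: CountingSemaphore)
    extends SimpleReader[Seq[Int]] {
  private val it = batches.iterator
  private var current: Option[Seq[Int]] = None

  override def next(): Boolean = {
    if (it.hasNext) {
      sem.acquireIfNecessary() // work exists, so the acquire is justified
      current = Some(it.next())
      true
    } else {
      current = None // no work left: return false without acquiring
      false
    }
  }

  override def get(): Seq[Int] =
    current.getOrElse(throw new NoSuchElementException("no current batch"))
}

// Adapts a reader to the Iterator interface: hasNext is false as soon as
// the reader's next() has returned false.
final class ReaderIterator[T](reader: SimpleReader[T]) extends Iterator[T] {
  private var advanced = false
  private var hasMore = false

  override def hasNext: Boolean = {
    if (!advanced) {
      hasMore = reader.next()
      advanced = true
    }
    hasMore
  }

  override def next(): T = {
    if (!hasNext) throw new NoSuchElementException("end of reader")
    advanced = false
    reader.get()
  }
}
```

With an empty `batches` sequence, `hasNext` returns false and the semaphore is never touched; with two batches it is acquired twice in this sketch, once per batch.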

The PR also adds an acquire in the aggregate for the case where we produce rows out of nothing. This should be a no-op, and it was likely just an oversight that it wasn't there before (the aggregate was protected by upstream nodes doing the acquire for it).
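
To illustrate why the aggregate needs its own acquire once the scan stops acquiring on empty input, here is a rough sketch under assumed semantics. `TaskSemaphore` and `GlobalCountSketch` are hypothetical names, and the aggregate is modeled as a global count that emits one row even when it receives no batches; the real operator is more involved.

```scala
// Hypothetical stand-in for GpuSemaphore with idempotent, per-task acquire.
final class TaskSemaphore {
  private var held = false
  private var realAcquires = 0
  def acquireIfNecessary(): Unit =
    if (!held) { held = true; realAcquires += 1 }
  def isHeld: Boolean = held
  def realAcquisitions: Int = realAcquires
}

object GlobalCountSketch {
  // Scan that acquires only when it produces a batch (assumed behavior).
  def scan(batches: Seq[Seq[Int]], sem: TaskSemaphore): Iterator[Seq[Int]] =
    batches.iterator.map { b => sem.acquireIfNecessary(); b }

  // Global count with no grouping keys: always emits exactly one row,
  // even for empty input, so it must hold the semaphore itself.
  def globalCount(input: Iterator[Seq[Int]], sem: TaskSemaphore): Iterator[Long] = {
    val total = input.map(_.size.toLong).sum
    // No-op if the scan already acquired; a real acquire if input was empty.
    sem.acquireIfNecessary()
    Iterator.single(total)
  }
}
```

When the scan yields batches, the aggregate's call changes nothing. When the scan yields nothing, the aggregate's call is the only acquire before the single output row is produced.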

The change shows modest gains in NDS so far. I want to run it a few more times, but so far I see 10 queries that are 1.20x+ faster, 39 queries that are 1.10x+ faster, and most queries (92) above 1.0x. I don't see much regression, except for q94, which is 2 seconds slower, but q94 falls back to the CPU and can be unpredictable. In other words, the results look mostly good, but I want to run a few more tests.

@revans2 wanted comments explaining why we are not acquiring the semaphore. I wasn't sure where to put them, and they are the same each time. I could see us moving them to the PartitionReaderIterator or somewhere else more appropriate.

abellina · 2022-01-07T22:48:33Z

Hmm, it looks like the build is failing because it could not create the Docker container.

jlowe

Looks good to me other than copyright nits.

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuMultiFileReader.scala

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuOrcScanBase.scala

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuParquetScanBase.scala

jlowe · 2022-01-10T17:00:14Z

build

abellina · 2022-01-10T17:51:49Z

I ran another iteration of NDS, with similar results. Most queries are slightly faster, with more significant gains in single-digit-second queries such as q2, q53, and q55, where we gain up to 3 seconds.

Overall I see a gain of ~42 seconds across all of NDS at 2TB.

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuMultiFileReader.scala

jlowe · 2022-01-11T14:58:08Z

build

jbrennan333

+1 this looks good to me

Don't acquire the semaphore for empty batches while scanning

cc677dc

Signed-off-by: Alessandro Bellina <abellina@nvidia.com>

sameerz added the performance label Jan 8, 2022

sameerz added this to the Jan 10 - Jan 28 milestone Jan 8, 2022

jlowe reviewed Jan 10, 2022

Update copyrights

179a1dc

jlowe previously approved these changes Jan 10, 2022

abellina marked this pull request as ready for review January 10, 2022 17:45

revans2 reviewed Jan 10, 2022

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuMultiFileReader.scala

Update comments in scans per review suggestion

76dce96

abellina dismissed jlowe’s stale review via 76dce96 January 11, 2022 14:49

abellina changed the title ~~Don't acquire the semaphore for empty batches while scanning~~ Don't acquire the semaphore for empty input while scanning Jan 11, 2022

jlowe approved these changes Jan 11, 2022

jbrennan333 approved these changes Jan 11, 2022

abellina merged commit b17c685 into NVIDIA:branch-22.02 Jan 11, 2022

abellina mentioned this pull request Jan 19, 2022

[FEA] could the parquet scan code avoid acquiring the semaphore for an empty batch? #4392

Closed

jlowe mentioned this pull request May 26, 2022

What's the update of RapidsShuffleManager to resolve the bottleneck for waiting to acquire the semaphore #5650

Open
