[BUG] Unnecessary to cache the batches that will be sent to Python in `FlatMapGroupInPandas`. #2238

firestarman · 2021-04-23T01:50:31Z

Actually, this is more of an improvement than a bug.
The code snippet is below:

        .map { groupBatch =>
          // Cache the input batches for release after writing done.
          queue.add(groupBatch, spillCallback)
          groupBatch
        }

The Python runner closes the batches after writing them to Python, so there is no need to cache them in the queue for later release.
This needs to be cleaned up.
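
To illustrate the ownership argument above, here is a minimal sketch, not the plugin's actual code. `FakeBatch`, `FakeWriter`, and `NoCacheSketch` are hypothetical stand-ins invented for this example; the only behavior assumed from the issue is that the writer closes each batch once it has been written.

```scala
// Hypothetical stand-in for a columnar batch; it only tracks whether it was closed.
final class FakeBatch(val id: Int) extends AutoCloseable {
  private var closed = false
  def isClosed: Boolean = closed
  override def close(): Unit = { closed = true }
}

// Hypothetical stand-in for the Python runner's writer.
// Assumption taken from the issue: the writer closes a batch after writing it.
object FakeWriter {
  def writeAll(batches: Iterator[FakeBatch]): Int = {
    var written = 0
    batches.foreach { batch =>
      try {
        written += 1 // pretend to serialize the batch to the Python process
      } finally {
        batch.close() // the writer releases the batch itself
      }
    }
    written
  }
}

object NoCacheSketch {
  // Returns (number of batches written, whether every batch ended up closed).
  def run(): (Int, Boolean) = {
    val input = (1 to 3).map(new FakeBatch(_))
    // The batches go straight to the writer; no side queue holds extra references.
    val written = FakeWriter.writeAll(input.iterator)
    (written, input.forall(_.isClosed))
  }
}
```

Under that assumption, every batch is released by the writer alone, so a separate queue that keeps the batches only to close them later has nothing left to do.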

firestarman · 2021-04-23T02:23:58Z

MapInPandas has a similar issue. Code snippet

firestarman added the bug and ? - Needs Triage labels Apr 23, 2021

firestarman self-assigned this Apr 23, 2021

firestarman mentioned this issue Apr 23, 2021

Do not cache the batches in Map and Group Map Panda UDF nodes. #2239

Merged

sameerz added this to the Apr 26 - May 7 milestone Apr 26, 2021

sameerz removed the ? - Needs Triage label Apr 27, 2021

jlowe closed this as completed in #2239 Apr 29, 2021
