CLN: Use generators instead of lists in built-in Python functions #18276

mroeschke · 2017-11-14T03:35:04Z

tests passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff

Pass generators to built-in Python functions instead of passing in a list, e.g. any([value for value in iterator]) --> any(value for value in iterator)
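
A minimal sketch of the pattern, for illustration only (the name `values` and the threshold are made up here and do not come from the pandas code base):

```python
values = [0, 3, 0, 7]

# Before: a temporary list is built first and then handed to any()
with_list = any([v > 5 for v in values])

# After: a generator expression is passed directly; no intermediate list
with_generator = any(v > 5 for v in values)

# Both spellings produce the same answer
assert with_list == with_generator
```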

codecov · 2017-11-14T08:34:30Z

Codecov Report

Merging #18276 into master will decrease coverage by 0.01%.
The diff coverage is 87.5%.

@@            Coverage Diff             @@
##           master   #18276      +/-   ##
==========================================
- Coverage    91.4%   91.38%   -0.02%     
==========================================
  Files         164      164              
  Lines       49878    49878              
==========================================
- Hits        45590    45581       -9     
- Misses       4288     4297       +9

Flag	Coverage Δ
#multiple	`89.19% <85.41%> (ø)`	⬆️
#single	`39.41% <22.91%> (-0.07%)`	⬇️

Impacted Files	Coverage Δ
pandas/util/testing.py	`100% <ø> (ø)`	⬆️
pandas/util/_doctools.py	`0% <0%> (ø)`	⬆️
pandas/core/frame.py	`97.8% <100%> (-0.1%)`	⬇️
pandas/io/parsers.py	`95.59% <100%> (ø)`	⬆️
pandas/core/indexes/base.py	`96.42% <100%> (ø)`	⬆️
pandas/io/json/normalize.py	`96.93% <100%> (ø)`	⬆️
pandas/core/internals.py	`94.54% <100%> (ø)`	⬆️
pandas/io/pytables.py	`92.84% <100%> (ø)`	⬆️
pandas/core/indexing.py	`92.8% <100%> (ø)`	⬆️
pandas/core/indexes/range.py	`95.66% <100%> (ø)`	⬆️
... and 11 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 69472f9...6402974. Read the comment docs.

codecov · 2017-11-14T08:34:51Z

Codecov Report

Merging #18276 into master will decrease coverage by 0.01%.
The diff coverage is 87.5%.

@@            Coverage Diff             @@
##           master   #18276      +/-   ##
==========================================
- Coverage    91.4%   91.38%   -0.02%     
==========================================
  Files         164      164              
  Lines       49878    49878              
==========================================
- Hits        45590    45581       -9     
- Misses       4288     4297       +9

Flag	Coverage Δ
#multiple	`89.19% <85.41%> (ø)`	⬆️
#single	`39.41% <22.91%> (-0.07%)`	⬇️

Impacted Files	Coverage Δ
pandas/util/testing.py	`100% <ø> (ø)`	⬆️
pandas/util/_doctools.py	`0% <0%> (ø)`	⬆️
pandas/core/internals.py	`94.54% <100%> (ø)`	⬆️
pandas/io/pytables.py	`92.84% <100%> (ø)`	⬆️
pandas/core/sparse/frame.py	`94.78% <100%> (ø)`	⬆️
pandas/core/indexes/base.py	`96.42% <100%> (ø)`	⬆️
pandas/core/frame.py	`97.8% <100%> (-0.1%)`	⬇️
pandas/core/indexes/range.py	`95.66% <100%> (ø)`	⬆️
pandas/io/html.py	`85.98% <100%> (ø)`	⬆️
pandas/core/indexing.py	`92.8% <100%> (ø)`	⬆️
... and 11 more


jreback · 2017-11-14T13:26:13Z

@mroeschke thanks.

can you add a linting check in ci/lint.sh to search for cases like this?
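
A rough sketch of what such a check could look for, written in Python rather than as the shell snippet that ci/lint.sh would need. The regular expression, the set of built-ins, and the helper name `find_offenders` are all assumptions for illustration, not the check that was eventually added:

```python
import re

# Hypothetical pattern: a built-in called directly on a list comprehension,
# e.g. any([x for x in xs]). The list of built-ins is illustrative only.
PATTERN = re.compile(r"\b(?:any|all|sum|max|min)\(\[.+\bfor\b.+\bin\b.+\]\)")


def find_offenders(source):
    """Return 1-based line numbers where the pattern matches."""
    return [
        lineno
        for lineno, line in enumerate(source.splitlines(), 1)
        if PATTERN.search(line)
    ]


sample = (
    "x = any([v for v in vals])\n"  # list comprehension inside any(): flagged
    "y = any(v for v in vals)\n"    # generator expression: fine
    "z = sum([1, 2, 3])\n"          # plain list literal: fine
)
offenders = find_offenders(sample)
```

A line-based regex like this would miss comprehensions that span several lines, so it is only a first approximation.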

jreback · 2017-11-14T13:26:34Z

does this have any perf impact ?

mroeschke · 2017-11-14T16:03:20Z

I ran one full asv and two asvs for benchmarks/frame_methods.py and got pretty inconsistent results. I can rerun if needed.

Sure I can include this check in ci/lint.sh. I'll be available to add it later in the week.

jreback · 2017-11-15T11:29:47Z

thanks

> Sure I can include this check in ci/lint.sh. I'll be available to add it later in the week.

would be great

jorisvandenbossche · 2017-11-15T16:54:28Z

I don't think you would expect to see any difference in asv. I think this is mainly about code style / redundancy. Certainly given that, apparently (just tried it out, didn't expect this), with small lists it can actually be a bit faster to use a list comprehension instead of a generator (for bigger ones there is a clear difference though):

In [168]: l = [True, False]*2

In [169]: %timeit any(i for i in l)
511 ns ± 7.38 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

In [170]: %timeit any([i for i in l])
448 ns ± 11.9 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

In [171]: l = [True, False]*100

In [172]: %timeit any(i for i in l)
514 ns ± 7.66 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

In [173]: %timeit any([i for i in l])
5.7 µs ± 39.5 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

(and the cases in our code typically involve small numbers, e.g. looping over the axes of a DataFrame (thus 2))
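
One likely reason the generator timing above stays flat as the list grows is that any() stops at the first truthy element, whereas the list comprehension always builds the full list first. A small sketch to illustrate this (the helper `consumed_by_any` is hypothetical, written only for this example):

```python
def consumed_by_any(items):
    """Return (result of any(), number of items actually pulled)."""
    count = 0

    def gen():
        nonlocal count
        for item in items:
            count += 1
            yield item

    result = any(gen())
    return result, count


data = [True, False] * 100

# any() short-circuits: only the first element is ever requested
result, pulled = consumed_by_any(data)

# The list comprehension materialises all elements before any() runs
materialised = len([i for i in data])
```

Here `pulled` is 1 while `materialised` is 200, regardless of how long `data` gets, as long as its first element is truthy.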

mroeschke added 2 commits November 13, 2017 19:16

CLN: Use generators where possible

d52560a

Fix redundant tertiary statement

6402974

jreback added the Code Style Code style, linting, code_checks label Nov 14, 2017

jreback added this to the 0.22.0 milestone Nov 15, 2017

jreback merged commit 9c799e2 into pandas-dev:master Nov 15, 2017

mroeschke mentioned this pull request Nov 17, 2017

CLN: Lint for lists instead of generators in built-in Python functions #18335

Merged

2 tasks

mroeschke deleted the use_generators branch December 20, 2017 02:04

mroeschke mentioned this pull request Mar 5, 2018

CLN: Python comprehensions and generators cleanup followup #19989

Merged

1 task
