refactor: examples data loading for tests #17893

ofekisr · 2021-12-29T19:11:53Z

The way the examples data is generated and loaded into the database as SQL data and as superset objects
is not flexible, configurable and in case you would like deterministic data it does not support it.
In addition, there are many reusing ideas and duplicated codes.

For example, in another PR I'm trying to develop I encountered surprising behavior when I tried to load world bank data.
I thought the way it is loaded and cleaned up be the same as loading the birth_names data but no.

This PR is the first one of achieving this. It abstract the way the raw data is generated.

codecov · 2021-12-30T11:46:35Z

Codecov Report

Merging #17893 (cfe2c2e) into master (8ebec60) will decrease coverage by 0.17%.
The diff coverage is 74.62%.

❗ Current head cfe2c2e differs from pull request most recent head bc7568a. Consider uploading reports for the commit bc7568a to get more accurate results

@@            Coverage Diff             @@
##           master   #17893      +/-   ##
==========================================
- Coverage   67.10%   66.92%   -0.18%     
==========================================
  Files        1609     1612       +3     
  Lines       64897    64988      +91     
  Branches     6866     6872       +6     
==========================================
- Hits        43547    43495      -52     
- Misses      19484    19626     +142     
- Partials     1866     1867       +1

Flag	Coverage Δ
hive	`53.32% <ø> (-28.50%)`	⬇️
mysql	`?`
postgres	`?`
presto	`53.16% <ø> (-28.96%)`	⬇️
python	`82.32% <ø> (-0.42%)`	⬇️
sqlite	`81.94% <ø> (+0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
...-chart-controls/src/sections/advancedAnalytics.tsx	`33.33% <ø> (ø)`
...ntend/packages/superset-ui-core/src/color/index.ts	`100.00% <ø> (ø)`
...s/legacy-plugin-chart-country-map/src/countries.ts	`100.00% <ø> (ø)`
...legacy-plugin-chart-partition/src/controlPanel.tsx	`25.00% <ø> (ø)`
...gins/legacy-plugin-chart-rose/src/controlPanel.tsx	`50.00% <ø> (ø)`
.../plugins/legacy-preset-chart-nvd3/src/Bar/index.js	`66.66% <ø> (ø)`
...gins/legacy-preset-chart-nvd3/src/DistBar/index.js	`66.66% <ø> (ø)`
...ins/legacy-preset-chart-nvd3/src/DualLine/index.js	`66.66% <ø> (ø)`
...gins/legacy-preset-chart-nvd3/src/NVD3Controls.tsx	`95.83% <ø> (ø)`
...plugins/legacy-preset-chart-nvd3/src/ReactNVD3.jsx	`0.00% <ø> (ø)`
... and 99 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 8ebec60...bc7568a. Read the comment docs.

villebro

A few minor comments, but I agree we could do with more consolidation on the example data front, as it has evolved over a long time and is inconsistent. One main question I have is can you elaborate on the need for the generator/factory/impl/singleton design being proposed here? I'm not opposed to it, but it feels like a lot of abstraction for something that may not be required. For example, in the fixture where we do return list(BirthNamesGeneratorFactory.make().generate()), I wonder if we couldn't just do something like return BirthNamesExample.get_data()? Point being, if we don't anticipate we'll need a generator (here we're just putting list` around what's being yielded) or a full blown singleton pattern, why add one?

tests/common/example_data_generator/base.py

tests/common/example_data_generator/birth_names_generator_factory.py

add tests for common

ofekisr · 2022-01-11T11:59:59Z

@villebro @amitmiran137
I added tests for common
I changed to use "..."

regarding the abstractions: this is how I'm coding ... design to interfaces
the one who uses the generator should not aware of how it generates (decouple implementations details)
Why do you fear using abstractions?
the singleton is inside the abstract class so the client is only aware of the API. the concrete details of how the generator is built are encapsulated, but there is a way to change the implementations if required and should be done in separate setup file

amitmiran137

Lgtm

* refactor: replace the way the birth_names data is generated * refactor: replace the way the birth_names data is generated * refactor structure add tests for common

pull-request-size bot added the size/XXL label Dec 29, 2021

amitmiran137 changed the title ~~Test/refactor data loading~~ refactor: examples data loading for tests Dec 29, 2021

refactor: replace the way the birth_names data is generated

0a5d2ae

ofekisr force-pushed the test/refactor_data_loading branch from cc223ea to 0a5d2ae Compare December 30, 2021 11:26

pull-request-size bot added size/L and removed size/XXL labels Dec 30, 2021

ofekisr marked this pull request as ready for review December 30, 2021 11:27

amitmiran137 requested a review from villebro December 30, 2021 14:19

villebro reviewed Jan 7, 2022

View reviewed changes

tests/common/example_data_generator/base.py Outdated Show resolved Hide resolved

tests/common/example_data_generator/birth_names_generator_factory.py Outdated Show resolved Hide resolved

tests/common/example_data_generator/birth_names_generator_factory.py Outdated Show resolved Hide resolved

ofekisr added 2 commits January 9, 2022 17:19

refactor: replace the way the birth_names data is generated

948cd30

refactor structure

bc7568a

add tests for common

pull-request-size bot added size/XL and removed size/L labels Jan 11, 2022

amitmiran137 approved these changes Jan 11, 2022

View reviewed changes

amitmiran137 merged commit 7fc6a2f into apache:master Jan 11, 2022

amitmiran137 deleted the test/refactor_data_loading branch January 11, 2022 12:16

mistercrunch added 🏷️ bot A label used by `supersetbot` to keep track of which PR where auto-tagged with release labels 🚢 1.5.0 labels Mar 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: examples data loading for tests #17893

refactor: examples data loading for tests #17893

ofekisr commented Dec 29, 2021 •

edited

Loading

codecov bot commented Dec 30, 2021 •

edited

Loading

villebro left a comment

ofekisr commented Jan 11, 2022

amitmiran137 left a comment

refactor: examples data loading for tests #17893

refactor: examples data loading for tests #17893

Conversation

ofekisr commented Dec 29, 2021 • edited Loading

codecov bot commented Dec 30, 2021 • edited Loading

Codecov Report

villebro left a comment

Choose a reason for hiding this comment

ofekisr commented Jan 11, 2022

amitmiran137 left a comment

Choose a reason for hiding this comment

ofekisr commented Dec 29, 2021 •

edited

Loading

codecov bot commented Dec 30, 2021 •

edited

Loading