Handle case where `_LIST` type is empty #1703

judahrand · 2021-07-11T11:53:27Z

What this PR does / why we need it:
This PR fixes several type-inference issues with empty list features.

In my opinion these fixes should be a stopgap; the way Feast deals with types really ought to be overhauled.

Which issue(s) this PR fixes:

Fixes #

Does this PR introduce a user-facing change?:

Fix an issue that prevented empty lists from being materialized into online stores or retrieved from historical stores.

feast-ci-bot · 2021-07-11T11:53:36Z

Hi @judahrand. Thanks for your PR.

I'm waiting for a feast-dev member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

judahrand · 2021-07-11T11:53:51Z

Fixes part of #1640, but not the whole issue, as this does not deal with int lists.

woop · 2021-07-11T17:41:56Z

Thanks @judahrand, should we perhaps add a test to ensure that the change you've introduced works?

codecov-commenter · 2021-07-11T17:43:54Z

Codecov Report

Merging #1703 (69e43ac) into master (21f1ef7) will decrease coverage by 0.20%.
The diff coverage is 40.86%.

@@            Coverage Diff             @@
##           master    #1703      +/-   ##
==========================================
- Coverage   62.29%   62.09%   -0.21%     
==========================================
  Files          96       96              
  Lines        7363     7424      +61     
==========================================
+ Hits         4587     4610      +23     
- Misses       2776     2814      +38

| Flag | Coverage Δ | |
|---|---|---|
| unittests | `62.09% <40.86%> (-0.21%)` | ⬇️ |

Flags with carried forward coverage won't be shown. Click here to find out more.

| Impacted Files | Coverage Δ | |
|---|---|---|
| ...s/integration/registration/test_universal_types.py | `35.41% <18.60%> (-5.82%)` | ⬇️ |
| sdk/python/tests/data/data_creator.py | `34.78% <33.33%> (-3.32%)` | ⬇️ |
| sdk/python/feast/feature.py | `74.24% <50.00%> (-0.76%)` | ⬇️ |
| sdk/python/feast/type_map.py | `51.53% <54.16%> (+3.26%)` | ⬆️ |
| sdk/python/feast/online_response.py | `84.90% <71.42%> (-7.60%)` | ⬇️ |
| sdk/python/tests/conftest.py | `70.31% <0.00%> (-1.57%)` | ⬇️ |

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 21f1ef7...69e43ac. Read the comment docs.

judahrand · 2021-09-17T19:47:37Z

Hey @woop. Took me a while to get to this but it is easier now that the tests have been reworked a tad. Added the test you asked for. Worth revisiting?

judahrand · 2021-09-17T21:09:50Z

Wanted to go back and get the integration tests to run with the code from master + my test to make sure this is actually still broken. Also to see if those tests that failed a second ago were just flakey.

Looks like I need approval to run them again... @achals

judahrand · 2021-09-17T21:26:46Z

> Wanted to go back and get the integration tests to run with the code from master + my test to make sure this is actually still broken. Also to see if those tests that failed a second ago were just flakey.
>
> Looks like I need approval to run them again... @achals

I think you might have to unlabel and re-label ok-to-test? I would run them locally but they seem to fail even with no changes on my system (possibly macOS related 🤷‍♂️ )

judahrand · 2021-09-17T21:30:06Z

Hmmm... Maybe I'm wrong... Any idea why the integration tests are skipping?

adchia · 2021-09-19T16:35:26Z

Re python_type_to_feast_value_type:

What are the downsides to relaxing the null constraint on inference in this method? It seems relatively normal to have some feature values be null / empty e.g. a user history feature for a new user.

judahrand · 2021-09-19T16:41:23Z

> Re python_type_to_feast_value_type:
>
> What are the downsides to relaxing the null constraint on inference in this method? It seems relatively normal to have some feature values be null / empty e.g. a user history feature for a new user.

What would you suggest it should return? ValueType.UNKNOWN? I'm not sure what the downstream consequences of that might be?

adchia · 2021-09-19T17:00:41Z

I think ValueType.UNKNOWN would probably throw an error since we need some type, and presumably it got here in many cases because there was no supplied type (e.g. "inference" code)

For a value like `[120, null, 1, 3, 4]`, I'd expect that to still infer to an int list that we'd want to support.

There are likely downstream effects of asserting that, though I think the only real store we "support" right now is BQ, and materialization crashes there (#1839)
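To make the `[120, null, 1, 3, 4]` case concrete, here is a minimal sketch of inferring a list's element type while skipping nulls. The helper name `infer_list_element_type` is hypothetical and is not part of Feast; it only illustrates the behaviour being asked for.

```python
from typing import Any, Optional, Sequence


def infer_list_element_type(values: Sequence[Any]) -> Optional[type]:
    """Return the common Python type of the non-null elements.

    Returns None when nothing can be inferred, i.e. the list is empty
    or contains only nulls. (Hypothetical helper, not a Feast API.)
    """
    # Nulls carry no type information, so leave them out of the vote.
    element_types = {type(v) for v in values if v is not None}
    if not element_types:
        return None
    if len(element_types) > 1:
        names = sorted(t.__name__ for t in element_types)
        raise ValueError(f"Mixed element types: {names}")
    return element_types.pop()
```

With this, a list that mixes ints and nulls still infers to `int`, while an empty or all-null list yields `None` and has to be resolved some other way.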

judahrand · 2021-09-19T17:07:38Z

> I think ValueType.UNKNOWN would probably throw an error since we need some type, and presumably it got here in many cases because there was no supplied type (e.g. "inference" code)

Agreed.

> For a value like `[120, null, 1, 3, 4]`, I'd expect that to still infer to an int list that we'd want to support.
>
> There are likely downstream effects of asserting that, though I think the only real store we "support" right now is BQ, and materialization crashes there (#1839)

Yup, that's the approach I'm trying here: infer type from all the data provided rather than infer a type for each piece of data alone. This'll still fail in the case where all the data is empty lists (or all null) which isn't great in my opinion given that there is an underlying schema from the data source (at least for the DB sources).

Should we not be inferring types from the schema rather than the data really?
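A rough sketch of the "infer from all the data" idea, under stated assumptions: `SketchValueType` and `infer_list_column_type` are hypothetical stand-ins written for illustration, not the code in this PR or the real Feast `ValueType` enum. Rows are assumed to be plain Python lists or `None`.

```python
from enum import Enum
from typing import Any, Iterable, List, Optional


class SketchValueType(Enum):
    """Hypothetical stand-in for a feature store's value type enum."""

    UNKNOWN = 0
    INT64_LIST = 1
    DOUBLE_LIST = 2
    STRING_LIST = 3


# Assumed mapping from Python element types to list value types.
_ELEMENT_TO_LIST_TYPE = {
    int: SketchValueType.INT64_LIST,
    float: SketchValueType.DOUBLE_LIST,
    str: SketchValueType.STRING_LIST,
}


def infer_list_column_type(
    column: Iterable[Optional[List[Any]]],
) -> SketchValueType:
    """Infer one list type for a whole column, skipping empty and null rows."""
    inferred = SketchValueType.UNKNOWN
    for row in column:
        # A null row or an empty list contributes no elements.
        for element in row or []:
            if element is None:
                continue
            candidate = _ELEMENT_TO_LIST_TYPE.get(type(element))
            if candidate is None:
                raise TypeError(
                    f"Unsupported element type: {type(element).__name__}"
                )
            if inferred is SketchValueType.UNKNOWN:
                inferred = candidate
            elif candidate is not inferred:
                raise TypeError(
                    f"Inconsistent types: {inferred.name} vs {candidate.name}"
                )
    return inferred
```

A column such as `[[], [1, 2], None]` resolves to `INT64_LIST` because one row carries typed elements. A column of only empty lists still comes back as `UNKNOWN`, which is exactly the case where the source's declared schema would have to supply the type.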


judahrand · 2021-09-19T18:31:46Z

Tests pass!

Actually, for realz ready to be reviewed now @adchia


judahrand · 2021-09-21T14:14:45Z

Flakey test.... Seen it flake before.


adchia

/lgtm

feast-ci-bot · 2021-09-21T15:46:10Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: adchia, judahrand

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [adchia]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

judahrand requested review from achals, tsotnet, woop and a team as code owners July 11, 2021 11:53

feast-ci-bot added do-not-merge/release-note-label-needed needs-kind needs-ok-to-test labels Jul 11, 2021

feast-ci-bot added the size/S label Jul 11, 2021

judahrand force-pushed the empty-list branch from a1d4bba to e62b248 Compare September 17, 2021 18:54

judahrand requested review from adchia and felixwang9817 as code owners September 17, 2021 18:54

judahrand force-pushed the empty-list branch 2 times, most recently from 0817952 to 32db194 Compare September 17, 2021 19:45

judahrand force-pushed the empty-list branch from 32db194 to 696a5a6 Compare September 17, 2021 20:30

achals added ok-to-test and removed needs-ok-to-test labels Sep 17, 2021

judahrand force-pushed the empty-list branch 2 times, most recently from b4975e6 to b58c8c4 Compare September 17, 2021 21:05

feast-ci-bot added size/XS and removed size/S labels Sep 17, 2021

judahrand closed this Sep 17, 2021

judahrand reopened this Sep 17, 2021

judahrand force-pushed the empty-list branch from eafb0ff to e12fb63 Compare September 19, 2021 16:58

judahrand added 3 commits September 19, 2021 19:16

Infer type from all data

80364ad

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

Add one non-empty element to empty list test datasets

9cc642b

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

Accept empty lists in tests

61b40ec

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

judahrand force-pushed the empty-list branch from f7368ee to 61b40ec Compare September 19, 2021 18:16

Use ValueType.UNKNOWN instead of None

1eb019f

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

adchia reviewed Sep 20, 2021


judahrand added 3 commits September 20, 2021 20:00

Handle mix of null and non-null values better

0f8a744

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

Handle entity row type inference when Protobuf values used

1855356

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

Fix typo

8a6329f

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

adchia added the kind/bug label Sep 21, 2021

feast-ci-bot removed the needs-kind label Sep 21, 2021

adchia reviewed Sep 21, 2021

sdk/python/feast/online_response.py (outdated, resolved)

sdk/python/feast/online_response.py (outdated, resolved)

sdk/python/feast/online_response.py (resolved)

judahrand added 3 commits September 21, 2021 14:44

Make test config generate more clear

66e2d01

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

Add TODO

2f04fef

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

Be more strict about online entity type consistency

22e805b

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

judahrand added 3 commits September 21, 2021 16:38

Rename variable to be more precise

2c7015f

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

Add TODO: Add test where all lists are empty

c2a412d

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

Merge branch 'master' into empty-list

69e43ac

Signed-off-by: Judah Rand <17158624+judahrand@users.noreply.github.com>

adchia approved these changes Sep 21, 2021


feast-ci-bot added the lgtm label Sep 21, 2021

feast-ci-bot added the approved label Sep 21, 2021

feast-ci-bot merged commit 7d177b6 into feast-dev:master Sep 21, 2021
