FIX-#4022: Fixed empty data frame with index #4910

AndreyPavlenko · 2022-08-31T21:46:12Z

What do these changes do?

Put the index into the columns when the frame has an index but no columns.
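As a rough illustration of the idea, here is a minimal sketch in plain pandas. It is not the code from this PR: `materialize_index` is a hypothetical helper, and using `reset_index()` to move the index into a regular column is an assumption about one way a column-oriented backend could keep an index-only frame.

```python
import pandas as pd


def materialize_index(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper, not modin's implementation.

    If the frame has rows but no columns, turn the index into a regular
    column so that a column-oriented store has something to hold.
    """
    if len(df.columns) == 0 and len(df.index) > 0:
        return df.reset_index()
    return df


# A frame that has only an index: three rows, zero columns.
index_only = pd.DataFrame(index=pd.Index([1, 2, 3], name="idx"))
stored = materialize_index(index_only)
```

After the call, `stored` has a single column named after the index, and the original labels are its values.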

commit message follows format outlined here
passes flake8 modin/ asv_bench/benchmarks scripts/doc_checker.py
passes black --check modin/ asv_bench/benchmarks scripts/doc_checker.py
signed commit with git commit -s
Resolves dataframe.loc() doesn`t work correctly at MODIN_STORAGE_FORMAT=omnisci #4022
tests added and passing
module layout described at docs/development/architecture.rst is up-to-date
added (Issue Number: PR title (PR Number)) and github username to release notes for next major release

codecov · 2022-08-31T21:56:55Z

Codecov Report

Merging #4910 (7c260ad) into master (45879a6) will decrease coverage by 15.07%.
The diff coverage is n/a.

❗ Current head 7c260ad differs from pull request most recent head fe3fde3. Consider uploading reports for the commit fe3fde3 to get more accurate results

@@             Coverage Diff             @@
##           master    #4910       +/-   ##
===========================================
- Coverage   84.34%   69.26%   -15.08%     
===========================================
  Files         267      267               
  Lines       19749    19746        -3     
===========================================
- Hits        16657    13678     -2979     
- Misses       3092     6068     +2976

Impacted Files	Coverage Δ
...odin/experimental/core/storage_formats/__init__.py	`0.00% <0.00%> (-100.00%)`	⬇️
...din/experimental/core/execution/native/__init__.py	`0.00% <0.00%> (-100.00%)`	⬇️
...erimental/core/storage_formats/omnisci/__init__.py	`0.00% <0.00%> (-100.00%)`	⬇️
.../core/execution/native/implementations/__init__.py	`0.00% <0.00%> (-100.00%)`	⬇️
...tive/implementations/omnisci_on_native/__init__.py	`0.00% <0.00%> (-100.00%)`	⬇️
...e/implementations/omnisci_on_native/io/__init__.py	`0.00% <0.00%> (-100.00%)`	⬇️
...mentations/omnisci_on_native/dataframe/__init__.py	`0.00% <0.00%> (-100.00%)`	⬇️
...tations/omnisci_on_native/partitioning/__init__.py	`0.00% <0.00%> (-100.00%)`	⬇️
...mentations/omnisci_on_native/calcite_serializer.py	`0.00% <0.00%> (-98.71%)`	⬇️
...plementations/omnisci_on_native/calcite_builder.py	`0.00% <0.00%> (-96.37%)`	⬇️
... and 48 more

ienkovich · 2022-09-15T18:57:37Z

modin/pandas/indexing.py

@@ -653,7 +653,7 @@ def __getitem__(self, key):
        --------
        pandas.DataFrame.loc
        """
-        if self.df.empty:
+        if len(self.df.index) == 0 and self.df.empty:
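For reference, the two guards in this hunk differ only for a frame that has an index but no columns. The sketch below evaluates both in plain pandas; it assumes modin's `empty` follows pandas semantics, and `old_guard` / `new_guard` are illustrative names that do not appear in the PR.

```python
import pandas as pd


def old_guard(df: pd.DataFrame) -> bool:
    # Condition before the change.
    return df.empty


def new_guard(df: pd.DataFrame) -> bool:
    # Condition after the change: also requires a zero-length index.
    return len(df.index) == 0 and df.empty


frames = {
    "no rows, no columns": pd.DataFrame(),
    "index only": pd.DataFrame(index=[1, 2, 3]),
    "columns only": pd.DataFrame(columns=["a"]),
    "rows and columns": pd.DataFrame({"a": [1]}),
}

# Map each frame description to (old_guard, new_guard).
results = {name: (old_guard(f), new_guard(f)) for name, f in frames.items()}
```

Only the "index only" frame changes branch: `DataFrame.empty` is true whenever either axis has length zero, while the new condition additionally requires the index itself to be empty.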


Why do you need these changes?

AndreyPavlenko · 2022-09-15

If the frame has only an index, the following lambda fails with a KeyError.

ienkovich · 2022-09-15

Is the problem specific to the OmniSci backend or common to all backends? Perhaps the problem is in the backend implementation of empty?

AndreyPavlenko · 2022-09-15

Actually, I could find only one implementation of this method: https://github.com/modin-project/modin/blob/master/modin/pandas/dataframe.py#L325

ienkovich · 2022-09-15

I don't understand why making the condition for defaulting to pandas stricter would resolve any issue. We should always be able to default to pandas.

AndreyPavlenko · 2022-09-15

You are right. Actually, the problem is not in default_to_pandas() but in loc[], which is called from the lambda. Pandas returns an empty Series for df.loc[1], but modin with the omnisci backend fails with KeyError: 'Column F_1 does not exist in schema'. This issue should be fixed instead.
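The pandas behaviour referred to above can be checked with a short sketch. It uses plain pandas only; the index labels are arbitrary, and it does not reproduce the modin/omnisci failure.

```python
import pandas as pd

# A frame with three index labels and no columns.
df = pd.DataFrame(index=[1, 2, 3])

# Selecting an existing label yields a Series with no entries,
# named after the selected label.
row = df.loc[1]

# A label that is absent from the index still raises KeyError.
try:
    df.loc[99]
    missing_raises = False
except KeyError:
    missing_raises = True
```

This is the result the modin frame is expected to match for `df.loc[1]`.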

ienkovich

LGTM!

Put index into columns if there are no columns but only index Signed-off-by: Andrey Pavlenko <andrey.a.pavlenko@gmail.com>

AndreyPavlenko force-pushed the issue-4022 branch 3 times, most recently from 1a05283 to 3f41a42 Compare September 1, 2022 18:54

AndreyPavlenko force-pushed the issue-4022 branch from 3f41a42 to 5ea0cbb Compare September 15, 2022 18:37

AndreyPavlenko marked this pull request as ready for review September 15, 2022 18:37

AndreyPavlenko requested review from a team as code owners September 15, 2022 18:37

ienkovich reviewed Sep 15, 2022

AndreyPavlenko force-pushed the issue-4022 branch 4 times, most recently from 97a1e31 to fe3fde3 Compare September 20, 2022 15:51

ienkovich previously approved these changes Sep 20, 2022

This was referenced Sep 21, 2022

Series.sample(), Series.repeat() and Series.empty() don`t work correctly at MODIN_STORAGE_FORMAT=omnisci #3984

Closed

dataframe.dropna() doesn`t work correctly at MODIN_STORAGE_FORMAT=omnisci #3933

Closed

Operations with empty frame fail #3428

Closed

AndreyPavlenko dismissed ienkovich’s stale review via 6e110f7 September 22, 2022 09:28

AndreyPavlenko force-pushed the issue-4022 branch from fe3fde3 to 6e110f7 Compare September 22, 2022 09:28

FIX-modin-project#4022: Fixed empty data frame with index

c1206c0

Put index into columns if there are no columns but only index Signed-off-by: Andrey Pavlenko <andrey.a.pavlenko@gmail.com>

AndreyPavlenko force-pushed the issue-4022 branch from 6e110f7 to c1206c0 Compare September 26, 2022 18:18

YarShev approved these changes Sep 27, 2022

YarShev merged commit 027f92a into modin-project:master Sep 27, 2022

YarShev mentioned this pull request Oct 4, 2022

REFACTOR: Remove c323f7fe385011ed849300155de07645.db file #5081

Closed
