-
Notifications
You must be signed in to change notification settings - Fork 651
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BUG: modin on ray produce error with empty dataframes #5430
Comments
Surprisingly, in my case this problem appears with non-empty array after |
@Egor-Krivov you seem to have quite a funny Modin version reported, how did you install that? cc @dchigarev I wonder if your recent I've run the reproducer on my Windows laptop, and I'm seeing a different error here:
|
Indeed, my version is strange. I will reproduce my installation and check what will change after that. |
There are two distinct problems causing this behavior that I found so far (hope I won't find more :D):
|
Maybe this error log will help (this was not an empty dataframe):
|
Signed-off-by: Dmitry Chigarev <dmitry.chigarev@intel.com>
Modin version checks
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest released version of Modin.
I have confirmed this bug exists on the main branch of Modin. (In order to do this you can follow this guide.)
Reproducible Example
Issue Description
This code produces an error with modin on ray. However, works on pandas. I have a bit different code in my benchmark and it just works on pandas but fails on modin.
Expected Behavior
Modin on ray should behave like pandas.
Error Logs
Installed Versions
UserWarning: Setuptools is replacing distutils.
INSTALLED VERSIONS
commit : c30ab4c
python : 3.8.15.final.0
python-bits : 64
OS : Linux
OS-release : 5.4.0-124-generic
Version : #140-Ubuntu SMP Thu Aug 4 02:23:37 UTC 2022
machine : x86_64
processor : x86_64
byteorder : little
LC_ALL : None
LANG : en_US
LOCALE : en_US.ISO8859-1
Modin dependencies
modin : 0.7.3+1359.gc30ab4c1
ray : 2.0.1
dask : None
distributed : None
hdk : present
pandas dependencies
pandas : 1.5.1
numpy : 1.23.4
pytz : 2022.6
dateutil : 2.8.2
setuptools : 65.5.0
pip : 22.2.2
Cython : 0.29.32
pytest : 7.2.0
hypothesis : None
sphinx : None
blosc : None
feather : None
xlsxwriter : None
lxml.etree : None
html5lib : None
pymysql : None
psycopg2 : None
jinja2 : None
IPython : 8.6.0
pandas_datareader: None
bs4 : None
bottleneck : 1.3.5
brotli :
fastparquet : None
fsspec : 2022.11.0
gcsfs : None
matplotlib : 3.6.2
numba : None
numexpr : 2.8.4
odfpy : None
openpyxl : None
pandas_gbq : None
pyarrow : 6.0.0
pyreadstat : None
pyxlsb : None
s3fs : None
scipy : 1.9.3
snappy : None
sqlalchemy : None
tables : None
tabulate : None
xarray : None
xlrd : None
xlwt : None
zstandard : None
tzdata : None
The text was updated successfully, but these errors were encountered: