tangled-up-in-unicode dependency means installing pandas-profiling takes ~2GB #1068

Closed
3 tasks done
Julian opened this issue Sep 23, 2022 · 4 comments
Labels
dependencies 🔗 Pull requests that update a dependency file

Julian commented Sep 23, 2022

Current Behaviour

⊙  python3.10 -m venv venv && venv/bin/python -m pip install --quiet pandas-profiling
...
⊙  du -ha venv | sort -h | tail -n 20
25M	venv/lib/python3.10/site-packages/numpy/.dylibs
45M	venv/lib/python3.10/site-packages/statsmodels
54M	venv/lib/python3.10/site-packages/numpy
56M	venv/lib/python3.10/site-packages/pandas
103M	venv/lib/python3.10/site-packages/scipy
194M	venv/lib/python3.10/site-packages/tangled_up_in_unicode/u13_0_0_data/__pycache__/prop_list_to_property.cpython-310.pyc
194M	venv/lib/python3.10/site-packages/tangled_up_in_unicode/u14_0_0_data/__pycache__/prop_list_to_property.cpython-310.pyc
318M	venv/lib/python3.10/site-packages/tangled_up_in_unicode/u13_0_0_data/__pycache__/unicode_data_to_name_start.cpython-310.pyc
318M	venv/lib/python3.10/site-packages/tangled_up_in_unicode/u14_0_0_data/__pycache__/unicode_data_to_name_start.cpython-310.pyc
540M	venv/lib/python3.10/site-packages/tangled_up_in_unicode/__pycache__
540M	venv/lib/python3.10/site-packages/tangled_up_in_unicode/__pycache__/tangled_up_in_unicode_12_0_1.cpython-310.pyc
588M	venv/lib/python3.10/site-packages/tangled_up_in_unicode/u13_0_0_data/__pycache__
588M	venv/lib/python3.10/site-packages/tangled_up_in_unicode/u14_0_0_data/__pycache__
602M	venv/lib/python3.10/site-packages/tangled_up_in_unicode/u13_0_0_data
602M	venv/lib/python3.10/site-packages/tangled_up_in_unicode/u14_0_0_data
1.8G	venv/lib/python3.10/site-packages/tangled_up_in_unicode
2.1G	venv
2.1G	venv/lib
2.1G	venv/lib/python3.10
2.1G	venv/lib/python3.10/site-packages

(This is filed upstream at dylan-profiler/tangled-up-in-unicode#10, which has some detail on why, but I'm filing it here too for visibility.)
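
To check another environment for the same pattern, here is a minimal Python sketch that totals the size of compiled `.pyc` files per top-level package under a `site-packages` directory. The function name `pyc_bytes_by_package` is hypothetical and not part of pandas-profiling or tangled-up-in-unicode. It sums the file sizes reported by `stat`, so the totals may differ somewhat from `du`, which counts disk blocks.

```python
import os
from collections import defaultdict
from pathlib import Path


def pyc_bytes_by_package(site_packages):
    """Return {top-level entry name: total size in bytes of its .pyc files}.

    Hypothetical helper for illustration; `site_packages` is the path to
    a site-packages directory such as venv/lib/python3.10/site-packages.
    """
    root = Path(site_packages)
    totals = defaultdict(int)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if not name.endswith(".pyc"):
                continue
            path = Path(dirpath) / name
            # The first path component below site-packages identifies the
            # package (or "__pycache__" for top-level single-file modules).
            package = path.relative_to(root).parts[0]
            totals[package] += path.stat().st_size
    return dict(totals)
```

Calling `pyc_bytes_by_package("venv/lib/python3.10/site-packages")` and sorting the result by value should surface the same directories that dominate the `du` listing above.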

Expected Behaviour

A smaller unicode database :)

Data Description

N/A

Code that reproduces the bug

No response

pandas-profiling version

3.3.0

Dependencies

tangled-up-in-unicode 0.2.0

OS

macOS 12.5.1 + Linux

Checklist

  • There is not yet another bug report for this issue in the issue tracker
  • The problem is reproducible from this bug report. This guide can help to craft a minimal bug report.
  • The issue has not been resolved by the entries listed under Common Issues.

Julian commented Sep 23, 2022

(CC @cfbolz)

@sbrugman sbrugman added dependencies 🔗 Pull requests that update a dependency file and removed needs-triage labels Sep 25, 2022
@sbrugman sbrugman self-assigned this Sep 25, 2022
sbrugman (Collaborator) commented

Hi @Julian, will look into this

akx (Contributor) commented Sep 29, 2022

fabclmnt (Contributor) commented

Waiting for visions release.

Development closed by dependency #1070.
