-
-
Notifications
You must be signed in to change notification settings - Fork 18k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Initial Backport of string changes for 2.3 release #59513
Commits on Oct 3, 2024
-
PDEP-14: Dedicated string data type for pandas 3.0 (pandas-dev#58551)
Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com> Co-authored-by: Irv Lustig <irv@princeton.com> Co-authored-by: William Ayd <william.ayd@icloud.com> Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com> Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for b1b8eed - Browse repository at this point
Copy the full SHA b1b8eedView commit details -
TST / string dtype: add env variable to enable future_string and add …
…test build (pandas-dev#58459)
Configuration menu - View commit details
-
Copy full SHA for 5778049 - Browse repository at this point
Copy the full SHA 5778049View commit details
Commits on Oct 7, 2024
-
Configuration menu - View commit details
-
Copy full SHA for a494ed8 - Browse repository at this point
Copy the full SHA a494ed8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 06dbb7a - Browse repository at this point
Copy the full SHA 06dbb7aView commit details -
String dtype: rename the storage options and add
na_value
keyword i……n `StringDtype()` (pandas-dev#59330) * rename storage option and add na_value keyword * update init * fix propagating na_value to Array class + fix some tests * fix more tests * disallow pyarrow_numpy as option + fix more cases of checking storage to be pyarrow_numpy * restore pyarrow_numpy as option for now * linting * try fix typing * try fix typing * fix dtype equality to take into account the NaN vs NA * fix pickling of dtype * fix test_convert_dtypes * update expected result for dtype='string' * suppress typing error with _metadata attribute
Configuration menu - View commit details
-
Copy full SHA for 925c21c - Browse repository at this point
Copy the full SHA 925c21cView commit details -
TST (string dtype): xfail all currently failing tests with future.inf…
…er_string (pandas-dev#59329) * TST (string dtype): xfail all currently failing tests with future.infer_string * more xfails * more xfails * add missing strict=False * also run slow and single cpu tests * fix single_cpu tests * xfail some slow tests * stop suppressing non-zero exit code from pytest on string CI build * remove accidentally added xlsx file --------- Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 431b246 - Browse repository at this point
Copy the full SHA 431b246View commit details -
TST (string dtype): follow-up on pandas-devGH-59329 fixing new xfails (…
…pandas-dev#59352) * TST (string dtype): follow-up on pandas-devGH-59329 fixing new xfails * add missing strict
Configuration menu - View commit details
-
Copy full SHA for 6882ef9 - Browse repository at this point
Copy the full SHA 6882ef9View commit details -
TST (string dtype): change any_string_dtype fixture to use actual dty…
…pe instances (pandas-dev#59345) * TST (string dtype): change any_string_dtype fixture to use actual dtype instances * avoid pyarrow import error during test collection * fix dtype equality in case pyarrow is not installed * keep using mode.string_storage as default for NA variant + more xfails * fix test_series_string_inference_storage_definition * remove no longer necessary xfails --------- Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 99ebd18 - Browse repository at this point
Copy the full SHA 99ebd18View commit details -
TST (string dtype): remove usage of arrow_string_storage fixture (pan…
…das-dev#59368) * TST (string dtype): remove usage of arrow_string_storage fixture * fixup
Configuration menu - View commit details
-
Copy full SHA for 1566042 - Browse repository at this point
Copy the full SHA 1566042View commit details -
TST (string dtype): replace string_storage fixture with explicit stor…
…age/na_value keyword arguments for dtype creation (pandas-dev#59375)
Configuration menu - View commit details
-
Copy full SHA for 1d77d0e - Browse repository at this point
Copy the full SHA 1d77d0eView commit details -
String dtype: restrict options.mode.string_storage to python|pyarrow …
…(remove pyarrow_numpy) (pandas-dev#59376) * String dtype: restrict options.mode.string_storage to python|pyarrow (remove pyarrow_numpy) * add type annotation
Configuration menu - View commit details
-
Copy full SHA for 2465a6d - Browse repository at this point
Copy the full SHA 2465a6dView commit details -
API/TST: expand tests for string any/all reduction + fix pyarrow-base…
…d implementation (pandas-dev#59414)
Configuration menu - View commit details
-
Copy full SHA for 35ebe68 - Browse repository at this point
Copy the full SHA 35ebe68View commit details -
String dtype: implement object-dtype based StringArray variant with N…
…umPy semantics (pandas-dev#58451) Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 463fd91 - Browse repository at this point
Copy the full SHA 463fd91View commit details -
REF (string dtype): de-duplicate _str_map methods (pandas-dev#59443)
* REF: de-duplicate _str_map methods * mypy fixup
Configuration menu - View commit details
-
Copy full SHA for 397cb09 - Browse repository at this point
Copy the full SHA 397cb09View commit details -
String dtype: use 'str' string alias and representation for NaN-varia…
…nt of the dtype (pandas-dev#59388)
Configuration menu - View commit details
-
Copy full SHA for dd2680c - Browse repository at this point
Copy the full SHA dd2680cView commit details -
String dtype: fix alignment sorting in case of python storage (pandas…
…-dev#59448) * String dtype: fix alignment sorting in case of python storage * add test
Configuration menu - View commit details
-
Copy full SHA for a9fd6f1 - Browse repository at this point
Copy the full SHA a9fd6f1View commit details -
TST (string dtype): add test build with future strings enabled withou…
…t pyarrow (pandas-dev#59437) * TST (string dtype): add test build with future strings enabled without pyarrow * ensure the build doesn't override the default ones * uninstall -> remove * avoid jobs with same env being cancelled * use different python version for both future jobs * add some xfails * fixup xfails * less strict
Configuration menu - View commit details
-
Copy full SHA for bf7fb01 - Browse repository at this point
Copy the full SHA bf7fb01View commit details -
REF (string dtype): de-duplicate _str_map (2) (pandas-dev#59451)
* REF (string): de-duplicate _str_map (2) * mypy fixup
Configuration menu - View commit details
-
Copy full SHA for 81850c8 - Browse repository at this point
Copy the full SHA 81850c8View commit details -
REF (string): de-duplicate str_map_nan_semantics (pandas-dev#59464)
REF: de-duplicate str_map_nan_semantics
Configuration menu - View commit details
-
Copy full SHA for 078c5a0 - Browse repository at this point
Copy the full SHA 078c5a0View commit details -
BUG (string dtype): convert dictionary input to materialized string a…
…rray in ArrowStringArray constructor (pandas-dev#59479)
Configuration menu - View commit details
-
Copy full SHA for fdbd473 - Browse repository at this point
Copy the full SHA fdbd473View commit details -
String dtype: fix convert_dtypes() to convert NaN-string to NA-string (…
…pandas-dev#59470) * String dtype: fix convert_dtypes() to convert NaN-string to NA-string * fix CoW tracking for conversion to python storage strings * remove xfails
Configuration menu - View commit details
-
Copy full SHA for 2346acf - Browse repository at this point
Copy the full SHA 2346acfView commit details -
String dtype: honor mode.string_storage option (and change default to…
… None) (pandas-dev#59488) * String dtype: honor mode.string_storage option (and change default to None) * fix test + explicitly test default * use 'auto' instead of None
Configuration menu - View commit details
-
Copy full SHA for 1bd3ce8 - Browse repository at this point
Copy the full SHA 1bd3ce8View commit details -
BUG (string): ArrowEA comparisons with mismatched types (pandas-dev#5…
…9505) * BUG: ArrowEA comparisons with mismatched types * move whatsnew * GH ref
Configuration menu - View commit details
-
Copy full SHA for 7e50b16 - Browse repository at this point
Copy the full SHA 7e50b16View commit details -
Configuration menu - View commit details
-
Copy full SHA for fa14a19 - Browse repository at this point
Copy the full SHA fa14a19View commit details -
Configuration menu - View commit details
-
Copy full SHA for 036e9da - Browse repository at this point
Copy the full SHA 036e9daView commit details -
TST (string dtype): fix IO dtype_backend tests for storage of str dty…
…pe of columns' Index (pandas-dev#59509)
Configuration menu - View commit details
-
Copy full SHA for 4d26bed - Browse repository at this point
Copy the full SHA 4d26bedView commit details -
REF (string): Move StringArrayNumpySemantics methods to base class (p…
…andas-dev#59514) * REF (string): Move StringArrayNumpySemantics methods to base class * mypy fixup
Configuration menu - View commit details
-
Copy full SHA for 31153c1 - Browse repository at this point
Copy the full SHA 31153c1View commit details -
REF (string): remove _str_na_value (pandas-dev#59515)
* REF (string): remove _str_na_value * mypy fixup
Configuration menu - View commit details
-
Copy full SHA for 721bf1e - Browse repository at this point
Copy the full SHA 721bf1eView commit details -
REF (string): move ArrowStringArrayNumpySemantics methods to base cla…
…ss (pandas-dev#59501) * REF: move ArrowStringArrayNumpySemantics methods to parent class * REF: move methods to ArrowStringArray * mypy fixup * Fix incorrect double-unpacking * move methods to subclass
Configuration menu - View commit details
-
Copy full SHA for ceee52d - Browse repository at this point
Copy the full SHA ceee52dView commit details -
API (string): return str dtype for .dt methods, DatetimeIndex methods (…
…pandas-dev#59526) * API (string): return str dtype for .dt methods, DatetimeIndex methods * mypy fixup
Configuration menu - View commit details
-
Copy full SHA for 38f5b61 - Browse repository at this point
Copy the full SHA 38f5b61View commit details -
Configuration menu - View commit details
-
Copy full SHA for a35481f - Browse repository at this point
Copy the full SHA a35481fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 172af49 - Browse repository at this point
Copy the full SHA 172af49View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7946df1 - Browse repository at this point
Copy the full SHA 7946df1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6909c47 - Browse repository at this point
Copy the full SHA 6909c47View commit details -
Configuration menu - View commit details
-
Copy full SHA for b70cd48 - Browse repository at this point
Copy the full SHA b70cd48View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1718e4b - Browse repository at this point
Copy the full SHA 1718e4bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3467d26 - Browse repository at this point
Copy the full SHA 3467d26View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9142e5e - Browse repository at this point
Copy the full SHA 9142e5eView commit details -
String dtype: still return nullable NA-variant in object inference (`…
…maybe_converts_object`) if requested (pandas-dev#59487) * String dtype: maybe_converts_object give precedence to nullable dtype * update datetimelike input validation * update tests and remove xfails * explicitly test pd.array() behaviour (remove xfail) * fixup allow_2d * undo changes related to datetimelike input validation * fix test for str on current main --------- Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for b61bd23 - Browse repository at this point
Copy the full SHA b61bd23View commit details -
Configuration menu - View commit details
-
Copy full SHA for c3d3980 - Browse repository at this point
Copy the full SHA c3d3980View commit details -
Configuration menu - View commit details
-
Copy full SHA for e3728c7 - Browse repository at this point
Copy the full SHA e3728c7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 732aa90 - Browse repository at this point
Copy the full SHA 732aa90View commit details -
Configuration menu - View commit details
-
Copy full SHA for 66e26d1 - Browse repository at this point
Copy the full SHA 66e26d1View commit details -
Configuration menu - View commit details
-
Copy full SHA for e9806c1 - Browse repository at this point
Copy the full SHA e9806c1View commit details -
Configuration menu - View commit details
-
Copy full SHA for db9aa77 - Browse repository at this point
Copy the full SHA db9aa77View commit details -
Configuration menu - View commit details
-
Copy full SHA for b3257e7 - Browse repository at this point
Copy the full SHA b3257e7View commit details -
Configuration menu - View commit details
-
Copy full SHA for cecef0e - Browse repository at this point
Copy the full SHA cecef0eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4c0d118 - Browse repository at this point
Copy the full SHA 4c0d118View commit details -
Configuration menu - View commit details
-
Copy full SHA for fc6bd39 - Browse repository at this point
Copy the full SHA fc6bd39View commit details -
Configuration menu - View commit details
-
Copy full SHA for bae9be1 - Browse repository at this point
Copy the full SHA bae9be1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 94b797d - Browse repository at this point
Copy the full SHA 94b797dView commit details -
Configuration menu - View commit details
-
Copy full SHA for a10c5c0 - Browse repository at this point
Copy the full SHA a10c5c0View commit details