API/ENH: str.split should return a DataFrame #8428

jreback · 2014-09-30T17:06:42Z

I find the behavior of str.split a bit odd; by default it should just return a DataFrame (or maybe there should be a new function / option). It's straightforward to coerce the result, but this could/should be done internally.

(and str.extract does return a DataFrame IIRC)

In [22]: s
Out[22]: 
0            apple
1    apple, orange
2           orange
dtype: object

In [23]: s.str.split(',\s+')
Out[23]: 
0            [apple]
1    [apple, orange]
2           [orange]
dtype: object

In [24]: s.str.split(',\s+').apply(Series)
Out[24]: 
        0       1
0   apple     NaN
1   apple  orange
2  orange     NaN
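
For anyone who wants to reproduce the session above, here is a minimal sketch rather than a definitive recipe. It assumes a pandas version in which `Series.str.split` accepts the `expand` keyword, which is not the `return_type` option discussed in this thread; the variable names are illustrative only.

```python
import pandas as pd

s = pd.Series(["apple", "apple, orange", "orange"])
pattern = r",\s+"  # raw string so \s reaches the regex engine unchanged

# Workaround from this thread: split into lists, then widen each list into a row.
via_apply = s.str.split(pattern).apply(pd.Series)

# Assumption: this pandas version supports expand=True on str.split,
# which returns a DataFrame directly instead of a Series of lists.
via_expand = s.str.split(pattern, expand=True)

print(via_apply)
print(via_expand)
```

Both calls should give a frame with one column per split piece; short rows are padded with a missing value, whose exact representation (NaN or None) may differ between the two approaches and across pandas versions.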


jorisvandenbossche · 2014-09-30T22:20:24Z

+1, I was thinking the same when reviewing the docs PR for this. But I think it should be an option, for backwards compatibility.

jreback added the Enhancement, API Design, and Strings labels Sep 30, 2014

jreback added this to the 0.15.1 milestone Sep 30, 2014

jreback added the Good as first PR label Sep 30, 2014

jreback mentioned this issue Oct 9, 2014

Feature request: Series.flatmap, DataFrame.flatmap #8517

Closed

billletson mentioned this issue Oct 28, 2014

ENH: Series.str.split can return a DataFrame instead of Series of lists #8663

Merged

jreback modified the milestones: 0.15.1, 0.16.0 Oct 29, 2014

jorisvandenbossche closed this as completed in #8663 Oct 29, 2014

billletson mentioned this issue Oct 30, 2014

API: change default return_type of Series.str.split from series to frame #8677

Closed

jreback modified the milestones: 0.15.2, 0.15.1 Oct 30, 2014
