BUG: different apply function behavior when columns with type Timestamp present #17602

xvwei1989 · 2017-09-20T06:15:00Z

Code Sample, a copy-pastable example if possible

# Your code here
import pandas as pd
df = pd.DataFrame([[1,2],[1,2]],columns=['a','b'])
print df.apply(lambda x: {'s':x['a']+x['b']},1)
################
# (AS EXPECTED)
# output: 
# 0    {u's': 3}
# 1    {u's': 3}
# dtype: object
################

# add one new column with type Timestamp
df['tm'] = [pd.Timestamp('2017-05-01 00:00:00'),pd.Timestamp('2017-05-02 00:00:00')]
print df.apply(lambda x: {'s':x['a']+x['b']},1)

################
#(WRONG OUTPUT)
# output: 
#       a  b   tm
# 0   NaN NaN NaN
# 1   NaN NaN NaN
################

Problem description

when the return type of apply function is dict, if a new column with type Timestamp is added to the dataframe, the result will be unexpected even if the apply function is unchanged

Output of `pd.show_versions()`

commit: None
python: 2.7.13.final.0
python-bits: 64
OS: Darwin
OS-release: 14.5.0
machine: x86_64
processor: i386
byteorder: little
LC_ALL: None
LANG: zh_CN.UTF-8
LOCALE: None.None

pandas: 0.20.3
pytest: None
pip: 9.0.1
setuptools: 36.0.1
Cython: 0.25.2
numpy: 1.13.1
scipy: 0.19.1
xarray: None
IPython: 5.4.1
sphinx: None
patsy: 0.4.1
dateutil: 2.6.1
pytz: 2017.2
blosc: None
bottleneck: None
tables: None
numexpr: None
feather: None
matplotlib: 2.0.2
openpyxl: None
xlrd: 1.0.0
xlwt: None
xlsxwriter: 0.9.6
lxml: None
bs4: None
html5lib: 0.999999999
sqlalchemy: 1.1.14
pymysql: 0.7.11.None
psycopg2: None
jinja2: 2.9.6
s3fs: None
pandas_gbq: None
pandas_datareader: None

The text was updated successfully, but these errors were encountered:

jreback · 2017-09-20T10:09:56Z

duplicate of #16353 and #15628

.apply infers the output dimension based on what you are returning, which looks exactly like a Series. This is not idiomatic pandas, not to mention non-performant.

closes pandas-dev#16353 closes pandas-dev#17348 closes pandas-dev#17437 closes pandas-dev#18573 closes pandas-dev#17970 closes pandas-dev#17892 closes pandas-dev#17602

closes pandas-dev#16353 closes pandas-dev#17348 closes pandas-dev#17437 closes pandas-dev#18573 closes pandas-dev#17970 closes pandas-dev#17892 closes pandas-dev#17602 closes pandas-dev#18775

closes pandas-dev#16353 closes pandas-dev#17348 closes pandas-dev#17437 closes pandas-dev#18573 closes pandas-dev#17970 closes pandas-dev#17892 closes pandas-dev#17602 closes pandas-dev#18775 closes pandas-dev#18901 closes pandas-dev#18919

closes #16353 closes #17348 closes #17437 closes #18573 closes #17970 closes #17892 closes #17602 closes #18775 closes #18901 closes #18919

…-dev#18577) closes pandas-dev#16353 closes pandas-dev#17348 closes pandas-dev#17437 closes pandas-dev#18573 closes pandas-dev#17970 closes pandas-dev#17892 closes pandas-dev#17602 closes pandas-dev#18775 closes pandas-dev#18901 closes pandas-dev#18919

jreback closed this as completed Sep 20, 2017

jreback added Duplicate Report Duplicate issue or pull request Apply Apply, Aggregate, Transform, Map Reshaping Concat, Merge/Join, Stack/Unstack, Explode labels Sep 20, 2017

jreback added this to the No action milestone Sep 20, 2017

jreback mentioned this issue Nov 30, 2017

API/BUG: .apply will correctly infer output shape when axis=1 #18577

Merged

jreback modified the milestones: No action, 0.22.0 Nov 30, 2017

jorisvandenbossche pushed a commit that referenced this issue Feb 7, 2018

API/BUG: .apply will correctly infer output shape when axis=1 (#18577)

6b0c7e7

closes #16353 closes #17348 closes #17437 closes #18573 closes #17970 closes #17892 closes #17602 closes #18775 closes #18901 closes #18919

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: different apply function behavior when columns with type Timestamp present #17602

BUG: different apply function behavior when columns with type Timestamp present #17602

xvwei1989 commented Sep 20, 2017

jreback commented Sep 20, 2017

BUG: different apply function behavior when columns with type Timestamp present #17602

BUG: different apply function behavior when columns with type Timestamp present #17602

Comments

xvwei1989 commented Sep 20, 2017

Code Sample, a copy-pastable example if possible

Problem description

Output of pd.show_versions()

jreback commented Sep 20, 2017

Output of `pd.show_versions()`