API: remove the table keyword, replaced by fmt='s|t' #4645

jreback · 2013-08-22T22:05:30Z

API: the fmt keyword now replaces the table keyword; allowed values are s|t
the same defaults as prior < 0.13.0 remain, e.g. put implies 's' (Storer) format
and append imples 't' (Table) format

closes #4584 as well

jreback · 2013-08-22T22:31:31Z

cc @michaelaye, cc @Meteore, cc @bluefir

since you guys have given comments recently...any thoughts on this API change?

bluefir · 2013-08-23T01:12:16Z

Looks good to me!

…nd and table

…es are ``s|t`` the same defaults as prior < 0.13.0 remain, e.g. ``put`` implies 's' (Storer) format and ``append`` imples 't' (Table) format

API: remove the table keyword, replaced by fmt='s|t'

michaelaye · 2013-08-26T16:20:35Z

Sorry for my silence, I was in Yellowstone completely offline! ;) I haven't used this functionality, what I am in general worried about is that knowledge of pytables becomes more and more a requirement for using pandas properly, at least for the hdf functionality. One could argue that data people need to deal with it in any case but I am loving pandas so much because it integrates many other python libraries seemlessly. In this case I wouldn't know what a 'storer' really is, apart from it's shown usage in the docs. Maybe my worries could be nullyfied by a helpful intro paragraph, unless that already exists and my 2 weeks absence made me miss it.

jreback · 2013-08-26T16:25:49Z

http://pandas.pydata.org/pandas-docs/dev/io.html#storer-format

if you can thing of a better name that storer let me know)

you don't need knowledge of the internals just reading the docs for various formats that u can store

lmk if this is still unclear

alvorithm · 2013-08-27T08:13:38Z

Sorry, was offline as well.

The new convention seems a bit more cryptic, but I have no other objection, and no code depending on the old one (but I am planning on using the HDF IO very soon). The name 'storer' could be substituted by something more indicative of what it is as opposed to 'table' (that also is a storer in a wide sense), though admittedly that may involve mentioning some pytables|hdf5-specific lingo.

Not yet landed home, just a first impression from a quick glance. Grain of salt.

michaelaye · 2013-08-27T16:18:02Z

maybe 'fixed'(format) vs 'table' ?

michaelaye · 2013-08-27T22:33:19Z

Reading a bit further, I definitely agree with the API change due to my feeling that there is no natural preference between one or the other format, something that easily could be presumed using booleans as switch.
I am wondering though, could it be benefitial to offer 2 more wrapper calls that imply the respective format setting? Something like to_hdfixed() (or hdstorer()) and to_hdtable() maybe? Or is this cluttering the API too much?

jreback · 2013-08-27T23:03:02Z

I like your suggestion - going to go with

format=fixed(f) | table(t)

I'll changed all the storer refs to fixed

I don't think should make additional to_hdf methods to much clutter ; and I think it makes sense to have a default of format=fixed (which is the equivalent of table=False)

michaelaye · 2013-08-27T23:11:08Z

and I think it makes sense to have a default of format=fixed (which is the equivalent of table=False)

Really? I would have thought that most users expect the table to be append-able? Something along the credo of 'functionality before speed', so let the hardcore user that requires speed find out about the non-default setting?

jreback · 2013-08-27T23:52:11Z

the reason fixed is the default is just back compat (HDFStore originally started with just a fixed type)

what about an option setting eg

io.hdf_format = fixed (but you can changed the default to table)

then to_hdf will respect a passed format but default o the option setting?

michaelaye · 2013-08-28T00:04:17Z

I like that!

jtratner · 2013-08-28T03:08:33Z

I also find the behavior of HDFStore confusing to understand. What happens
if you get table instead of storer? If the only difference is a performance
hit, then maybe you could consider changing the default? Global default is
nice (though maybe it should be set via a method instead? (since I don't
think you can have module-level properties...)

jreback · 2013-08-28T03:23:31Z

tables are fundamentally different than fixed

they can be appended and queried (via expression)

see put vs append

the default is for back compat

they are two different storage back ends

think hard disk vs tape (not a great analogy because fixed are much faster)

PyTables supports many different types of storage formats (because HDF5 does)

the impetuous for the format parameter in general is really to support a new table type at some point

ctable - or column oriented tables

the user has to select a backend at creation time and they each have fundamental different access patterns and perf characteristics and so can/should be used in diverse situations

u basically pick the format depending in the problem

…ing format= pandas-dev#4645

jreback added 3 commits August 23, 2013 20:13

BUG/API: (GH4584) to_hdf was raising when passing both arguments appe…

7d636b0

…nd and table

CLN: pep8 pandas/io/pytables

a3abf80

API: the fmt keyword now replaces the table keyword; allowed valu…

952a342

…es are ``s|t`` the same defaults as prior < 0.13.0 remain, e.g. ``put`` implies 's' (Storer) format and ``append`` imples 't' (Table) format

jreback added a commit that referenced this pull request Aug 26, 2013

Merge pull request #4645 from jreback/hdf_api

49a21db

API: remove the table keyword, replaced by fmt='s|t'

jreback merged commit 49a21db into pandas-dev:master Aug 26, 2013

jreback mentioned this pull request Aug 31, 2013

API: change nomeclature in HDFStore to use format=fixed(f) | table(t) #4715

Merged

This was referenced Apr 3, 2014

DEPR: create issues for the current FutureWarnings in pandas #6641

Closed

Remove number of deprecated parameters/functions/classes [fix #6641] #6813

Merged

DEPR: Clean up list of deprecations from prior versions #6581

Closed

jreback mentioned this pull request Aug 23, 2015

DEPR: Bunch o deprecation removals part 2 #10892

Merged

jreback added a commit to jreback/pandas that referenced this pull request Aug 24, 2015

DEPR: Remove the table keyword in HDFStore.put/append, in favor of us…

69271b3

…ing format= pandas-dev#4645

jreback mentioned this pull request Jul 24, 2016

DEPR: deprecations log for removed issues #13777

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API: remove the table keyword, replaced by fmt='s|t' #4645

API: remove the table keyword, replaced by fmt='s|t' #4645

jreback commented Aug 22, 2013

jreback commented Aug 22, 2013

bluefir commented Aug 23, 2013

michaelaye commented Aug 26, 2013

jreback commented Aug 26, 2013

alvorithm commented Aug 27, 2013

michaelaye commented Aug 27, 2013

michaelaye commented Aug 27, 2013

jreback commented Aug 27, 2013

michaelaye commented Aug 27, 2013

jreback commented Aug 27, 2013

michaelaye commented Aug 28, 2013

jtratner commented Aug 28, 2013

jreback commented Aug 28, 2013

API: remove the table keyword, replaced by fmt='s|t' #4645

API: remove the table keyword, replaced by fmt='s|t' #4645

Conversation

jreback commented Aug 22, 2013

jreback commented Aug 22, 2013

bluefir commented Aug 23, 2013

michaelaye commented Aug 26, 2013

jreback commented Aug 26, 2013

alvorithm commented Aug 27, 2013

michaelaye commented Aug 27, 2013

michaelaye commented Aug 27, 2013

jreback commented Aug 27, 2013

michaelaye commented Aug 27, 2013

jreback commented Aug 27, 2013

michaelaye commented Aug 28, 2013

jtratner commented Aug 28, 2013

jreback commented Aug 28, 2013