ENH: Enable unary math operations for pandas, sqlite #1071

cpcloud · 2017-07-19T21:56:11Z

Implement decimal for pandas
Add SQLite unary ops
Fix operations in postgres that require numeric

cpcloud · 2017-07-21T15:38:09Z

@wesm can you take a look here? bit of a rabbit hole with decimals, otherwise just adding unary ops to series.

wesm

Minor comments, but otherwise LGTM

wesm · 2017-07-24T01:09:20Z

ibis/pandas/client.py

+        if column_name in schema:
+            ibis_type = dt.validate_type(schema[column_name])
+        elif dtype == np.object_:
+            inferred_dtype = infer_dtype(df[column_name].dropna())


Yikes. I guess we should make a NaN-friendly type inference function someplace (seems like an oversight in infer_dtype originally)

can u post an issue in pandas tracker about this

done: pandas-dev/pandas#17059

I PR'd it :) pandas-dev/pandas#17066.

wesm · 2017-07-24T01:12:04Z

ibis/pandas/execution.py

+def execute_series_unary_op(op, data, scope=None):
+    function = getattr(np, type(op).__name__.lower())
+    if data.dtype == np.dtype(np.object_):
+        return data.apply(functools.partial(execute_node, op, scope=scope))


Is Series.apply different from Series.map (in behavior or performance)?

Don't think so, @jreback any idea here?

So, it looks like Series.map accepts a dict, to support a simple CASE-statement-like operation, as well as callables, whereas apply only deals with callables. For callables, both methods use the same underlying function lib.map_infer to call the passed in callable in a Cython loop. Here we could use either since we're only dealing with callables.

wesm · 2017-07-24T01:13:42Z

ibis/pandas/execution.py

+def execute_series_log_with_base(op, data, base, scope=None):
+    if data.dtype == np.dtype(np.object_):
+        func = np.vectorize(functools.partial(execute_node, op, scope=scope))
+        return pd.Series(func(data, base), index=data.index, name=data.name)


Perhaps this bit could be factored out into a helper function since it's repeated a couple times (in case it's useful in future execution rules)

Yeah, good idea!

wesm · 2017-07-24T01:16:04Z

ibis/sql/postgres/compiler.py

+
+def _floor_divide(t, expr):
+    left, right = map(t.translate, expr.op().args)
+    return sa.func.floor(left / right)


Does integer division in postgres yield doubles?

No, it yields integers. This is implemented so that it works regardless of the type of left and right.

Got it, I was just curious =)

Implement decimal for pandas Add SQLite unary ops Fix operations in postgres that require numeric

cpcloud · 2017-07-27T17:06:08Z

Merging on green.

cpcloud force-pushed the unary-ops branch 3 times, most recently from 0dc84c3 to 13a0422 Compare July 20, 2017 00:30

cpcloud self-assigned this Jul 20, 2017

cpcloud added the feature Features or general enhancements label Jul 20, 2017

cpcloud added this to the 0.11.3 milestone Jul 20, 2017

cpcloud force-pushed the unary-ops branch 5 times, most recently from d6c9085 to 3322844 Compare July 21, 2017 02:39

cpcloud requested a review from wesm July 21, 2017 15:37

cpcloud force-pushed the unary-ops branch from 3322844 to e059f7f Compare July 22, 2017 18:22

wesm reviewed Jul 24, 2017

View reviewed changes

cpcloud added 2 commits July 27, 2017 13:01

ENH: Enable unary math operations for pandas, sqlite

0d863d1

Implement decimal for pandas Add SQLite unary ops Fix operations in postgres that require numeric

REF: Factor vectorize object function

d94b0c6

cpcloud force-pushed the unary-ops branch from e46867f to d94b0c6 Compare July 27, 2017 17:01

BUG: Pass args and kwargs

57ff2b1

cpcloud closed this in 9882b5a Jul 27, 2017

cpcloud deleted the unary-ops branch July 27, 2017 19:48

gerrymanoim mentioned this pull request Dec 18, 2020

Dask backend execution #2557

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Enable unary math operations for pandas, sqlite #1071

ENH: Enable unary math operations for pandas, sqlite #1071

cpcloud commented Jul 19, 2017

cpcloud commented Jul 21, 2017

wesm left a comment

wesm Jul 24, 2017

jreback Jul 24, 2017

cpcloud Jul 24, 2017

cpcloud Jul 25, 2017

wesm Jul 24, 2017

cpcloud Jul 25, 2017

cpcloud Jul 27, 2017

wesm Jul 24, 2017

cpcloud Jul 25, 2017

wesm Jul 24, 2017

cpcloud Jul 24, 2017

wesm Jul 24, 2017

cpcloud commented Jul 27, 2017

ENH: Enable unary math operations for pandas, sqlite #1071

ENH: Enable unary math operations for pandas, sqlite #1071

Conversation

cpcloud commented Jul 19, 2017

cpcloud commented Jul 21, 2017

wesm left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cpcloud commented Jul 27, 2017