Introduce shaped arg to execute_mdx_dataframe #893

MariusWirtz · 2023-04-08T22:19:21Z

Inspired by
Kevin-Dekker@8ab9901#diff-5242c6c19161de7f529a4e5b0d0b33ca9c15f69be65fa5409881b3d8310ff655R2799-R2802

execute_mdx_dataframe, execute_view_datafame can be run with shaped=True to retrieve a similar but not fully equivalent result to execute_mdx_dataframe_shaped.

This allows retrieving a shaped data frame while specifying options that are not available in execute_mdx_dataframe_shaped. One example is the usage of use_iterative_json=True and use_blob=True which optimizes memory usage during string to json / dict conversion prior to the creation of the data frame.

Fixes #888

Inspired by Kevin-Dekker@8ab9901#diff-5242c6c19161de7f529a4e5b0d0b33ca9c15f69be65fa5409881b3d8310ff655R2799-R2802

Cubewise-gtejeda · 2023-04-11T22:07:24Z

The memory footprint was much lower when trying this, confirming previous tests.
Not sure if this is just a formatting issue, but when printing out the dataframe the formats of the numbers look different:

use_iterative_json=False:

use_iterative_json=True:

The shape and order seem to be identical, which is also expected.
However, there is an additional string appearing when using use_iterative_json=True:

For existing queries, I am worried that this might break backward compatibility. Could there be an option to suppress that string?

MariusWirtz · 2023-04-11T22:22:16Z

Thanks for the review! Good catch on the index column. I will make sure we get rid of this incorrect column label.

Regarding the formatting, this is kinda expected. The data frame is built differently and now the type is inherited. In the old execute_view_dataframe_shaped function the type would always be a string. The new approach is more intelligent IMO.

The scientific notation is just a pandas thing.
You should be able to switch to normal numbers like this

import pandas as pd
pd.options.display.float_format = '{:.5f}'.format

Cubewise-gtejeda · 2023-04-11T22:34:07Z

Thanks for the update.
The datatype inheritance - although a good thing - might break compatibility with existing reports in Power BI, since users currently would change the datatype in the query. I am not sure if this would cause an error in the query, but cannot test this in the connector until the release.

Could this be a parameter as well?

Thoughts?

MariusWirtz · 2023-04-12T10:49:08Z

The datatype inheritance - although a good thing - might break compatibility with existing reports in Power BI, since users currently would change the datatype in the query. I am not sure if this would cause an error in the query, but cannot test this in the connector until the release.

It wouldn't break any existing applications, because it only applies when use_blob=True or use_iterative_json=True is passed.
I think we are on track to release to PyPI this week.

I can't reproduce the strange column header. Please post the code that produces the data frame.

MariusWirtz · 2023-04-12T14:23:19Z

The unwanted column header has been removed.

Introduce shaped arg to execute_mdx_dataframe

9056758

Inspired by Kevin-Dekker@8ab9901#diff-5242c6c19161de7f529a4e5b0d0b33ca9c15f69be65fa5409881b3d8310ff655R2799-R2802

MariusWirtz force-pushed the feature/optimize-memory-in-dataframe-shaped branch from 3e74789 to 9056758 Compare April 11, 2023 12:56

MariusWirtz marked this pull request as ready for review April 11, 2023 18:23

MariusWirtz mentioned this pull request Apr 11, 2023

Feature request: The function power_bi.execute_view should internally use the optimized version for getting data, but still retain the same shape #888

Closed

MariusWirtz force-pushed the feature/optimize-memory-in-dataframe-shaped branch from e3e4fb4 to dac6867 Compare April 12, 2023 10:43

Finish up shaped arg for _dataframe functions

e800c21

MariusWirtz force-pushed the feature/optimize-memory-in-dataframe-shaped branch from dac6867 to e800c21 Compare April 12, 2023 13:15

MariusWirtz merged commit 59947a0 into master Apr 12, 2023

MariusWirtz deleted the feature/optimize-memory-in-dataframe-shaped branch October 15, 2024 10:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce shaped arg to execute_mdx_dataframe #893

Introduce shaped arg to execute_mdx_dataframe #893

MariusWirtz commented Apr 8, 2023 •

edited

Loading

Cubewise-gtejeda commented Apr 11, 2023

MariusWirtz commented Apr 11, 2023

Cubewise-gtejeda commented Apr 11, 2023

MariusWirtz commented Apr 12, 2023 •

edited

Loading

MariusWirtz commented Apr 12, 2023

Introduce shaped arg to execute_mdx_dataframe #893

Introduce shaped arg to execute_mdx_dataframe #893

Conversation

MariusWirtz commented Apr 8, 2023 • edited Loading

Cubewise-gtejeda commented Apr 11, 2023

MariusWirtz commented Apr 11, 2023

Cubewise-gtejeda commented Apr 11, 2023

MariusWirtz commented Apr 12, 2023 • edited Loading

MariusWirtz commented Apr 12, 2023

MariusWirtz commented Apr 8, 2023 •

edited

Loading

MariusWirtz commented Apr 12, 2023 •

edited

Loading