Enhancement: XLSB support in read_excel() #8540
Comments
Based on https://github.com/python-excel/xlrd/issues/83, it seems that xlrd won't have XLSB support for a while (if ever). The only liberally licensed tool for XLSB support is https://github.com/SheetJS/js-xlsx, which is written in JavaScript but ships with a Node.js-powered script that can be run from the command line. |
I found that link too, but I was hoping I could compile a command line utility that I would call from within Python instead of having to run a node server to execute the conversions. |
@kevindavenport you need to install node but don't need to run it as a server. It's like running a PHP script with the PHP CLI |
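To illustrate the CLI point: a converter can be launched once per file from Python with `subprocess`, with no server process involved. This is a minimal sketch, assuming the tool writes CSV to standard output. The js-xlsx command mentioned in the comment is an assumption, and a Python one-liner stands in for it so the snippet runs without Node.

```python
import csv
import io
import subprocess
import sys


def run_converter(command):
    """Run a command-line converter once and parse its stdout as CSV rows."""
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return list(csv.reader(io.StringIO(result.stdout)))


# Hypothetical real invocation (check the tool's own help for the exact form):
#   rows = run_converter(["xlsx", "book.xlsb"])
# Stand-in command that prints two CSV lines, so the sketch is runnable as-is.
stand_in = [sys.executable, "-c", "print('a,b'); print('1,2')"]
rows = run_converter(stand_in)
```

`check=True` raises `subprocess.CalledProcessError` if the converter exits with a non-zero status, so a failed conversion is not silently read as an empty sheet.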
Am I doing something wrong then by
|
@kevindavenport If you downloaded from source directly, you need to run If you run
|
Would this library help in the implementation of this feature? https://pypi.org/project/pyxlsb/ See the following solution on Stack Overflow: Being able to do this directly from pandas would be great. |
PRs would be welcome |
I would love to take a crack at it, but whose endorsement do we need before spending the time trying to integrate pyxlsb natively into pandas? |
@kevindavenport : Hey there, sorry that this conversation suddenly went dark. We are more than open to an implementation / PR at this point. If you have time / able, just go for it! |
@gfyoung : So would integration of pyxlsb, as @velxundussa suggested, be an acceptable solution? |
@talamb : If you can implement and submit as a PR, we will definitely take a look. |
@talamb if you're interested in trying a PR, you might want to take a look at #25427 and #25092, which added reading support for other formats. In a nutshell, for this you would want to copy the existing test files to .xlsb format and add the appropriate parametrization in the test_readers.py module. Then subclass |
@WillAyd Thanks! Should be seeing a PR from me in the near future. |
Is this issue fixed? |
@praful-potphode We have a PR open (#29836) that is trying to address this issue. If you have any thoughts on pushing that PR forward, that would be great! |
openpyxl and xlrd do not support XLSB. I'm curious if anyone has taken a look at integrating (more like creating) the functionality into pandas. Looks like it could be a Python package in itself.
Spec from Microsoft:
http://msdn.microsoft.com/en-us/library/cc313133(v=office.12).aspx
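For context on what such a package would have to parse: an XLSB workbook is a ZIP package whose parts are binary record streams, not XML. The sketch below is a rough detection heuristic only. It assumes the workbook part uses the conventional name `xl/workbook.bin`, which the package's relationships could in principle override, and the function name is made up for illustration.

```python
import io
import zipfile

# Conventional part name; an assumption, since relationships define the real one.
WORKBOOK_PART = "xl/workbook.bin"


def looks_like_xlsb(fileobj):
    """Heuristic check: is this a ZIP package containing a binary workbook part?"""
    if not zipfile.is_zipfile(fileobj):
        return False
    fileobj.seek(0)
    with zipfile.ZipFile(fileobj) as package:
        return WORKBOOK_PART in package.namelist()


def make_package(part_name):
    """Build a tiny in-memory ZIP with a single part, for demonstration only."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as package:
        package.writestr(part_name, b"")
    buffer.seek(0)
    return buffer
```

This only identifies the container. Decoding the records inside the `.bin` parts is the substantial work the Microsoft spec describes.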