Benchmarks for disk.frame. So far, Vaex is the only other library benchmarked alongside it.
This repo contains a Dockerfile that downloads the 1-year Fannie Mae data and then attempts to run some benchmarks on it.
Currently, only the CSV read benchmark is implemented.
This runs only on Ubuntu-like systems, including Windows WSL.
- Clone this repo and navigate to the directory
- Build the docker image
    ./build.sh
- Run Jupyter notebook from docker
    ./run.sh
- From Jupyter, navigate to the notebooks folder and run diskframe.ipynb and test-vaex.ipynb
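
For readers who want a feel for what a CSV read benchmark measures before running the notebooks, here is a minimal sketch in plain Python. It is not the code from diskframe.ipynb or test-vaex.ipynb: it uses only the standard library, a small synthetic comma-separated file instead of the Fannie Mae data, and hypothetical helper names (write_sample_csv, time_csv_read, run_demo) chosen for illustration.

```python
import csv
import os
import tempfile
import time


def write_sample_csv(path, n_rows):
    """Write a small synthetic CSV (a stand-in, not the Fannie Mae data)."""
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        for i in range(n_rows):
            writer.writerow([i, f"loan_{i}", i * 1.5])


def time_csv_read(path, repeats=3):
    """Return (row_count, best_seconds) over `repeats` full reads of the file."""
    best = float("inf")
    n_rows = 0
    for _ in range(repeats):
        start = time.perf_counter()
        with open(path, newline="") as f:
            n_rows = sum(1 for _ in csv.reader(f))
        best = min(best, time.perf_counter() - start)
    return n_rows, best


def run_demo(n_rows=1000):
    """Create a temporary sample file, time reading it, then delete it."""
    fd, path = tempfile.mkstemp(suffix=".csv")
    os.close(fd)
    try:
        write_sample_csv(path, n_rows)
        return time_csv_read(path)
    finally:
        os.remove(path)


if __name__ == "__main__":
    rows, seconds = run_demo()
    print(f"read {rows} rows in {seconds:.4f} s (best of 3)")
```

Taking the best of several repeats is one common way to reduce noise from disk caching and other processes; the notebooks in this repo may measure things differently.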