re This script takes the number of excel files from a folder, reads required columns' information and aggregates the information based on a certain criteria. (eg. Company name in given data)
- Download python from https://www.python.org/downloads/ and install
- Run cmd as admin
- Run command
pip install pandas
- Run command
pip install xlrd
- Run command
pip install openpyxl
- Add variables in main.py for inputDataFilesFolder (directory where .xlxs files are present), columnsToExtract (which columns do you want to extract), sumOverColumns (Column to group by and sum), outputFilePath (where you want to generate the final output excel file)
- Run
cd <cloned_repo_code_folder>
- Run
py main.py
- Run
py excel_data_reader.py
- Make sure to close the input and output excel file windows from excel app to avoid permission denied error.
- Output would be generated at specified outputFilePath.