This repository will demonstrate my growing experience with data analysis using various applications and programming languages.
Currently, you may view the following:
Tools: Python Jupyter NotebookTechniques: data cleaning, analysis, and visualization using pandas, seaborn, numpy, and matplotlib
Dataset: https://www.kaggle.com/datasets/danielgrijalvas/movies?resource=download Tools: Python Jupyter Notebook
Techniques: data cleaning, analysis, and visualization using NLTK, lambda, seaborn, and plotly
Dataset: Tweets with Covid-19 hashtags from July 24, 2020 to August 30, 2020. Provided by a Coursera guided project. Tools: Web of Science, Sci2, OpenRefine
Techniques: .isi file conversion with Sci2, data cleaning with OpenRefine, and basic productivity charts with Power BI
Dataset: Used Web of Science database to collect 2017-2022 publication data for Tennessee Technological University. Tools: Sci2, Gephi, Inkscape
Techniques: network analysis and pruning with Sci2, and visualization with Gephi
Dataset: Used Web of Science database to collect 2017-2022 publication data for Tennessee Technological University.