Semantic Scholar Paper Data Visualizer

Extract and process academic paper data (authors, citation counts, influential citation counts, references )from any search query, cluster the papers based on their abstracts, and analyze their features using R and RShiny

1. Run code in `raw_data_creation.ipynb` to get raw data and cluster

Set up the environment first. Look at first cell for relevant code.
Set a semantic scholar search query of your choice - I used "large language models".
The embedding creation for the clustering algorithm can take some time. There is a cell in the notebook that creates the initial embeddings. The following cell uses an existing embeddings file and appends to it in case you have additional data points to add.

2. Run `RData_creation.R` to create the `.RData` file used for RShiny dashboard

3. Run `app.R` to display the RShiny dashboard

4. Deploy the dashboard to the cloud:

Create a shinyapps account here: shinyapps.io.
Follow the instructions for deployment here: Deploying to shinyapps.io.
When deploying, set the directory to the app as the folder that contains both the app.R file and the papers.RData file. I named this file LLM-papers-2023-dashboard, but you can rename it to anything else. Note that this folder name will appear in the URL of the RShiny web app. For example, mine is https://davydsadovskyy.shinyapps.io/llm-papers-2023-dashboard/.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
LLM-papers-2023-dashboard		LLM-papers-2023-dashboard
questionable_data		questionable_data
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
RData_creation.R		RData_creation.R
README.md		README.md
gpt_labels.txt		gpt_labels.txt
papers.csv		papers.csv
papers.jsonl		papers.jsonl
raw_data_creation.ipynb		raw_data_creation.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Semantic Scholar Paper Data Visualizer

1. Run code in `raw_data_creation.ipynb` to get raw data and cluster

2. Run `RData_creation.R` to create the `.RData` file used for RShiny dashboard

3. Run `app.R` to display the RShiny dashboard

4. Deploy the dashboard to the cloud:

About

Releases

Packages

Languages

License

sadovsd/semantic-scholar-visualizer

Folders and files

Latest commit

History

Repository files navigation

Semantic Scholar Paper Data Visualizer

1. Run code in raw_data_creation.ipynb to get raw data and cluster

2. Run RData_creation.R to create the .RData file used for RShiny dashboard

3. Run app.R to display the RShiny dashboard

4. Deploy the dashboard to the cloud:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

1. Run code in `raw_data_creation.ipynb` to get raw data and cluster

2. Run `RData_creation.R` to create the `.RData` file used for RShiny dashboard

3. Run `app.R` to display the RShiny dashboard

Packages