Skip to content

bfiliks/extractVariants

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

<iframe width="900" height="800" frameborder="0" scrolling="no" src="//plotly.com/~foke2/3.embed"></iframe> # Introduction to the project This is my capstone project on visualizing Dickinson's textual variants If anyone is interested in working with the Emily Dickinson Archive as data, visit https://www.edickinson.org/

Extracting Variants in EDA: A Computational Approach

When working with large data, it ususally necessary to specify what foramt/method one needs to use to pull the data from an existing database. It's easy to retreive data using API request. In this project, we've utilize the python scraping code to retrieve all files from the webiste since there was no API for the original EDA . See file under code folder for details.

Data Management Plans for EV

I tried to document in a specific way the data managment plans for this project. This provides guide in the entire project lifecylce. These include data collection, documentation, storage, sharing, and preservation.

  • Data Collection - file formats, naming conventions, version control
    • Script, data, results, docs
  • Documentation and Metadata - methodology, code, data dictionaries, metadata standard, README files
  • Storage and Backup - requirements, backup and retention schedules, access controls
    • 3-2-1 rule
  • Preservation - see https://zenodo.org/records/10316549
  • Sharing and Reuse

Visualizing Varaints in EDA

The visual representations are saved under data_visualization folder. You can also check the code folder too see and test the code on your own data or on EDA.

Conclusion

This is data visualization project examining Dickinson's textual variants. There are interesting connections and relationship that base texts share with their corresponding variants.

newplot(7)