Skip to content

Investigating the alignment of quantitative and qualitative literary analysis in American literature - UNITN

Notifications You must be signed in to change notification settings

memonji/distant-reading-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Computational Linguistics Project

CL_Montecchiari_Project.pdf

Title: Investigating the Alignment of Quantitative and Qualitative Literary Analysis in American Literature

Author: Emma Angela Montecchiari

Course: Università di Trento - Computational Linguistics 2022/23

Date: September 7, 2023

Contents

In this repository, you will find:

  • Code: The source code and scripts used in the project.
  • Data: Corpora relevant to the research.
  • Results: Graphs of the results [Appendix 2].

Project Proposal

Project Overview

This project seeks to explore the alignment between quantitative investigative techniques and qualitative analysis in the field of literary studies. Specifically, the aim is to investigate the consistency between traditionally attributed characteristics of literary movements and computational methods. The central hypothesis is that distant reading analysis can complement and validate close reading techniques, thereby revealing a potential synergy between the two approaches.

Methodology

To achieve these objectives, I have undertaken the following steps:

  1. Corpus Selection: I have compiled a diverse corpus of literary movements, selecting one for in-depth analysis.

  2. Stylistic Characterization: Within the chosen corpus, I employed stylistic characterization techniques to describe these literary movements and assess their alignment with traditional categorizations.

  3. Genre Analysis: I conducted a more refined analysis of a specific genre within the chosen literary movement and compared it to traditionally assigned characteristics. The genre chosen for a detailed comparison is American Gothic.

  4. Computational Linguistics Techniques: Computational linguistics techniques were then applied to identify similarities and differences among literary movements, broadening the scope of our analysis.

  5. Incorporating External Material: Throughout this research, I incorporated external material from classical qualitative analyses of the chosen movements, allowing for a dual perspective while predominantly applying quantitative methods.

Repository Contents

  1. Corpus Movements:

    • Plain text data selected and downloaded from Project Gutenberg. Sorted by literary movement in sub-folders. Each folder is divided by authors.
  2. TF-IDF:

    • Retrieval of the most frequent words using the TF-IDF metric.
    • Tables for plotting and visualization.
  3. Complexity Indices:

    • Metrics for measuring readability and complexity of the texts.
    • Results plotting and comparison.
  4. Cosine Similarity:

    • Code and results for computing cosine similarity matrices.
    • Comparison between and within the literary movements.
  5. Appendix 2:

    • Graphs of the results for the complexity indices.
    • Graphs of the results for the cosine similarity measures.

About

Investigating the alignment of quantitative and qualitative literary analysis in American literature - UNITN

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published