Skip to content

Latest commit

 

History

History
90 lines (54 loc) · 3.15 KB

README.md

File metadata and controls

90 lines (54 loc) · 3.15 KB

Information

Feel free to create pull requests, but do not commit subtitles !

To create a visualization :

  1. Extracts the subtitles using FFMPEG to the VTT format, due to obvious copyright problems, they can't be on the repository.
  2. Preprocess the image using a graphical tool to create a mask.
    • Black: Word cloud space
    • White: Kept as is from the image
    • Grey value: Discarded from the visualization
  3. From this mask and the words obtained from the subtitles, the script uses nltk to remove stop words, wordcloud to create a visualization and a bit of numpy image math's.

List

  1. Cowboy Bebop
  2. Neon Genesis Evangelion
  3. Darling in the Franxx
  4. Mirai Nikki
  5. Death Note
  6. Steins;Gate
  7. One-Punch Man

Cowboy Bebop

Data used:

Reddit posts: r/dataisbeautiful / r/cowboybebop

Neon Genesis Evangelion

Data used:

Reddit posts : r/dataisbeautiful r/evangelion

Darling in the Franxx

Data used:

Mirai Nikki

Data used:

Death Note

Data used:

Steins;Gate

Data used:

One-Punch Man

Data used: