Feel free to create pull requests, but do not commit subtitles !
To create a visualization :
- Extracts the subtitles using FFMPEG to the VTT format, due to obvious copyright problems, they can't be on the repository.
- Preprocess the image using a graphical tool to create a mask.
- Black: Word cloud space
- White: Kept as is from the image
- Grey value: Discarded from the visualization
- From this mask and the words obtained from the subtitles, the script uses nltk to remove stop words, wordcloud to create a visualization and a bit of numpy image math's.
- Cowboy Bebop
- Neon Genesis Evangelion
- Darling in the Franxx
- Mirai Nikki
- Death Note
- Steins;Gate
- One-Punch Man
Data used:
- English subtitles from : Cowboy Bebop (1998)
- Original image
Reddit posts: r/dataisbeautiful / r/cowboybebop
Data used:
- English subtitles from : Neon Genesis Evangelion (1995)
- Original image
Reddit posts : r/dataisbeautiful r/evangelion
Data used:
- English subtitles from : Darling in the Franxx (2018)
- Original image
Data used:
- English subtitles from : Mirai Nikki (2011)
- Original image
Data used:
- English subtitles from : Death Note (2006)
- Original image
Data used:
- English subtitles from : Steins;Gate (2009)
- Original image
Data used:
- English subtitles from : One-Punch Man season 1 (2015)
- Original image