NLP-Tutorial

Document Classification in Python

A tutorial showing how to leverage a few great libraries out there -- gensim and scikit-learn -- to not only perform document similarity queries, but document classification as well.

===== Files

corpus -- A directory of 4 tiny text files
.gitignore -- Files in repo for Git to ignore
classifier.py -- The main file that does everything
requirements.txt -- File used by pip to download dependencies

======== Download

All you need to do is clone the repo:

git clone https://github.com/Scripted/NLP-Tutorial

============ Dependencies

In a perfect world, running "pip install -r requirements.txt" should download all the dependencies necessary to run this code. Unfortunately, Numpy and Scipy don't always play nice with pip. So try "pip install -r requirements.txt" and if that doesn't work, check out the installation instructions on the modules' sites: Numpy , Scipy , Gensim , Scikit-Learn

======= Running

Easy enough:

python classifier.py

The output shows the various steps of the algorithm as it works.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP-Tutorial

===== Files

======== Download

============ Dependencies

======= Running

About

Releases

Packages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
corpus		corpus
.gitignore		.gitignore
README.md		README.md
classifier.py		classifier.py
requirements.txt		requirements.txt

shafiahmed/NLP-Tutorial

Folders and files

Latest commit

History

Repository files navigation

NLP-Tutorial

===== Files

======== Download

============ Dependencies

======= Running

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages