Spam-Classifier

This is an implementation of the Naive Bayes Classification technique as a Spam Classifier.
Coded from the scratch in Python.

Classifier Logic:

Scan all mails to get the top 3000 most used words.
Convert each mail into a feature matrix based on this.
Calculate the summary of these top 3000 words for each label.
For a given mail calculate the log gaussian probability for each class.
Label with the highest probability wins.

Each version has it's own branch.
Master is the latest version. (And possibly under development.)

~~Working on making this project more generic.~~
No more changes to the actual classifier logic will be done.
Future plans for this project include creation of a Flask based APIs that will:

Trigger creation of the class summary.
Read test emails from a specified location.

Version: V02_02

Add docstring and comments.
Optimised variable usage.
Fixed bugs.
Better logging.

Version: V02_01

Currently classification works in the specified set of mails.
Modularised source code into separate files.
Removed hard coded paths. (config.txt)

Version: V01_01:

Single script for preprocessing, training, testing.
Simple implementation I did as a part of class project during my masters degree.

Reference:

https://github.com/savanpatel
https://medium.com/machine-learning-101/chapter-1-supervised-learning-and-naive-bayes-classification-part-2-coding-5966f25f1475

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
lib		lib
test-mails		test-mails
train-mails		train-mails
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spam-Classifier

Classifier Logic:

Version: V02_02

Version: V02_01

Version: V01_01:

Reference:

About

Releases

Packages

Languages

muksiddheswar/Spam-Classifier

Folders and files

Latest commit

History

Repository files navigation

Spam-Classifier

Classifier Logic:

Version: V02_02

Version: V02_01

Version: V01_01:

Reference:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages