Email spam detection program using Multinomial Naive Bayes and TF-IDF for text preprocessing and evaluation

Mohamed1756/spamDetection

Email Spam detector

Screenshot 2023-08-24 at 18 36 53

Tools used:
• Python
• Apache SpamAssassin dataset
• scikit-learn machine learning library
• Anaconda environment

Detection process:

• Load the CSV file into a data frame.
• Preprocess the data, i.e. remove headers, convert text to lowercase, and remove special characters.
• Convert the text to TF-IDF features and train a Multinomial Naive Bayes classifier with scikit-learn.
• Split the dataset into training (80%) and testing (20%) sets.
• Evaluate performance on the test set, then try the model on actual spam emails to check its accuracy.
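
The steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the `preprocess` and `train_and_evaluate` helpers, the tiny `TOY_EMAILS` list, and the rule that headers end at the first blank line are all assumptions made for the example. In practice the emails would come from the dataset's CSV file (for instance via `pandas.read_csv`).

```python
import re

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline


def preprocess(raw_email: str) -> str:
    """Drop the header block, lowercase the text, remove special characters."""
    # Assumption: headers are separated from the body by the first blank line.
    _, sep, body = raw_email.partition("\n\n")
    text = (body if sep else raw_email).lower()
    text = re.sub(r"[^a-z0-9\s]", " ", text)   # special characters -> spaces
    return re.sub(r"\s+", " ", text).strip()   # collapse repeated whitespace


def train_and_evaluate(emails, labels, seed=42):
    """Split 80/20, fit TF-IDF + Multinomial Naive Bayes, return (model, accuracy)."""
    x_train, x_test, y_train, y_test = train_test_split(
        emails, labels, test_size=0.2, stratify=labels, random_state=seed
    )
    # The vectorizer sits inside the pipeline, so IDF weights are learned
    # from the training split only.
    model = make_pipeline(TfidfVectorizer(preprocessor=preprocess), MultinomialNB())
    model.fit(x_train, y_train)
    return model, accuracy_score(y_test, model.predict(x_test))


# Made-up illustration data (1 = spam, 0 = not spam); not from the real dataset.
TOY_EMAILS = [
    "Subject: Winner\n\nCongratulations! You WON a free prize, click here now",
    "Claim your free cash bonus today, limited offer!!!",
    "Cheap pills online, buy now and save money",
    "You have been selected for a free vacation, click to claim",
    "Urgent: verify your account to receive your prize money",
    "Subject: Meeting\n\nCan we move the project meeting to Tuesday?",
    "Please find the quarterly report attached for review",
    "Thanks for the notes, I will send my comments tomorrow",
    "Lunch on Friday? The new place near the office looks good",
    "The build is failing on the main branch, can you take a look",
]
TOY_LABELS = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]

if __name__ == "__main__":
    model, accuracy = train_and_evaluate(TOY_EMAILS, TOY_LABELS)
    print(f"Test accuracy: {accuracy:.2f}")
    print(model.predict(["Click now to claim your free prize"]))
```

The list above mentions vectorizing before splitting. The sketch fits the TF-IDF step inside a pipeline after the split, which is one common way to keep test emails from influencing the learned term weights. With a toy dataset this small the reported accuracy is not meaningful.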

Screenshot 2023-09-04 at 12 01 32
