Skip to content

This repository provides a guide to preprocess and analyze the network intrusion data set using NumPy, Pandas, and matplotlib, and implement a random forest classifier machine learning model using Scikit-learn.

Notifications You must be signed in to change notification settings

jagdish4501/Network-intrusion-Detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Network Intrusion Detection System

Developing a packet capture module:

This module captures network packets real-time and saves them to a PCAP file. This module should be able to capture packets from multiple network interfaces simultaneously

Implementing a packet parsing module:

This module reads PCAP files and parses network packets into a format that can be used by our machine learning model.

Machine Learning Algorithm

This repository contains an analysis of the network intrusion data set using a different classifier machine learning algorithm. The data set has been preprocessed and analyzed using NumPy, Pandas, and Plotly to understand the underlying patterns and relationships in the data. The random forest classifier has been implemented using Scikit-learn to predict network intrusion based on the available features.

The repository includes Jupyter notebooks that showcase the data preprocessing steps, exploratory data analysis, and the machine learning model implementation. The notebooks are documented with markdown cells to explain the thought process and reasoning behind each step.

The data preprocessing steps include handling missing values, dealing with categorical variables, and scaling the features. Exploratory data analysis is done using Plotly to create interactive visualizations that help to understand the structure of the data. The machine learning model is trained on the preprocessed data and evaluated using various performance metrics.

The goal of this repository is to provide a comprehensive guide to preprocessing and analyzing the network intrusion data set and implementing a random forest classifier machine learning model. It is intended for anyone interested in learning about data preprocessing and machine learning using Python libraries such as NumPy, Pandas, Plotly, and Scikit-learn.

Tech Stack

ML Model: numpy, pandas, plotly ,sklearn

Other Tool and library used: npcap, libcap, scapy, Apache Hadoop ( a distributed computing framework)

About

This repository provides a guide to preprocess and analyze the network intrusion data set using NumPy, Pandas, and matplotlib, and implement a random forest classifier machine learning model using Scikit-learn.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published