Skip to content

Project to predict Rain in Australia using Machine Learning. The datasets which is used is large hence Big Data Concepts and techniques have been used. Pyspark is the tool used and MongoDB is used as database

Notifications You must be signed in to change notification settings

sohail-sankanur/Machine-Learning-to-Predict-Rain

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Machine-Learning-to-Predict-Rain

Rain Prediction is a universal problem among data scientist and statisticians. Over the past 10 years there is no perfect approach as to find the possibility of rain using technology. A tool to predict rain has infinite uses. It could be used by farmers and also by Aviation industry.

After doing extensive research on the topic and finding the dataset I have laid out what I think the best approach for predicting rain using consepts of Big Data and Machine Learning.

In this project I have used Machine Learning to predict the possibility of Rain in Australia on a given day.

As the data is pretty big I have used consepts of big data in this project. Pyspark is the main tool used for management of data and analysis and MongoDB is used as the database.

In the Machine Learning part I have used multiple Algorithms and a comparision of performance of all the Algorithms have been done.

The Algorithms used are:

  • GBT Classifier
  • Logistic Regression
  • Random Forest
  • Decision Tree

Python3 is used for this project. I have detailed out explaination as to what is done and each step is explained elaborately in the Jupyter Notebook.

About

Project to predict Rain in Australia using Machine Learning. The datasets which is used is large hence Big Data Concepts and techniques have been used. Pyspark is the tool used and MongoDB is used as database

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published