WeatherRaptor: Spark image machine learning CMPT 318 Project

Prerequisites:

Spark installed (2.2+)
Environment variables set: HADOOP_PATH, HADOOP_HOME, SPARK_PATH, SPARK_PATH, PYSPARK_DRIVER_PYTHON, PYSPARK_PYTHON, PYSPARK_WORKER_PYTHON, SPARK_LOCAL_IP
Python 3.4+ installed
Hadoop + HDFS installed
Weather data in yvr-weather
Image data in katkam-scaled with filenames of format katkam-YYYYMMDDHH0000.jpg

How to run:

To run, simply run `./run.sh`

There are various arguments you can pass:

--no-color --clean-dfs --no-setup --no-clean-images --no-clean-weather --no-analyze --analyze-tides

The commands are not mutually exclusive, everything with "--no-" prepended will be run by default, other commands will be run on top of the other functions.

Explanations:

--no-color:

- Run the analysis in Greyscale

--clean-dfs:

- Delete the files we created on HDFS (but still run everything else as explained above)

--no-setup:

- Do not load 318 module or install required packages (pip)

--no-clean-images:

- Do not clean the images

--no-clean-weather:

- Do not clean the weather data

--no-analyze:

- Do not run the analysis

--analyze-tides:

- Run the tide-specific analysis instead of the regular analysis

Example: To do analysis only on RGB

    - ./run.sh --no-setup --no-clean-images --no-clean-weather

Name		Name	Last commit message	Last commit date
Latest commit History 141 Commits
deprecated-neural-network-more-features		deprecated-neural-network-more-features
tide-folder		tide-folder
yvr-weather		yvr-weather
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
analysis.py		analysis.py
deep_learning_tensorflow.py		deep_learning_tensorflow.py
header		header
lookdata.py		lookdata.py
pair_images_by_time.py		pair_images_by_time.py
run.sh		run.sh
schema		schema
test.py		test.py
tide_data_analysis.py		tide_data_analysis.py
tide_data_clean.py		tide_data_clean.py
txtfile.txt		txtfile.txt
weather_parse.py		weather_parse.py
weather_setup.py		weather_setup.py
write_katkam_json.py		write_katkam_json.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WeatherRaptor: Spark image machine learning CMPT 318 Project

Prerequisites:

How to run:

To run, simply run `./run.sh`

There are various arguments you can pass:

About

Releases

Packages

Contributors 2

Languages

hgdsraj/318FinalProject

Folders and files

Latest commit

History

Repository files navigation

WeatherRaptor: Spark image machine learning CMPT 318 Project

Prerequisites:

How to run:

To run, simply run ./run.sh

There are various arguments you can pass:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

To run, simply run `./run.sh`

Packages