rforbiodatascience21/2021_group11_final_project

Exam Project - README

This is the final project for the course "22100 - R for Bio Data Science" by group 11.

Authors: Freja Dahl Hede, Maika Jensen, Malene Nørregaard, Sofie Rossen, Sofie Theisen Honoré

The project is an exploratory data analysis based on material from the paper:

Assessment of the influence of intrinsic environmental and geographical factors on the bacterial ecology of pit latrines. Belen Torondel, Jeroen H.J. Ensink, Ozan Gundogdu, Umer Zeeshan Ijaz, Julian Parkhill, Faraji Abdelahi, Viet‐Anh Nguyen, Steven Sudgen, Walter Gibson, Alan W. Walker, Christopher Quince. Microbial Biotechnology. 2016

The raw data can be accessed here and consists of latrine samples from Vietnam and Tanzania, along with environmental factors and bacterial content.

The project includes:

  • Data cleaning and wrangling
  • Initial Exploratory Analysis, e.g. violin plots, correlation analysis and a heatmap
  • Principal Component Analysis
  • K-means Clustering Analysis
  • A Comparative Analysis comprising a t-test on the OTU counts between Tanzania and Vietnam, an examination of the correlation between certain bacteria and environmental factors, and examples of distributions
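
As an illustration of the comparative step, the t-test on OTU counts between the two countries could look roughly like the sketch below. This is not the project's actual code: the data frame `otu_counts`, its columns `country` and `count`, and all values are hypothetical and made up for the example, and the README does not say which t-test variant was used. The sketch assumes base R's default Welch two-sample test.

```r
# Hypothetical OTU counts per latrine sample; the values are invented
# for illustration and do not come from the Torondel et al. data.
otu_counts <- data.frame(
  country = rep(c("Tanzania", "Vietnam"), each = 5),
  count   = c(120, 95, 143, 110, 130, 80, 72, 101, 65, 90)
)

# Welch two-sample t-test comparing mean OTU count between countries.
# t.test() uses var.equal = FALSE by default, i.e. the Welch variant.
result <- t.test(count ~ country, data = otu_counts)

# Inspect the test statistic, degrees of freedom and p-value.
print(result$statistic)
print(result$parameter)
print(result$p.value)
```

With real data, the counts would typically be checked for skewness first (OTU counts are often log-transformed before a t-test), which is an assumption about good practice rather than something the README states.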
