GitHub - yuanboFaith/Lemon_Juice_Classification2: Assessment of lemon juice adulteration by UHPLC-QqQ-MS/MS with interactive and interpretable machine learning

Assessment of lemon juice quality and adulteration by ultra-high performance liquid chromatography/triple quadrupole mass spectrometry with interactive and interpretable machine learning

Check the original article published in Journal of Food and Drug Analysis.

Abstract

A total of 81 lemon juices samples were detected using an optimized UHPLC-QqQ-MS/MS method and colorimetric assays. Concentration of 3 organic acids (ascorbic acid, malic acid and citric acid), 3 saccharides (glucose, fructose and sucrose) and 6 phenolic acids (trans-p-coumaric acid, 3-hydroxybenzoic acid, 4-hydroxybenzoic acid, 3,4-dihydroxybenzoic acid, caffeic acid) were quantified. Their total polyphenol, antioxidant activity and Ferric reducing antioxidant power were also measured. For the prediction of authentic and adulterated lemon juices and commercially sourced lemonade beverages based on the acquired metabolic profile, machine learning models including linear discriminant analysis, Gaussian naïve Bayes, lasso-regularized logistic regression, random forest (RF) and support vector machine were developed based on training (70%)-cross-validation-testing (30%) workflow. The predicted accuracy on the testing set is 73–86% for different models. Individual conditional expectation analysis (how predicted probabilities change when the feature magnitude changes) was applied for model interpretation, which in particular revealed the close association of RF-probability prediction with nuance characteristics of the density distribution of metabolic features. Using established models, an open-source online dashboard was constructed for convenient classification prediction and interactive visualization in real practice.

Script Reference

The R script in this documentation covers data wrangling, visualization, machine learning modeling, and Shiny App construction developed in this original publication. Check here to find the script and associated output.

The R code has been developed with reference to R for Data Science (2e), and the official documentation of tidyverse, and DataBrewer.co. See breakdown of modules below:

Data visualization with ggplot2 (tutorial of the fundamentals; and data viz. gallery).
Data wrangling with the following packages: tidyr: transform (e.g., pivoting) the dataset into tidy structure; dplyr: the basic tools to work with data frames; stringr: work with strings; regular expression: search and match a string pattern; purrr: functional programming (e.g., iterating functions across elements of columns); and tibble: work with data frames in the modern tibble structure.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
ICEplots_files		ICEplots_files
LemonMLscript_files		LemonMLscript_files
site_libs		site_libs
.DS_Store		.DS_Store
.Rhistory		.Rhistory
.gitignore		.gitignore
ICEplots.Rmd		ICEplots.Rmd
ICEplots.html		ICEplots.html
LemonMLscript.Rmd		LemonMLscript.Rmd
LemonMLscript.html		LemonMLscript.html
README.md		README.md
Shiny lemon final data_CLEANED UP.csv		Shiny lemon final data_CLEANED UP.csv
Shiny_App_Script.Rmd		Shiny_App_Script.Rmd
Shiny_App_Script.html		Shiny_App_Script.html
_site.yml		_site.yml
background.css		background.css
index.Rmd		index.Rmd
index.html		index.html
lemonImage.jpg		lemonImage.jpg
token.rds		token.rds

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Assessment of lemon juice quality and adulteration by ultra-high performance liquid chromatography/triple quadrupole mass spectrometry with interactive and interpretable machine learning

Abstract

Script Reference

Follow me. Keep Updated with My Latest Research

About

Releases

Packages

Languages

yuanboFaith/Lemon_Juice_Classification2

Folders and files

Latest commit

History

Repository files navigation

Assessment of lemon juice quality and adulteration by ultra-high performance liquid chromatography/triple quadrupole mass spectrometry with interactive and interpretable machine learning

Abstract

Script Reference

Follow me. Keep Updated with My Latest Research

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages