This project aims to compare traditional Machine Learning methods for tabular data classification, such as Ensemble methods, Decision Trees, and Naive Bayes, with NLP classification methods like Multinomial Naive Bayes, RNNs, and Transformers. The comparison is conducted after converting tabular data into a text format. We are utilizing survey data from the CDC via the Behavioral Risk Factor Surveillance System (BRFSS), the United States' system for health-related telephone surveys. BRFSS collects state-level data on health-related risk behaviors such as smoking, chronic health conditions such as heart disease and diabetes, and preventive services utilization among U.S. residents.
-
Notifications
You must be signed in to change notification settings - Fork 0
This project aims to compare traditional Machine Learning methods for tabular data classification, such as Ensemble methods, Decision Trees, and Naive Bayes, with NLP classification methods like Multinomial Naive Bayes, RNNs, and Transformers. We are utilizing survey data from the CDC via the Behavioral Risk Factor Surveillance System (BRFSS)
License
GeorgeM2000/CDC-SMART-BRFSS-City-and-County-Data-Analysis
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
This project aims to compare traditional Machine Learning methods for tabular data classification, such as Ensemble methods, Decision Trees, and Naive Bayes, with NLP classification methods like Multinomial Naive Bayes, RNNs, and Transformers. We are utilizing survey data from the CDC via the Behavioral Risk Factor Surveillance System (BRFSS)
Topics
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published