Classifying Genetic Mutations from High-Dimensional Clinical Features

  1. Packages

  2. Downloading data

  3. Data preprocessing

  4. Primary data analysis

  5. Data featurization

    1. Text Features
    2. Categorical Features
  6. Secondary data analysis

    1. Visualizing high-dimensional text features by t-SNE
    2. Visualizing Gene & Variation features by t-SNE
  7. ML model building

  8. Conclusion