Debugging class imbalance #3142

Answered by arnavgarg1
Overload119 asked this question in Q&A
Hi @Overload119! Thanks for explaining the steps you took so clearly; it made everything easy to follow. I ran a variety of tests with the same dataset and config and wanted to share the results with you.

  1. The snapshot of the dataset you added to Google Drive is balanced, with an almost exact 50-50 split between 1s and 0s. If you balanced the dataset manually by dropping rows from the majority class until both classes were the same size, it may no longer represent the true dataset (or what you will actually see in production when you run inference against your trained model). Instead, I would sugge…
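To illustrate the point about manual balancing, here is a minimal sketch in plain Python. It is not taken from the original dataset or config: the label counts (900 negatives, 100 positives) and the helper names `class_split` and `downsample_majority` are hypothetical, chosen only to show how dropping majority-class rows changes the class distribution the model sees.

```python
import random
from collections import Counter


def class_split(labels):
    """Return the fraction of rows belonging to each class label."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.items()}


def downsample_majority(labels, seed=0):
    """Randomly drop rows from larger classes until every class
    has as many rows as the smallest class."""
    rng = random.Random(seed)
    by_class = {}
    for label in labels:
        by_class.setdefault(label, []).append(label)
    smallest = min(len(rows) for rows in by_class.values())
    balanced = []
    for rows in by_class.values():
        balanced.extend(rng.sample(rows, smallest))
    return balanced


# Hypothetical labels (an assumption for illustration): 90% zeros, 10% ones.
original = [0] * 900 + [1] * 100
balanced = downsample_majority(original)

print(class_split(original))  # {0: 0.9, 1: 0.1}
print(class_split(balanced))  # {0: 0.5, 1: 0.5}
```

In this sketch the balanced set keeps only 200 of the 1,000 rows, and its 50-50 split no longer matches the 90-10 split that inference traffic would follow under the same assumption.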

Replies: 1 comment 1 reply

1 reply
@tgaddair
Answer selected by Overload119
3 participants