Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
this donow was a good review of a lot of things -- like connecting to databases, splicing arrays, making decision tree classifiers, handling plots in matplotlib.
the most difficult thing for me is still connecting to databases, seamlessly. it requires some tinkering around before i know how to do it. i realized i am confused about whether 10 fold validation means 10 splits on the data that are then split into training and testing data sets OR five splits on the whole data set that are then split into training and testing.
i learned about .ravel() and the necessity of having your target array having only rows (x rows,) rather than being shaped like (x rows, 1) if you want to use cross_val_scores.