-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ENH] Allow for correlations with regressors in decision tree #1008
Conversation
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #1008 +/- ##
==========================================
- Coverage 89.54% 89.25% -0.29%
==========================================
Files 26 26
Lines 3395 3434 +39
Branches 619 628 +9
==========================================
+ Hits 3040 3065 +25
- Misses 207 215 +8
- Partials 148 154 +6 ☔ View full report in Codecov by Sentry. |
Thank you for starting to work on this and sharing so that it's possible to start discussing. I do have one concern that I wanted to bring up sooner rather than later. The way you're approaching this now, you're effectively calculating a metric per component as part of a selection process. The advantage of this is that decision tree metrics can be tweaked using just the decision tree code, but the disadvantage is, if we start calculating metrics in the decision tree, then it's harder to understand what's happening using the decision tree outputs. That is, full columns would be added to the component table and nodes in the decision tree both make calculations and decisions. If I'm reading the draft code correctly, your not saving the fit metrics in the component table & that's important info to save. My preference would be to add this code as a metric and then most of the decisions could be made just using |
Also @goodalse2019 and @n-reddy both mentioned some interest in working on this, so I want to make sure they both saw you've started. |
That's a fair point @handwerkerd. I actually thought of using In any case, happy to discuss how we can adjust the code to use |
I like the idea of creating a new |
@eurunuela if you think the approach @tsalo and I are working on in #1021 is the one we'll move forward with, do you think this PR should be closed? |
Absolutely. Let's close this one. |
Closes #1009.
Changes proposed in this pull request:
dec_correlation_higherthan_thresholds()
function toselection_nodes.py
, which allows the user to provide atsv
file with regressors in its column to consider during component classification. The function also expects the number of metric labels (e.g.,["visual task", "motor task"]
) to match with the number of regressors (i.e., the number of columns in the regressors file).