[ENH] Allow for correlations with regressors in decision tree #1008

eurunuela · 2023-11-24T10:12:53Z

Closes #1009.

Changes proposed in this pull request:

Adds the dec_correlation_higherthan_thresholds() function to selection_nodes.py, which lets the user provide a TSV file whose columns are regressors to consider during component classification. The function also expects the number of metric labels (e.g., ["visual task", "motor task"]) to match the number of regressors (i.e., the number of columns in the regressors file).
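For readers skimming the thread, here is a minimal sketch of the kind of check described above. It is not the code from this PR: the function names, argument names, and array shapes are all assumptions made for illustration. It correlates each component time series with each regressor column, checks that there is one label per regressor, and accepts either a single threshold or one threshold per regressor.

```python
import numpy as np


def correlate_with_regressors(mixing, regressors):
    """Pearson correlation of every component with every regressor.

    mixing:     (n_timepoints, n_components) component time series
    regressors: (n_timepoints, n_regressors) columns of the regressors file
    Returns an (n_components, n_regressors) array.
    Hypothetical helper; constant columns (zero variance) are not handled.
    """
    mixing = np.asarray(mixing, dtype=float)
    regressors = np.asarray(regressors, dtype=float)
    if mixing.shape[0] != regressors.shape[0]:
        raise ValueError("mixing and regressors need the same number of timepoints")
    # z-score each column, then the scaled dot product is the Pearson r
    mix_z = (mixing - mixing.mean(axis=0)) / mixing.std(axis=0)
    reg_z = (regressors - regressors.mean(axis=0)) / regressors.std(axis=0)
    return mix_z.T @ reg_z / mixing.shape[0]


def exceeds_thresholds(corrs, thresholds, labels):
    """Flag components whose |r| exceeds the threshold for each regressor.

    thresholds may be a single number or one value per regressor.
    labels must contain one entry per regressor column.
    """
    n_regressors = corrs.shape[1]
    if len(labels) != n_regressors:
        raise ValueError("number of labels must match number of regressors")
    # Raises ValueError if thresholds is neither length 1 nor n_regressors
    thresholds = np.broadcast_to(np.atleast_1d(thresholds), (n_regressors,))
    return np.abs(corrs) > thresholds


# Toy data: component 0 equals the regressor, component 1 is orthogonal to it
t = np.arange(100)
example_mixing = np.column_stack(
    [np.sin(2 * np.pi * t / 20), np.cos(2 * np.pi * t / 20)]
)
example_regressors = np.sin(2 * np.pi * t / 20)[:, None]
example_corrs = correlate_with_regressors(example_mixing, example_regressors)
example_flags = exceeds_thresholds(example_corrs, 0.5, ["visual task"])
```

In this toy case only the first component is flagged for the "visual task" regressor. Whether to use the absolute correlation, as done here, is a design choice the PR description does not specify.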

codecov · 2023-11-24T16:02:25Z

Codecov Report

Attention: Patch coverage is 67.44186% with 14 lines in your changes missing coverage. Please review.

Project coverage is 89.25%. Comparing base (1c3f93e) to head (4f286e1).
Report is 63 commits behind head on main.

Files with missing lines	Patch %	Lines
tedana/selection/selection_nodes.py	63.15%	8 Missing and 6 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1008      +/-   ##
==========================================
- Coverage   89.54%   89.25%   -0.29%     
==========================================
  Files          26       26              
  Lines        3395     3434      +39     
  Branches      619      628       +9     
==========================================
+ Hits         3040     3065      +25     
- Misses        207      215       +8     
- Partials      148      154       +6


handwerkerd · 2023-11-30T16:44:26Z

Thank you for starting to work on this and sharing so that it's possible to start discussing.

I do have one concern that I wanted to bring up sooner rather than later. The way you're approaching this now, you're effectively calculating a metric per component as part of the selection process. The advantage is that decision tree metrics can be tweaked using just the decision tree code. The disadvantage is that, if we start calculating metrics in the decision tree, it becomes harder to understand what's happening from the decision tree outputs. That is, full columns would be added to the component table, and nodes in the decision tree would both make calculations and make decisions. If I'm reading the draft code correctly, you're not saving the fit metrics in the component table, and that's important info to save.

My preference would be to add this code as a metric, and then most of the decisions could be made using just dec_left_op_right. That would make it hard to play around with regressor fit metrics on the same components, but there are ways to address that issue. We can talk more about this later, but I wanted to bring this up before you do a lot more work on this.

handwerkerd · 2023-11-30T16:57:29Z

Also @goodalse2019 and @n-reddy both mentioned some interest in working on this, so I want to make sure they both saw you've started.

eurunuela · 2023-12-01T12:38:25Z

That's a fair point @handwerkerd. I actually thought of using dec_left_op_right for this, but ended up building this other approach. It seemed to me that the approach in this PR would be the easiest and quickest way to get a working version ready.

In any case, happy to discuss how we can adjust the code to use dec_left_op_right.

tsalo · 2024-01-09T19:03:38Z

I like the idea of creating a new metrics/external.py module with functions to correlate the component time series with the external ones. Then the metrics could be evaluated with dec_left_op_right in the decision tree.
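To make the split concrete, here is a rough sketch of the design being proposed: the correlation is computed once as a metric and stored in the component table, and a generic comparison node then makes the decision. Everything below is hypothetical. The helper left_op_right is only loosely modeled on the idea behind dec_left_op_right and does not reproduce its real signature, and the column name corr_visual_task and the "accepted" outcome are made up for illustration.

```python
import operator

# Map comparison strings to functions so the node stays generic
OPS = {
    ">": operator.gt,
    ">=": operator.ge,
    "<": operator.lt,
    "<=": operator.le,
    "==": operator.eq,
}


def left_op_right(component_table, left, op, right, if_true):
    """Reclassify every component for which `left op right` holds.

    component_table: list of dicts, one per component (a stand-in for the
                     real component table)
    left:            name of a metric column already stored in the table
    right:           a number, or the name of another metric column
    if_true:         classification assigned when the comparison is true
    The node only reads stored metrics; it never computes new ones.
    """
    compare = OPS[op]
    for row in component_table:
        right_value = row[right] if isinstance(right, str) else right
        if compare(row[left], right_value):
            row["classification"] = if_true
    return component_table


# The metric step (e.g., a function in metrics/external.py) would fill in
# the correlation column before the decision tree runs.
component_table = [
    {"component": 0, "corr_visual_task": 0.82, "classification": "unclassified"},
    {"component": 1, "corr_visual_task": 0.05, "classification": "unclassified"},
]

left_op_right(component_table, "corr_visual_task", ">", 0.5, "accepted")
```

Because the correlation lives in the table rather than inside the node, the value that drove each decision remains visible in the outputs, which is the concern raised earlier in the thread.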

handwerkerd · 2024-02-28T15:51:23Z

@eurunuela if you think the approach @tsalo and I are working on in #1021 is the one we'll move forward with, do you think this PR should be closed?

eurunuela · 2024-02-28T16:31:36Z

Absolutely. Let's close this one.

eurunuela added 5 commits November 24, 2023 11:09

First version of a correlation node

7baadee

Made it possible to use more than one regressor in a single tsv file

ab1bb17

Do not assume there is a single threshold

48c2c8b

Updated 5 echo test to test with regressors

126611c

Added docstring to new function

4f286e1

handwerkerd mentioned this pull request Jan 17, 2024

January 2024 Developers call #1015

Closed

tsalo mentioned this pull request Feb 7, 2024

Allow for correlations with regressors in decision tree #1021

Closed

5 tasks

eurunuela closed this Feb 28, 2024

handwerkerd mentioned this pull request Mar 20, 2024

Generate metrics from external regressors using F stats #1064

Merged
