
[FIX] Normalize PCA mixing matrix over time, not component #228

Merged (8 commits) on Mar 11, 2019

Conversation

@tsalo (Member) commented Feb 28, 2019

References #223. One of the concerns I brought up in #223 is that the normalized PCA mixing matrix, which is only used to calculate the weight maps (WTS) within fitmodels_direct, is normalized over component, rather than over time. This strikes me as invalid, though I could be misinterpreting the purpose of the normalization. This will not impact the MLE dimensionality estimation, but should improve the validity of the Kundu PCA decision tree.

Changes proposed in this pull request:

  • Z-score the PCA mixing matrix over time (per component), rather than over components (per TR); a sketch of the difference is included below.
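
For concreteness, a minimal sketch of the two normalizations. The time-by-component orientation and the use of scipy.stats.zscore here are illustrative assumptions, not tedana's actual code:

```python
import numpy as np
from scipy import stats

# Hypothetical mixing matrix: rows are timepoints, columns are components.
rng = np.random.default_rng(seed=0)
mmix = rng.standard_normal((160, 80))

# Old behavior: z-score each timepoint (row) across components.
mmix_old = stats.zscore(mmix, axis=1)

# Proposed behavior: z-score each component (column) over time.
mmix_new = stats.zscore(mmix, axis=0)
```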

@ME-ICA deleted a comment from codecov bot Feb 28, 2019
@ME-ICA deleted a comment from codecov bot Mar 1, 2019
@ME-ICA deleted a comment from codecov bot Mar 1, 2019
@codecov bot commented Mar 1, 2019

Codecov Report

Merging #228 into master will increase coverage by 0.02%.
The diff coverage is 0%.


@@            Coverage Diff             @@
##           master     #228      +/-   ##
==========================================
+ Coverage   47.83%   47.86%   +0.02%     
==========================================
  Files          33       33              
  Lines        2013     2012       -1     
==========================================
  Hits          963      963              
+ Misses       1050     1049       -1
Impacted Files                          Coverage Δ
tedana/decomposition/eigendecomp.py     10.34% <0%> (+0.05%) ⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@ME-ICA deleted a comment from codecov bot Mar 2, 2019
@tsalo requested a review from emdupre March 6, 2019 14:08
@emdupre (Member) commented Mar 10, 2019

To the best of my understanding, the purpose of normalization was so that each component would be on the same scale when passed on to the PCA decision tree (and therefore could use, e.g., the same thresholds). If we normalize by time, that assumption no longer holds.

Could you explain a little more why you were thinking it would make sense to normalize over time?

@tsalo (Member, Author) commented Mar 10, 2019

The old way z-scored each timepoint across components, while this fix z-scores each component across timepoints. The former approach doesn't just rescale the time series, but also changes them. Granted, the difference is small (for a random array with 80 components and 160 timepoints, the correlation before vs. after z-scoring is ~0.99), but, as far as I can tell, it still isn't a valid change to make. See the sketch below.
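
As a rough illustration of that check (the exact array shape, seed, and use of scipy.stats.zscore are assumptions, not the code that was actually run):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(seed=0)
mmix = rng.standard_normal((160, 80))  # 160 timepoints x 80 components

# Old way: z-score each timepoint across components. This mixes in the
# row-wise mean/std, so each component's time series changes shape.
old = stats.zscore(mmix, axis=1)
r_old = [np.corrcoef(mmix[:, i], old[:, i])[0, 1] for i in range(mmix.shape[1])]
print(np.mean(r_old))  # high (~0.99), but not exactly 1

# New way: z-score each component over time. This is an affine rescaling
# of each time series, so the correlation with the original is exactly 1.
new = stats.zscore(mmix, axis=0)
r_new = [np.corrcoef(mmix[:, i], new[:, i])[0, 1] for i in range(mmix.shape[1])]
print(np.mean(r_new))  # 1.0 (up to floating point)
```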

@emdupre (Member) commented Mar 11, 2019

Ah, sorry, I think I misunderstood! So to clarify (for me):

  • What is the correlation between the original components and this new normalization? The 0.99 is with the old normalization, correct?

  • I think we should be checking the PCA selection tree in the integration tests. This seems like as good a time as any, since I want to know how this is impacting the PCA selection. WDYT? Should we add it to the three-echo dataset?

@tsalo (Member, Author) commented Mar 11, 2019

The time series correlate perfectly after z-scoring the new way. Yeah, the old way gets 0.999.

That's a good idea. I can change the three-echo integration test in this PR.

@emdupre (Member) commented Mar 11, 2019

If this is correlating at 1.0, then I'm wondering if it's really a necessary normalization -- obviously removing the old one seems to be!

If we can fix the merge conflict (sorry, I think I pulled it in with #208), then this LGTM!

@tsalo (Member, Author) commented Mar 11, 2019

It's used to generate the normalized version of the mixing matrix, which is used in fitmodels_direct. Hypothetically, the scale of the time series should impact the parameter estimates, and I believe that the goal of the z-scoring is to make the parameter estimates from computefeats2 equivalent to betas. This is definitely something worth discussing in the larger context of how metric calculation should be performed (e.g., in #178, #179, and #223), but if we work under the assumption that the current version of fitmodels_direct is generally correct, then this is still a bug that needs to be fixed.
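
To illustrate the intuition about betas (this is only a sketch of the general statistics, not tedana's fitmodels_direct or computefeats2 code): when both the regressor and the data are z-scored over time, the ordinary least-squares slope equals the Pearson correlation, i.e. a standardized beta.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(seed=1)
x = rng.standard_normal(160)             # a single component time series
y = 2.0 * x + rng.standard_normal(160)   # a single voxel time series

# Z-score both over time, then fit an intercept-free OLS model.
xz = stats.zscore(x)
yz = stats.zscore(y)
beta = np.linalg.lstsq(xz[:, np.newaxis], yz, rcond=None)[0][0]

print(np.isclose(beta, np.corrcoef(x, y)[0, 1]))  # True: beta equals Pearson r
```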

@emdupre (Member) commented Mar 11, 2019

I think this is good to merge. It sounds like we need to do some issue clean-up around metric calculation -- we can deal with that after this is in :)

@jbteves added the "breaking change" label (will make a non-trivial change to outputs) and removed the "output-change" label on Apr 19, 2021