Make MA-PCA criteria curves more accessible to users #834

handwerkerd · 2022-01-17T18:27:39Z

Summary

The number of PCA components is defined using one of several methods. A recent neurostars question and an issue I've been examining would have been easier to answer if the criteria curves used to calculate the number of components were more accessible. I propose outputting these curves either by default or when the --verbose option is used.

Next Steps

@eurunuela has already made these values slightly more accessible in this PR: Make criteria curves accessible as part of the mapca object mapca#43
Have that PR also return either the full PCA object or the three criteria curves so that tedana can access that information
Add code after this line to save the numbers in the criteria curves for aic, kic, and mdl.

eurunuela · 2022-01-17T19:43:53Z

Have that PR also return either the full PCA object or the three criteria curves so that tedana can access that information

Do you think adding a "selection" array or dictionary would be enough? It could be something like mapca.selection_ = {"aic": 67, "kic": 56, "mdl": 36}.

Add code after this line to save the numbers in the criteria curves for aic, kic, and mdl.

I personally think we should save the plot and log the selection numbers.

tsalo · 2022-01-18T15:19:21Z

I think we should leverage the "classification" and "rationale" columns of the PCA component table, and retain the full PCA mixing matrix. When we shift from "rationale" to "tags", we can also add "aic_accepted", "kic_accepted", and "mdl_accepted" tags, right?

We could of course also add AIC, KIC, and MDL to the PCA table's metrics.

handwerkerd · 2022-01-20T03:11:59Z

There are a bunch of single values that should be saved in a clear location. I'd add % variance explained by the PCA to the three selection criteria.
FWIW, the decistion tree modularization has a dictionary & saved file called cross_component_metrics I could see making a new file called PCA_cross_component_metrics now and then the decision tree's file would become ICA_cross_component_metrics

I also think we should be saving the curves and not just the aic/kic/mdl thresholds. Those curves were important for diagnosing issues, so if we're saving something, they should be accessible.

The full PCA mixing matrix would be a fairly large file. If we're saying it, it should definitely only be saved with the --verbose option.

eurunuela mentioned this issue Feb 8, 2022

Print optimal number of maPCA components and plot optimization curves #839

Merged

4 tasks

dowdlelt mentioned this issue Mar 17, 2022

Make kundu tedpca selection results more accessible #860

Open

3 tasks

eurunuela self-assigned this May 17, 2022

eurunuela closed this as completed in #839 May 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make MA-PCA criteria curves more accessible to users #834

Make MA-PCA criteria curves more accessible to users #834

handwerkerd commented Jan 17, 2022

eurunuela commented Jan 17, 2022

tsalo commented Jan 18, 2022 •

edited

Loading

handwerkerd commented Jan 20, 2022

Make MA-PCA criteria curves more accessible to users #834

Make MA-PCA criteria curves more accessible to users #834

Comments

handwerkerd commented Jan 17, 2022

Summary

Next Steps

eurunuela commented Jan 17, 2022

tsalo commented Jan 18, 2022 • edited Loading

handwerkerd commented Jan 20, 2022

tsalo commented Jan 18, 2022 •

edited

Loading