-
Notifications
You must be signed in to change notification settings - Fork 67
Planned data release: V5 #121
Comments
@jharenza we may want to split the expression data files up based on the RNA library based on @jashapiro 's results on #120 - notebook here |
ok - separate RDS files for FPKM from each algorithm, as well as counts? |
That seems like it would be the most flexible way to go as far as downstream options if it's straightforward to do. |
Ok, yeah once we merge, I can just separate into two RDS files. |
Thinking about this again - would it be easier for those working on RNA to do this separation using the clinical file? I ask because many other analyses using these files may have to re-combine (eg - fusion workflow will use both stranded and polyA and if we look for CNV deletion evidence on the basis of RNA expression <1 FPKM, we would recombine these files). I agree we should not cluster/perform certain analyses together, but trying to think of what makes more sense - for those specific analyses tainted by strand to separate or for us to separate all now and make sure other users know they may have to re-combine. Thoughts? |
I would still support separating the files, and having the users combine them as needed. This means that the user will have to explicitly acknowledge that they are combining two different data sets, and may give a slight bit of pause to consider whether that is appropriate. For example, the meaning of FPKM < 1 is likely to be different in the two data sets, and the using the same cutoffs may not be appropriate. |
@jashapiro - makes sense! We will plan on separating them then. |
@cgreene @jaclyn-taroni @jashapiro - we have the CHANGELOG and all files except the transcript counts file staged and ready to go - that merge is taking longer than anticipated, so @yuankunzhu will either create the PR without that file later tonight and add tomorrow or just create the whole PR tomorrow morning. |
Currently planned for 24-Sept-2019
Planned addition + changes from @jharenza :
The text was updated successfully, but these errors were encountered: