-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Preprocessing benchmarking #337
Changes from 5 commits
702da0c
24b8af0
3bf9575
0e05872
7482a04
2428e9e
e5c6377
4a7389e
1a17cc1
9941fb5
e839fe8
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,9 @@ | ||
pbmc_data,https://www.dropbox.com/s/brj3orsjbhnhawa/5k.zip?dl=0 | ||
mouse_embryo_data,https://www.dropbox.com/s/8ftx1bydoy7kn6p/GSE65525.zip?dl=0 | ||
mouse_brain_data,https://www.dropbox.com/s/zzpotaayy2i29hk/neuron_10k.zip?dl=0 | ||
human_stemcell_data,https://www.dropbox.com/s/g2qua2j3rqcngn6/GSE75748.zip?dl=0 | ||
pbmc_raw_data,https://www.dropbox.com/s/brj3orsjbhnhawa/5k.zip?dl=0 | ||
human_breast_TGFb_data,https://dl.dropboxusercontent.com/scl/fi/qympicswl7slkksbjk7cp/GSE114397.zip?dl=0 | ||
human_breast_Dox_data,https://dl.dropboxusercontent.com/scl/fi/f2ifl5druqjr2rji8h4qq/GSM3141014.zip?dl=0 | ||
human_melanoma_data,https://dl.dropboxusercontent.com/scl/fi/ci9ihqytb4sys3u4xkdbq/human_melanoma_data.zip?dl=0 | ||
mouse_visual_data,https://dl.dropboxusercontent.com/scl/fi/yic1iwhh7a3gp6njyk0nf/mouse_visual_data.zip?dl=0 |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,54 @@ | ||
import os.path as osp | ||
|
||
IMPUTATION_DATASET_TO_FILE = { | ||
"pbmc_data": | ||
"5k_pbmc_protein_v3_filtered_feature_bc_matrix.h5", | ||
"mouse_embryo_data": [ | ||
osp.join("GSE65525", i) for i in [ | ||
"GSM1599494_ES_d0_main.csv", | ||
"GSM1599497_ES_d2_LIFminus.csv", | ||
"GSM1599498_ES_d4_LIFminus.csv", | ||
"GSM1599499_ES_d7_LIFminus.csv", | ||
] | ||
], | ||
"mouse_brain_data": | ||
"neuron_10k_v3_filtered_feature_bc_matrix.h5", | ||
"human_stemcell_data": | ||
"GSE75748/GSE75748_sc_time_course_ec.csv.gz", | ||
"human_breast_TGFb_data": | ||
"GSE114397_HMLE_TGFb.csv", | ||
"human_breast_Dox_data": | ||
"GSM3141014_Zeb1_Dox.csv", | ||
"human_melanoma_data": | ||
"human_melanoma_data.csv", | ||
"mouse_visual_data": [ | ||
'GSM2746905_B4_11_0h_counts.csv', | ||
# 'GSM2746906_B4_12_0h_counts.csv', | ||
# 'GSM2746922_B7_23_4h_B_counts.csv', | ||
# 'GSM2746895_B1_1_0h_counts.csv', | ||
# 'GSM2746916_B6_20_4h_A_counts.csv', | ||
# 'GSM2746903_B3_9_4h_counts.csv', | ||
# 'GSM2746914_B6_19_4h_A_counts.csv', | ||
# 'GSM2746908_B5_14_0h_counts.csv', | ||
# 'GSM2746907_B5_13_0h_counts.csv', | ||
# 'GSM2746917_B6_20_4h_B_counts.csv', | ||
# 'GSM2746918_B7_21_1h_counts.csv', | ||
# 'GSM2746898_B2_4_1h_counts.csv', | ||
# 'GSM2746909_B5_15_0h_counts.csv', | ||
# 'GSM2746915_B6_19_4h_B_counts.csv', | ||
# 'GSM2746897_B1_3_4h_counts.csv', | ||
# 'GSM2746902_B3_8_1h_counts.csv', | ||
# 'GSM2746911_B6_17_1h_A_counts.csv', | ||
# 'GSM2746904_B3_10_4h_counts.csv', | ||
# 'GSM2746900_B3_6_0h_counts.csv', | ||
# 'GSM2746920_B7_22_4h_B_counts.csv', | ||
# 'GSM2746896_B1_2_1h_counts.csv', | ||
# 'GSM2746921_B7_23_4h_A_counts.csv', | ||
# 'GSM2746899_B3_5_0h_counts.csv', | ||
# 'GSM2746919_B7_22_4h_A_counts.csv', | ||
# 'GSM2746901_B3_7_1h_counts.csv', | ||
# 'GSM2746910_B5_16_0h_counts.csv', | ||
# 'GSM2746912_B6_17_1h_B_counts.csv', | ||
'GSM2746913_B6_18_1h_counts.csv' | ||
] | ||
} |
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Hi @xingzhongyu, thanks for the PR! Could you provide a bit more context about why this edit ( There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Because the loop uses enumerate, the resulting tuple needs to be taken out of the following content, otherwise report an error |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@xingzhongyu could you help make some preliminary efforts to migrate the current download functions for the imputation dataset to use the downloading utilities in
dance/utils/download.py
like in the clustering dataset class? Imputation dataset is currently the only dataset that useswget
andcurl
(for historical reasons that are no longer relevant now).There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OK I will finish the task in this week