Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix/update_IDs #220

Merged
merged 21 commits into from
May 20, 2020
Merged

Fix/update_IDs #220

merged 21 commits into from
May 20, 2020

Conversation

eiden309
Copy link
Contributor

@eiden309 eiden309 commented May 16, 2020

Main improvements in this PR:

1. After manual curation of deprecated IDs and selected unbalanced reactions (details found in modifyID.tsv), used script modifyID.m to:

  • Update 63 rxnMetaNetXIDs (MNXRIDs), 5 rxnKEGGIDs, 38 metMetaNetXIDs (MNXMIDs), 1 metKEGGID and 12 metChEBIIDs
  • Add 15 MNXRIDs, 10 rxnKEGGIDs, 90 MNXMIDs, 3 metKEGGIDs and 12 metChEBIIDs
  • Remove 1 MNXRID and 2 rxnKEGGIDs

*Numbers reported corresponds to the number of unique IDs updated, added or removed
**Addresses #215 metabolite and reaction annotation mistake, as well as #205 KEGG reaction annotation issue (r_0198, r_0911, r_4245, r_2116, r_4254 and r_4255)

Summary of issues resolved for #205 :
Reactions Notes
r_0198 edited as suggested in #205 - changed model.rxnKEGGID to R00028
r_0911 With reference to module M00048, there is another pathway R07404 and R07405 which requires ATP as a substrate. R04209 has been removed from r_0911 since it is an incorrect reaction annotation. However, this remains an issue as there is no way of incorporating 2 rxnKEGGIDs (R07404 and R07405) into the model.
r_4245 R09645 removed from the model since it cannot be found in KEGG, as mentioned in #205.
r_2116 edited as suggested in #205 - changed model.rxnKEGGID to R00710
r_4254 changed model.rxnKEGGID to R05724 and model.rxnName to nitric oxide, NADH2:oxygen oxidoreductase
r_4255 changed model.rxnName to nitric oxide, NADPH2:oxygen oxidoreductase

2. mapKEGGID.m script written to search for new rxnKEGGIDs and metKEGGIDs through ID mapping, added 134 unique rxnKEGGIDs and 15 unique metKEGGIDs
*Note that MNXref version 3.2 is used

3. mapMNXMID.m script written to search for new metMNXMIDs through ID mapping, added 52 unique MNXMIDs. After new MNXMIDs are added, mapKEGGID.m is run again to add 6 unique metKEGGIDs
*Note that MNXref version 3.2 is used
**Addresses #197 lack of MNXMID annotations for some metabolites

I hereby confirm that I have:

  • Tested my code with all requirements for running the model
  • Selected devel as a target branch (top left drop-down menu)
  • If needed, asked first in the Gitter chat room about this PR

…X and KEGG)

New reaction IDs are found during manual curation of reactions in the model. These IDs either replace the current model IDs or are added into the model (if no IDs were originally present).

Inappropriately annotated or deprecated IDs have also been removed.

Details of the changes, such as the rationale for the addition of IDs or changes of IDs are recorded in modifyID.tsv, under the various notes.
…aNetX, KEGG and ChEBI)

New metabolite IDs are found during manual curation of metabolites in the model. These IDs either replace the current model IDs or are added into the model (if no IDs were originally present).

Details of the changes, such as the rationale for the addition of IDs or changes of IDs are recorded in modifyID.tsv, under the various notes.
With the function mapIDsViaMNXref from RAVEN, model.rxnMetaNetXID is mapped as follows:

1) rxnMetaNetXID -> rxnKEGGID
2) rxnMetaNetXID -> rxnMetaCycID -> rxnKEGGID
3) rxnMetaNetXID -> rxnSEEDID -> rxnKEGGID

Total unique rxnKEGGID added: 134
With the function mapIDsViaMNXref from RAVEN, model.metMetaNetXID is mapped as follows:

1) metMetaNetXID -> metKEGGID
2) metMetaNetXID -> metMetaCycID -> metKEGGID
3) metMetaNetXID -> metSEEDID -> metKEGGID

Total unique metKEGGID added: 15
model.metChEBIID and model.metKEGGID is mapped to find new MNXMID(s).

New MNXMID(s) will only be added if 1 of the following 3 criteria is met:
   a) Both metKEGGID and metChEBIID is mapped to the same MNXMID
   b) Only metKEGGID mapped to MNXMID, metFormula of new MNXMID matches model.metFormula
   c) Only metChEBIID mapped to MNXMID, metFormula of new MNXMID matches model.metFormula

Total unique MNXMIDs added: 52
@eiden309 eiden309 added enhancement new field/feature bug something is wrong in the model labels May 16, 2020
@eiden309 eiden309 requested review from edkerk and feiranl May 16, 2020 06:03
…yID.tsv

Added lines 135 – 137, 168 – 170, 203 – 205 to remove metabolite ID annotation in model, based on modifyID.tsv.

Minor correction made to modifyID.tsv:
1) Remove unnecessary metabolite and alternative metabolite ID entries which causes duplicated or lengthy notes
2) Correct errors generated when importing deprecated IDs into modifyID.tsv.
Add model.metNotes and model.rxnNotes with correct annotation PR #220 instead of PR #198
@edkerk edkerk self-assigned this May 16, 2020
README.md Outdated Show resolved Hide resolved
ComplementaryScripts/modelCuration/modifyID.m Outdated Show resolved Hide resolved
ComplementaryScripts/modelCuration/modifyID.m Outdated Show resolved Hide resolved
ComplementaryScripts/modelCuration/modifyID.m Outdated Show resolved Hide resolved
ComplementaryScripts/modelCuration/modifyID.m Outdated Show resolved Hide resolved
ComplementaryScripts/modelCuration/modifyID.m Outdated Show resolved Hide resolved
ComplementaryScripts/modelCuration/modifyID.m Outdated Show resolved Hide resolved
ComplementaryScripts/modelCuration/modifyID.m Outdated Show resolved Hide resolved
ComplementaryScripts/modelCuration/modifyID.m Outdated Show resolved Hide resolved
ComplementaryScripts/modelCuration/modifyID.m Outdated Show resolved Hide resolved
ComplementaryScripts/modelCuration/modifyID.m Outdated Show resolved Hide resolved
eiden309 and others added 9 commits May 18, 2020 01:51
Co-authored-by: Eduard Kerkhoven <eduardk@chalmers.se>
Co-authored-by: Eduard Kerkhoven <eduardk@chalmers.se>
Co-authored-by: Eduard Kerkhoven <eduardk@chalmers.se>
New mapKEGGID.m is able to check whether MetaCyc or modelSEED data is downloaded/imported successfully, and clear warning message will be displayed if download/import is unsuccessful.
This reverts commits 029b34d, 77906f8 and 682263d
Will be applied in separate branch
@edkerk
Copy link
Member

edkerk commented May 19, 2020

To separate two different issues, I have reverted the EOL and .gitattributes changes (f28c99f), as these are applied in a separate PR #221.

Copy link
Member

@edkerk edkerk left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good from my side. @feiranl is also running the code to confirm reproducibility.

Copy link
Contributor

@BenjaSanchez BenjaSanchez left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

also looks good from my side, and the model changes don't conflict with the work in #188 (close to finishing my work there)

Copy link
Collaborator

@feiranl feiranl left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great! I don't have more comments!

@BenjaSanchez BenjaSanchez merged commit 14a3b7b into devel May 20, 2020
@BenjaSanchez BenjaSanchez deleted the fix/update_IDs branch May 20, 2020 13:04
This was referenced May 27, 2020
This was referenced Jun 12, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug something is wrong in the model enhancement new field/feature
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants