Matching compound names from GNPS to MIBiG #3

sdrogers · 2018-10-25T19:48:30Z

This is a bit of a mess (and always will be, I think).
We could store a dictionary of the names that appear in the GNPS output (the LibraryID column), which seem to differ slightly from the names in the GNPS library MGF files.
We also need to handle the dereplicator results that appear in the LibraryID column, as well as results that come directly from dereplicator (which should be cleaner).
Also VarQuest, etc.

justinjjvanderhooft · 2018-10-25T20:03:48Z

Indeed, over the coming months I will try to validate links within the iOMEGA project, but it will remain tricky. I can also add SMILES to the GNPS library IDs, but I will need to find time for it, as we will need to double-check the identifications.

sdrogers · 2018-10-25T20:04:55Z

It seems, though, that GNPS doesn't necessarily provide the "IDs": the Crusemann file I'm working from has compound names. IDs would be more helpful.

sdrogers · 2018-10-25T20:05:21Z

Also, InChIKey may be better than SMILES (or both).

justinjjvanderhooft · 2018-10-25T20:10:42Z

What do you mean by IDs? I meant the compound names, but they are not always completely unambiguous.

sdrogers · 2018-10-25T20:12:17Z

The GNPS library spectra have official IDs (CCMSXXXXXXXX), but these aren't in the output. The output has the name "Staurosporine", which is also not identical to the name in the GNPS library MGF file, where the adduct is also present ("Staurosporine M+H" or something).
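
To illustrate the adduct mismatch described above, here is a minimal sketch that strips a trailing adduct annotation from a library name before comparing it with the output name. The function name `strip_adduct` and the regular expression are assumptions for illustration only; the adduct notations actually used in the MGF files may differ, and a compound name that genuinely ends in something like "M-1" would be trimmed by mistake.

```python
import re

# Assumed adduct notations: a trailing "M+H", "[M+Na]+", "[2M+H]+", "M-H2O+H", ...
# This pattern is a guess, not taken from the GNPS MGF specification.
_ADDUCT_RE = re.compile(r"\s+\[?\d*M[+-][A-Za-z0-9+-]+\]?\d*[+-]?\s*$")


def strip_adduct(name: str) -> str:
    """Return the name with a trailing adduct annotation removed, if one is found."""
    return _ADDUCT_RE.sub("", name).strip()


if __name__ == "__main__":
    for raw in ["Staurosporine M+H", "Staurosporine [M+Na]+", "Staurosporine"]:
        print(repr(raw), "->", repr(strip_adduct(raw)))
```

Names without a recognisable adduct suffix pass through unchanged, so the function can be applied to both sides of the comparison.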

sdrogers · 2018-10-25T20:13:11Z

I suspect we'll end up with a method that just tries lots of ways of comparing the two names.
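
One way such a "try lots of ways" method could look is a cascade that moves from strict to loose comparisons and reports which strategy matched. This is only a sketch: `normalize`, `match_name`, the strategy order, and the 0.9 cutoff are all hypothetical choices, not anything decided in this thread. It uses only the standard library (`difflib.get_close_matches`).

```python
import difflib
import re


def normalize(name: str) -> str:
    """Lowercase the name and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", name.lower())


def match_name(query, candidates, cutoff=0.9):
    """Return (candidate, strategy) for the first strategy that hits, else None."""
    # Strategy 1: exact string match.
    if query in candidates:
        return query, "exact"

    # Strategy 2: match after normalisation (case, spaces, punctuation ignored).
    by_norm = {normalize(c): c for c in candidates}
    q = normalize(query)
    if q in by_norm:
        return by_norm[q], "normalized"

    # Strategy 3: fuzzy match on the normalised strings (cutoff is an assumption).
    close = difflib.get_close_matches(q, list(by_norm), n=1, cutoff=cutoff)
    if close:
        return by_norm[close[0]], "fuzzy"

    return None


if __name__ == "__main__":
    names = ["Staurosporine", "Surfactin"]
    for q in ["Staurosporine", "staurosporine ", "Staurosporin", "Erythromycin"]:
        print(repr(q), "->", match_name(q, names))
```

Returning the strategy alongside the match makes it possible to review the fuzzy hits separately, since those are the ones most likely to be wrong.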

justinjjvanderhooft · 2018-10-25T20:18:48Z

Got it - you are right. Better to communicate with InChIKeys and SMILES/SMARTS.
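
If both sides carried InChIKeys, the comparison could be reduced to something like the sketch below. A standard InChIKey is 27 characters long and its first 14-character block hashes the connectivity, so two keys that share that block describe the same skeleton and may differ only in stereochemistry, isotopes, or protonation. The function name `inchikey_match` and the keys in the example are placeholders, not real compounds.

```python
def inchikey_match(key_a: str, key_b: str):
    """Return 'full', 'connectivity', or None for two InChIKeys."""
    a = key_a.strip().upper()
    b = key_b.strip().upper()
    if a == b:
        return "full"
    # The first block (14 characters) encodes the connectivity layer only.
    if len(a) >= 14 and len(b) >= 14 and a[:14] == b[:14]:
        return "connectivity"
    return None


if __name__ == "__main__":
    # Placeholder keys with the InChIKey layout; they do not identify real molecules.
    gnps_key = "ABCDEFGHIJKLMN-OPQRSTUVSA-N"
    mibig_key = "ABCDEFGHIJKLMN-ZZZZZZZZSA-N"
    print(inchikey_match(gnps_key, mibig_key))
```

Whether a connectivity-only match should count as a link is a judgment call that would need validating against the data.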

CunliangGeng · 2022-06-29T12:19:41Z

I assume this issue has been solved, please reopen it if not.

CunliangGeng closed this as completed Jun 29, 2022

CunliangGeng added the "data-format" label (issues related to format of data) Jun 29, 2022
