Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

🐛 fix "-" as psm-fdr setting #320

Merged
merged 5 commits into from
Jul 12, 2024
Merged

🐛 fix "-" as psm-fdr setting #320

merged 5 commits into from
Jul 12, 2024

Conversation

enryH
Copy link
Member

@enryH enryH commented Jul 12, 2024

  • allow to have a string value for ProlineStudio as fdr (MBR has no FDR?)
  • add minimum and maximum of positives charges specified

@mlocardpaulet It would be great to have at some point a schema specifying allowed values per Software. But I guess the more we test, we will find out which ones are allowed...

Henry added 2 commits July 12, 2024 13:47
- allow to have a string value for ProlineStudio as fdr
@mlocardpaulet
Copy link
Contributor

Huummm... It is true that I don't see anywhere the value for FDR threshold. Anne-Marie indicated that the expected values are:

software tool name and version : Proline 2.2
search engine name and version : Mascot 2.8.3
peaklist generation software tool name and version : MGFBoost 3.6.3 (with eclean)
FDR threshold for PSM, peptide and protein level : 1% PSM
match between run (or not) : yes or no (depends on the file)
precursor mass tolerance : 10ppm
enzyme (although for these data it should be Trypsin) : Trypsin/P
number of missed-cleavages : 2
minimum and maximum peptide length : 7
fixed and variable modifications : cysteine CAM as fixed, methionin oxidation and Protein Nter acetylation as variable
maximum number of modifications : 7
minimum and maximum precursor charge : 2-5

But I don't know where to find these values. I'll ask her Monday.

@enryH
Copy link
Member Author

enryH commented Jul 12, 2024

Huummm... It is true that I don't see anywhere the value for FDR threshold. Anne-Marie indicated that the expected values are:

software tool name and version : Proline 2.2
search engine name and version : Mascot 2.8.3
peaklist generation software tool name and version : MGFBoost 3.6.3 (with eclean) # we don't support that yet
FDR threshold for PSM, peptide and protein level : 1% PSM 
match between run (or not) : yes or no (depends on the file)
precursor mass tolerance : 10ppm
enzyme (although for these data it should be Trypsin) : Trypsin/P
number of missed-cleavages : 2
minimum and maximum peptide length : 7
fixed and variable modifications : cysteine CAM as fixed, methionin oxidation and Protein Nter acetylation as variable
maximum number of modifications : 7
minimum and maximum precursor charge : 2-5

But I don't know where to find these values. I'll ask her Monday.

We don't support: peaklist generation software tool name and version anywhere

We definitely still have some blanks: https://github.com/Proteobench/ProteoBench/blob/fb4d835c43dcdc26d4870c24c9fb34005d5551ba/test/params/ProlineStudio_withMBR.csv

@mlocardpaulet
Copy link
Contributor

Huummm... It is true that I don't see anywhere the value for FDR threshold. Anne-Marie indicated that the expected values are:

software tool name and version : Proline 2.2
search engine name and version : Mascot 2.8.3
peaklist generation software tool name and version : MGFBoost 3.6.3 (with eclean) # we don't support that yet
FDR threshold for PSM, peptide and protein level : 1% PSM 
match between run (or not) : yes or no (depends on the file)
precursor mass tolerance : 10ppm
enzyme (although for these data it should be Trypsin) : Trypsin/P
number of missed-cleavages : 2
minimum and maximum peptide length : 7
fixed and variable modifications : cysteine CAM as fixed, methionin oxidation and Protein Nter acetylation as variable
maximum number of modifications : 7
minimum and maximum precursor charge : 2-5

But I don't know where to find these values. I'll ask her Monday.

We don't support: peaklist generation software tool name and version anywhere

We definitely still have some blanks: https://github.com/Proteobench/ProteoBench/blob/fb4d835c43dcdc26d4870c24c9fb34005d5551ba/test/params/ProlineStudio_withMBR.csv

Thanks. I'll discuss these "blanks" with Anne-Marie. I think that there are too many and it comes from missing information in their outputs.

Henry added 3 commits July 12, 2024 14:42
@enryH enryH marked this pull request as ready for review July 12, 2024 12:55
@enryH enryH merged commit f8597c5 into main Jul 12, 2024
8 checks passed
@enryH enryH deleted the proline_studio branch July 12, 2024 13:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants