Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pepXML modifications are offset by one #100

Closed
nnalpas opened this issue Oct 3, 2024 · 1 comment · Fixed by #104
Closed

pepXML modifications are offset by one #100

nnalpas opened this issue Oct 3, 2024 · 1 comment · Fixed by #104
Labels
bug Something isn't working
Milestone

Comments

@nnalpas
Copy link

nnalpas commented Oct 3, 2024

Hi,
I think there is an issue in parsing peptidoform for pepXML file.

in this peptide hit exemple :

<search_hit peptide="AHTMVHDQVSR" massdiff="-6.103515625E-4" calc_neutral_pep_mass="1295.604" peptide_next_aa="F" num_missed_cleavages="0" num_tol_term="2" protein_descr="gene=gltA;locus_tag=19A2747_02138;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P14165;product=Citrate synthase" num_tot_proteins="1" tot_num_ions="20" hit_rank="1" num_matched_ions="6" protein="19A2747_02138_gene" peptide_prev_aa="R" is_rejected="0">
<modification_info modified_peptide="AHTM[147.0354]VHDQVSR">
<mod_aminoacid_mass mass="147.0354" position="4"/>
</modification_info>
<search_score name="hyperscore" value="15.15"/>
<search_score name="nextscore" value="0.0"/>
<search_score name="expect" value="3.868121e-04"/>
</search_hit>

the psm_utils.io.read_file command returns:

AHTMV[+147.0354]HDQVSR/3

The oxidation(M) on position 4 is offset to position 5.

This might be due to the modification parsing occuring in the function "_parse_peptidoform"; specifically the line
sequence = [(aa, modifications_dict[i] or None) for i, aa in enumerate(peptide)]
I could be wrong but I think, this should be:
sequence = [(aa, modifications_dict[i+1] or None) for i, aa in enumerate(peptide)]

I hope this helps.
Thanks,

@RalfG RalfG closed this as completed in 8fd1299 Oct 31, 2024
RalfG added a commit that referenced this issue Oct 31, 2024
Fix peptidoform modification offset and mass (fixes #100)
@RalfG
Copy link
Member

RalfG commented Oct 31, 2024

Hi @nnalpas,

Thanks for letting us know! This will be fixed in the next version.

Best,
Ralf

@RalfG RalfG added the bug Something isn't working label Oct 31, 2024
@RalfG RalfG added this to the v1.2.0 milestone Oct 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants