Chopping long reference names with --split-prefix #394

Closed
aroelo opened this issue May 8, 2019 · 3 comments
Comments

aroelo commented May 8, 2019

When running minimap2 with the --split-prefix option, I found that long reference names are randomly truncated in the header of the resulting SAM file.

This is a problem because the names are sometimes cut off after only the first character, which makes it impossible to tell which reference a read was actually mapped to. The problem does not occur when --split-prefix is left out.

For me it would be great to be able to use the long reference names, since I want to include a species description in the name.

I have attached an example reference, reads, and the resulting SAM file below.
The SAM file was created with minimap2 version 2.14-r883:
minimap2 -ax map-ont long_ref_headers.fa seq_C11737035 --split-prefix string > long_headers.sam
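
For anyone who wants to check their own output for the same symptom, here is a minimal sketch that compares the @SQ SN names in a SAM header against the names in the reference FASTA. It assumes the reference name is the text after ">" up to the first whitespace, and the helper names (fasta_names, sam_header_names, truncated_names) are made up for this illustration; they are not part of minimap2.

```python
import sys


def fasta_names(lines):
    """Return reference names: text after '>' up to the first whitespace (assumed convention)."""
    names = []
    for line in lines:
        if line.startswith(">"):
            fields = line[1:].split()
            names.append(fields[0] if fields else "")
    return names


def sam_header_names(lines):
    """Return the SN values of all @SQ lines in the SAM header."""
    names = []
    for line in lines:
        if not line.startswith("@"):
            break  # the header ends at the first alignment record
        if line.startswith("@SQ"):
            for field in line.rstrip("\n").split("\t")[1:]:
                if field.startswith("SN:"):
                    names.append(field[3:])
    return names


def truncated_names(fasta_lines, sam_lines):
    """List header names missing from the FASTA, with the FASTA names they are a prefix of."""
    expected = fasta_names(fasta_lines)
    expected_set = set(expected)
    bad = []
    for name in sam_header_names(sam_lines):
        if name not in expected_set:
            candidates = [e for e in expected if e.startswith(name)]
            bad.append((name, candidates))
    return bad


if __name__ == "__main__" and len(sys.argv) == 3:
    # Hypothetical usage: python check_header.py long_ref_headers.fa long_headers.sam
    with open(sys.argv[1]) as fa, open(sys.argv[2]) as sam:
        for name, candidates in truncated_names(fa, sam):
            print(f"{name!r} is not a reference name; possible originals: {candidates}")
```

If the header is intact the script prints nothing. A header name that is only a prefix of a FASTA name is reported together with the full names it could have come from.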

long_ref_headers
seq_C11737035
long_headers

Owner

lh3 commented May 10, 2019

Do you have test data?

Author

aroelo commented May 10, 2019

Yes, please see the example files attached.

Owner

lh3 commented May 10, 2019

Thanks! I overlooked that.

lh3 added this to the 2.18 milestone on May 10, 2019