bed-style format methylation file

nanopore-methylation-utilities

Set of utilities for analyzing nanopore methylation data from the Timp Lab

Bed-Style format methylation File
BAM conversion for Methylation Viewing in IGV
Citation

bed-style format methylation file

I convert the nanopolish methylation calling output into bed-style format, such that each line is

Contig	Start	End	Read name	Methylation call string	Log-likelihood ratios	Motif context

where Methylation call string is arranged such that

numbers are separated by methylation calls
each number is cumulative distance from the "start"
methylation call corresponds to the motif at position preceding the letter
"m" means methylated, "u" means unmethylated, and "x" means uncalled (not confident)

The resulting bed-style file is sorted, bgzipped, and tabix indexed for easy manipulation.

./mtsv2bedGraph.py -i [path/to/nanopolish/methylation.tsv] |\
  sort -k1,1 -k2,2n | bgzip > [methylation.bed.gz]
tabix -p bed [methylation.bed.gz]

converting bam for igv

Using the converted bed-style methylation file, the original bam file can be "bisulfite converted in silico" for easy visualization on IGV via their bisulfite mode. There are three options for specifying the region to convert:

-r,--regions : for multiple regions, supply the bed file
-w,--window : for one region, supply the coordinate (chr:start-end)
without either of the above options, all reads will be converted

./convert_bam_for_methylation.py -b [path/to/sorted.bam] \
  -c [path/to/cpg.methylation.bed.gz] -f [path/to/reference.fasta ] |\
  samtools sort -o [path/to/converted.bam]
samtools index [path/to/cnverted.bam]

For minimap2 alignments

Using --MD option during alignment is recommended.

The default output does not have MD tags, and MD tags are necessary for using pysam to get the reference sequence. To get around this, the fasta of reference genome must be supplied via -f,--fasta.

Citation

If you use this package in your work, please cite:

Lee, I. et al. (2019). Simultaneous profiling of chromatin accessibility and methylation on human cell lines with nanopore sequencing. bioRxiv. doi:10.1101/504993v2

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
test		test
.gitignore		.gitignore
README.md		README.md
convert_bam_for_methylation.py		convert_bam_for_methylation.py
convert_bam_for_methylation_cpggpc.py		convert_bam_for_methylation_cpggpc.py
extract_mbed_by_qname.py		extract_mbed_by_qname.py
megalodon_mcalls_to_bedGraph.py		megalodon_mcalls_to_bedGraph.py
methylation_R_utils.R		methylation_R_utils.R
methylbed_utils.py		methylbed_utils.py
mtsv2bedGraph.py		mtsv2bedGraph.py
mtsv2bedGraph_upperlower.py		mtsv2bedGraph_upperlower.py
parseMethylbed.py		parseMethylbed.py
split_bed_by_haplotype.py		split_bed_by_haplotype.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nanopore-methylation-utilities

bed-style format methylation file

converting bam for igv

For minimap2 alignments

Citation

About

Releases

Packages

Contributors 2

Languages

timplab/nanopore-methylation-utilities

Folders and files

Latest commit

History

Repository files navigation

nanopore-methylation-utilities

bed-style format methylation file

converting bam for igv

For minimap2 alignments

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages