This is a fork of LaTeXML focusing on TEI transformation, in particular directed to the TEI customization produced by Grobid and Pub2TEI. The goal is to use the same XML representation from a variety of sources and formats, with minimal information loss. Here, we focus on the LaTeX sources available on arXiv (around 2M LaTeX sources). See How GROBID works for an overview of the target ingestion process.
LaTeXML is a TeX & LaTeX to XML, HTML, MathML, ePub, JATS, ... converter.
See the included Manual for documentation.
The official project home page is at http://dlmf.nist.gov/LaTeXML/.
LaTeXML development is currently hosted on GitHub, where you can retrieve and browse the current source, along with an Issue tracker and Wiki.
For general discussion feel free to join the mailing list.
See the LICENSE file for copyright and licensing information.
Bruce R. Miller, mailto:bruce.miller@nist.gov, Deyan Ginev, mailto:deyan.ginev@gmail.com.