Parser for corpus_pubtator.txt #10

izuna385 · 2021-02-21T06:13:24Z

Hi, I just wrote parser for pubtator-fomat based annotation, specifically for this dataset and others.
https://github.com/izuna385/PubTator-Multiprocess-Parser

I hope you find it useful.

gpiat · 2021-03-08T13:39:14Z

Hello,
I thought this would be a good place to mention that there are a number of projects that help with loading PubTator-format files.

The oldest package I could find that handles PubTator is PubTator2Anndoc, but its scope is very limited.

Perhaps the first packages to implement multipurpose PubTator support were Kindred and bconv.

I've personally been working with MedMentions for over a year now and recently released the code I've been using to parse it on the python package index as well. It's called pubtatortool and can be found here.

Shortly thereafter, pubtator-loader and pubtator2dataset were released as well.

It may be good for anyone trying to decide which package to use to make some kind of table of supported features and post it in this thread.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parser for corpus_pubtator.txt #10

Parser for corpus_pubtator.txt #10

izuna385 commented Feb 21, 2021

gpiat commented Mar 8, 2021

Parser for corpus_pubtator.txt #10

Parser for corpus_pubtator.txt #10

Comments

izuna385 commented Feb 21, 2021

gpiat commented Mar 8, 2021