AxCell: Automatic Extraction of Results from Machine Learning Papers

This repository is the official implementation of AxCell: Automatic Extraction of Results from Machine Learning Papers.

Requirements

To create a conda environment named axcell and install requirements run:

conda env create -f environment.yml

Additionally, axcell requires docker (that can be run without sudo). Run scripts/pull_docker_images.sh to download necessary images.

Datasets

We publish the following datasets:

See datasets notebook for an example of how to load the datasets provided below. The extraction notebook shows how to use axcell to extract text and tables from papers.

Evaluation

See the evaluation notebook for the full example on how to evaluate AxCell on the PWCLeaderboards dataset.

Training

pre-training language model on the ArxivPapers dataset
table type classifier and table segmentation on the SegmentedResults dataset

Pre-trained Models

You can download pretrained models here:

axcell — an archive containing the taxonomy, abbreviations, table type classifier and table segmentation model. See the results-extraction notebook for an example of how to load and run the models
language model — ULMFiT language model pretrained on the ArxivPapers dataset

Results

AxCell achieves the following performance:

Dataset	Macro F1	Micro F1
PWC Leaderboards	21.1	28.7
NLP-TDMS	19.7	25.8

License

AxCell is released under the Apache 2.0 license.

Citation

The pipeline is described in the following paper:

@inproceedings{axcell,
    title={AxCell: Automatic Extraction of Results from Machine Learning Papers},
    author={Marcin Kardas and Piotr Czapla and Pontus Stenetorp and Sebastian Ruder and Sebastian Riedel and Ross Taylor and Robert Stojnic},
    year={2020},
    booktitle={2004.14356}
}

Name		Name	Last commit message	Last commit date
Latest commit History 187 Commits
axcell		axcell
notebooks		notebooks
scripts		scripts
test		test
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
docker-compose.yml		docker-compose.yml
environment.yml		environment.yml
pytest.ini		pytest.ini
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AxCell: Automatic Extraction of Results from Machine Learning Papers

Requirements

Datasets

Evaluation

Training

Pre-trained Models

Results

License

Citation

About

Releases 1

Packages

Contributors 6

Languages

License

paperswithcode/axcell

Folders and files

Latest commit

History

Repository files navigation

AxCell: Automatic Extraction of Results from Machine Learning Papers

Requirements

Datasets

Evaluation

Training

Pre-trained Models

Results

License

Citation

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 6

Languages

Packages