ntunlplab/ContributionSum

ContributionSum

The ContributionSum Dataset

Dataset Format

contribution.json

Each paper is represented as a key-value pair, where the key is its arXiv ID and the value contains the following fields:

  • structure: list of section titles (and their subsection titles, if present)
  • title: title of the paper
  • abstract: abstract of the paper
  • prompt: detected contribution prompt
  • extracted_contribution: list of author-written contributions
  • contribution_types: types of extracted contributions based on our annotation scheme: 0 - Approach, 1 - Result, 2 - Analysis, 3 - Topic or Resource
  • SECTION_TITLE: full text from a specific section
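
As a rough illustration of the format above, the following Python sketch loads contribution.json and reads the documented fields. The helper names are hypothetical, and two points are assumptions rather than guarantees of the dataset: that contribution_types is a list parallel to extracted_contribution, and that every key other than the documented ones is a section title.

```python
import json

# Label names taken from the annotation scheme listed above.
CONTRIBUTION_TYPE_NAMES = {
    0: "Approach",
    1: "Result",
    2: "Analysis",
    3: "Topic or Resource",
}

# Keys documented above. Assumption: any other key is a SECTION_TITLE entry.
METADATA_KEYS = {
    "structure",
    "title",
    "abstract",
    "prompt",
    "extracted_contribution",
    "contribution_types",
}


def load_contributions(path):
    """Load contribution.json into a dict keyed by arXiv ID."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def labeled_contributions(paper):
    """Pair each contribution with its type name.

    Assumption: the two lists are parallel (one type per contribution).
    int() accepts type ids stored either as integers or as strings.
    """
    return [
        (text, CONTRIBUTION_TYPE_NAMES[int(type_id)])
        for text, type_id in zip(
            paper["extracted_contribution"], paper["contribution_types"]
        )
    ]


def section_texts(paper):
    """Return only the section-title -> full-text entries of a paper."""
    return {key: value for key, value in paper.items() if key not in METADATA_KEYS}
```

For example, `labeled_contributions(load_contributions("contribution.json")[arxiv_id])` would list one paper's contributions with readable type names, where `arxiv_id` is any key of the loaded dictionary.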

split.json

Contains the train, validation, and test splits.
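
The layout of split.json is not described here, so the sketch below rests on an assumption: that the file maps each split name to a list of arXiv IDs. The function names and the split key passed as `name` (for instance "train") are hypothetical and should be checked against the actual file.

```python
import json


def load_split(path):
    """Load split.json (assumed layout: {split_name: [arxiv_id, ...]})."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def select_papers(contributions, split, name):
    """Return the papers from contribution.json that belong to one split.

    Assumption: split[name] is a list of arXiv IDs. IDs that do not appear
    in contributions are skipped rather than raising an error.
    """
    return {
        arxiv_id: contributions[arxiv_id]
        for arxiv_id in split[name]
        if arxiv_id in contributions
    }
```

If the file turns out to use a different layout (for example, a mapping from arXiv ID to split name), only `select_papers` would need to change.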

Shared drive

https://drive.google.com/drive/folders/1p51KpJWBB-hV4PlF8Ms8qFXn1lhhKRKV?usp=drive_link
