Task Dataset Metric Software Extraction

This program produces the test data for classification over a set of predefined task#dataset#metrics#software labels.

Given input a pdf file, it scrapes the text from the filw using the Grobid parser, subsequently generating the test data file for input to the neural network classifier.

Steps to run the program

Clone this repository (https://github.com/jenlindadsouza/task-dataset-metric-extraction)
The program jar file can be found at https://github.com/jenlindadsouza/task-dataset-metric-extraction/tree/master/build/libs
- It is called tdm-1.0.jar
Usage: java -jar tdm-1.0.jar <pdf-file-path> <resources-dir> Where the pdf-file-path is the input pdf article to extract task, dataset, metric, software elements from and resources-dir should point to the directory at https://github.com/jenlindadsouza/task-dataset-metric-extraction/tree/master/src/main/resources
To run directly from the source code, the build file will have to be updated for the local dependencies for jar files. See here: https://github.com/jenlindadsouza/task-dataset-metric-extraction/blob/master/build.gradle
Either the jars can be obtained from source. For convenience, they are also made available here https://drive.google.com/drive/folders/1ax7ah8AInoD-_7amx27VKaSrFV73ggEk?usp=sharing

Acknowledgement:

This program reuses code modules from IBM's science-result-extractor (https://github.com/IBM/science-result-extractor). A reference url to their paper on the ACL anthology is https://www.aclweb.org/anthology/P19-1513

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
build/libs		build/libs
src/main		src/main
README.md		README.md
build.gradle		build.gradle
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Task Dataset Metric Software Extraction

Steps to run the program

Acknowledgement:

About

Releases

Packages

Languages

jd-coderepos/task-dataset-metric-extraction

Folders and files

Latest commit

History

Repository files navigation

Task Dataset Metric Software Extraction

Steps to run the program

Acknowledgement:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages