HERD

HERD (Hajen Entity Recognizer and Disambiguator) is a tool for automatically recognizing names in text (entity recognition) and specifying who is meant (disambiguation).

It is written in Java, depends on Solr Text Tagger by David Smiley, and draws much of its inspiration from the Tulip project by Marek Lipczak et al.

The code will not run as is. It contains hard-coded paths to directories on my machine, and Wikipedia needs to be processed a couple of times to generate the files those paths point to. It may still be interesting to read, though.
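
For readers who want a feel for the two steps before digging into the source, here is a minimal, self-contained sketch of recognition followed by disambiguation. It is not HERD's code and does not use Solr Text Tagger: the class name `ToyEntityLinker`, the tiny dictionary, the entity identifiers, and the prior scores are all made up for illustration. Recognition is a plain dictionary lookup per token, and disambiguation simply picks the candidate with the highest assumed prior.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative only: a toy linker, not HERD's implementation.
class ToyEntityLinker {

    // Hypothetical dictionary: surface form -> (candidate entity -> assumed prior).
    static final Map<String, Map<String, Double>> DICTIONARY = Map.of(
            "Paris", Map.of("Paris_(France)", 0.9, "Paris_Hilton", 0.1),
            "Lund", Map.of("Lund_(Sweden)", 1.0));

    // Returns mention -> chosen entity, in order of first appearance in the text.
    static Map<String, String> link(String text) {
        Map<String, String> links = new LinkedHashMap<>();
        for (String token : text.split("\\W+")) {
            // Recognition: is this token a known name?
            Map<String, Double> candidates = DICTIONARY.get(token);
            if (candidates == null) {
                continue;
            }
            // Disambiguation: keep the candidate with the highest prior.
            String best = Collections.max(
                    candidates.entrySet(), Map.Entry.comparingByValue()).getKey();
            links.put(token, best);
        }
        return links;
    }

    public static void main(String[] args) {
        link("She moved from Lund to Paris.")
                .forEach((mention, entity) -> System.out.println(mention + " -> " + entity));
    }
}
```

A real system would match multi-word names and use the surrounding context to choose between candidates; the fixed priors here only stand in for that step.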
