Creating the SameAs DBpedia Index

F. Conrads edited this page May 8, 2019 · 5 revisions

Gerbil can use a Lucene-based sameAs index built from a DBpedia dump. Looking up sameAs relations in this index is much faster than the usual HTTP retrieval from the official DBpedia SPARQL endpoint.

To create and use the index, follow the steps below. Be aware that this procedure uses a lot of disk space.

  1. Clone the repo.
  2. Execute index.sh to create the index (this may take a while).
  3. Move the files located in lucene_index to indexes/dbpedia in your gerbil folder.
  4. Change your gerbil.properties accordingly.
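Step 3 can also be scripted. The following is a minimal Python sketch, not part of the Gerbil tooling: the function name `install_index` and both path arguments are hypothetical, and it assumes that index.sh has already finished writing to lucene_index.

```python
import shutil
from pathlib import Path


def install_index(lucene_index: Path, gerbil_dir: Path) -> Path:
    """Move every entry of lucene_index into <gerbil_dir>/indexes/dbpedia.

    Both paths are supplied by the caller; nothing here is read from
    gerbil.properties.
    """
    target = gerbil_dir / "indexes" / "dbpedia"
    # Create indexes/dbpedia (and any missing parents) if needed.
    target.mkdir(parents=True, exist_ok=True)
    # Materialize the listing first so moving does not disturb iteration.
    for entry in sorted(lucene_index.iterdir()):
        shutil.move(str(entry), str(target / entry.name))
    return target
```

A call such as `install_index(Path("lucene_index"), Path("/path/to/gerbil"))` would then leave the index files where the configuration described on this page expects them, provided the gerbil folder is the one your data path points to.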

If gerbil.properties does not already contain these lines, add them:

org.aksw.gerbil.semantic.sameas.impl.index.IndexBasedSameAsRetriever.domain=dbpedia.org
org.aksw.gerbil.semantic.sameas.impl.index.IndexBasedSameAsRetriever.folder=${org.aksw.gerbil.DataPath}/indexes/dbpedia
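The "add only if missing" check can be automated. The sketch below is an illustration, not Gerbil code: `ensure_properties` and `REQUIRED` are hypothetical names. It also assumes that every entry in the file uses the plain `key=value` form, which is narrower than what the Java properties format allows.

```python
from pathlib import Path

# The two entries listed above; the values are copied verbatim.
REQUIRED = {
    "org.aksw.gerbil.semantic.sameas.impl.index.IndexBasedSameAsRetriever.domain":
        "dbpedia.org",
    "org.aksw.gerbil.semantic.sameas.impl.index.IndexBasedSameAsRetriever.folder":
        "${org.aksw.gerbil.DataPath}/indexes/dbpedia",
}


def ensure_properties(path: Path, required: dict = REQUIRED) -> list:
    """Append each missing key=value line and return the keys that were added."""
    text = path.read_text(encoding="utf-8") if path.exists() else ""
    present = set()
    for line in text.splitlines():
        stripped = line.strip()
        # Skip blank lines and comments; only the key=value form is recognized.
        if stripped and not stripped.startswith(("#", "!")) and "=" in stripped:
            present.add(stripped.split("=", 1)[0].strip())
    missing = [key for key in required if key not in present]
    if missing:
        with path.open("a", encoding="utf-8") as fh:
            # Make sure the appended entries start on their own line.
            if text and not text.endswith("\n"):
                fh.write("\n")
            for key in missing:
                fh.write(f"{key}={required[key]}\n")
    return missing
```

Running `ensure_properties(Path("gerbil.properties"))` a second time adds nothing, so the check is safe to repeat. Keys that already exist are left untouched even if their values differ, so an existing entry with a different folder still has to be reviewed by hand.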

That's it.