Creating the SameAs DBpedia Index

F. Conrads edited this page May 14, 2019 · 5 revisions

Gerbil can use a Lucene-based sameAs index built from a DBpedia dump. Looking up sameAs relations in this index is much faster than the usual HTTP retrieval from the official DBpedia SPARQL endpoint.

To create and use the index, follow the steps below. The whole process may take up to 20 hours, depending on your download speed.

  1. Clone the repo.
  2. Execute index.sh to create the index. This may take a while; if you have a lot of RAM, you can use -ram to speed up the process.
  3. Move the files located in lucene_index to indexes/dbpedia in your Gerbil folder.
  4. Change your gerbil.properties accordingly.
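Step 3 can be scripted if you prefer not to move the files by hand. The following is a minimal Python sketch, not part of Gerbil itself: the function name `install_index` and both path arguments are hypothetical, and it assumes that lucene_index is a plain directory of index files and that indexes/dbpedia sits directly under the folder you pass as `gerbil_dir`.

```python
import shutil
from pathlib import Path


def install_index(lucene_index_dir, gerbil_dir):
    """Move every entry of lucene_index_dir into <gerbil_dir>/indexes/dbpedia.

    Hypothetical helper: both arguments are paths chosen by the caller.
    Returns the list of destination paths that were created.
    """
    source = Path(lucene_index_dir)
    target = Path(gerbil_dir) / "indexes" / "dbpedia"
    if not source.is_dir():
        raise FileNotFoundError("index directory not found: " + str(source))

    # Create indexes/dbpedia (and any missing parents) if needed.
    target.mkdir(parents=True, exist_ok=True)

    moved = []
    # sorted() materializes the listing before anything is moved.
    for entry in sorted(source.iterdir()):
        destination = target / entry.name
        if destination.exists():
            # Do not silently mix a new index with an old one.
            raise FileExistsError("refusing to overwrite " + str(destination))
        shutil.move(str(entry), str(destination))
        moved.append(destination)
    return moved
```

A call such as `install_index("lucene_index", "/path/to/gerbil")` would then leave lucene_index empty and the index files under indexes/dbpedia. The sketch stops instead of overwriting, so remove an older index first if you are rebuilding.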

If the gerbil.properties file does not already contain them, add these lines:

org.aksw.gerbil.semantic.sameas.impl.index.IndexBasedSameAsRetriever.domain=dbpedia.org
org.aksw.gerbil.semantic.sameas.impl.index.IndexBasedSameAsRetriever.folder=${org.aksw.gerbil.DataPath}/indexes/dbpedia
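The "add these lines if missing" check can also be automated. The sketch below is one possible way to do it in Python and is not part of Gerbil: the function name `ensure_sameas_properties` is hypothetical, and it assumes the file uses the simple `key=value` form shown above (other separators allowed by Java properties files are not handled). It only appends keys that are absent and never changes existing values.

```python
from pathlib import Path

PREFIX = "org.aksw.gerbil.semantic.sameas.impl.index.IndexBasedSameAsRetriever"

# The two entries listed above, written out verbatim.
REQUIRED = {
    PREFIX + ".domain": "dbpedia.org",
    PREFIX + ".folder": "${org.aksw.gerbil.DataPath}/indexes/dbpedia",
}


def ensure_sameas_properties(properties_path):
    """Append any missing sameAs index entries; return the keys that were added."""
    path = Path(properties_path)
    text = path.read_text(encoding="utf-8") if path.exists() else ""

    # Collect the keys already defined, skipping blank lines and comments.
    present = set()
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith(("#", "!")):
            continue
        present.add(stripped.split("=", 1)[0].strip())

    missing = [key for key in REQUIRED if key not in present]
    if missing:
        with path.open("a", encoding="utf-8") as handle:
            # Make sure the appended entries start on their own line.
            if text and not text.endswith("\n"):
                handle.write("\n")
            for key in missing:
                handle.write(key + "=" + REQUIRED[key] + "\n")
    return missing
```

Running it twice on the same file is safe: the second call finds both keys and returns an empty list without touching the file.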

That's it.