Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Custom data uploaded, simple extractors work, but use of more complex extractors unclear #271

Open
cslovell opened this issue Nov 2, 2018 · 0 comments

Comments

@cslovell
Copy link

cslovell commented Nov 2, 2018

Hi,

We've uploaded some custom data extracted from a set of webpages, attached below as a zipped .jl file.

output.zip

We've loaded this into DIG and added a date extractor along with title and descriptions. See below.

currentkg

We'd like to add or train custom extractors for things like cities, countries, and perhaps topics. We understand that we can do this through custom ETK modules, glossaries, and the spacy rules editor (which was used to derive the dates). However, our list of cities or countries failed, and we're stuck about how to develop and use ETK modules.

Would it be possible to provide an example of how we can add city extractors, country extractors and perhaps some custom things for topic? Also, how could we ingest data from excel files and tabular data? This would be immensely useful to us.

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant