Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extract keywords from multiple documents given a nested list of candidates for each document. #215

Open
kevinmt24 opened this issue Mar 14, 2024 · 1 comment

Comments

@kevinmt24
Copy link

It's mentioned extracting keywords from multiple documents is faster, is it possible to do this along with candidates. If I extract candidates using Yake for multiple documents, I cannot add a list of candidates list to .extract_keywords(). Hence I would extract keywords for each document in a loop, while adding the candidates list for doc.

@MaartenGr
Copy link
Owner

You can use the candidates parameter in .extract_keywords to pass a list of candidate keywords. This should still speed up the calculation. Do note though that this does not accept a nested list so you will have to flatten the list.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants