Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Suggestion - not an issue as such #2

Open
lulu-2021 opened this issue Nov 24, 2015 · 0 comments
Open

Suggestion - not an issue as such #2

lulu-2021 opened this issue Nov 24, 2015 · 0 comments

Comments

@lulu-2021
Copy link

I am working with very large volumes of data and since the IDF has to relate to the whole document space - it is difficult with your version to load the entire document space.. Based on your concepts I have re-worked the code to use streams and lazy eval for a large corpus and also OTP based parallelisation.. Once i am happy with the code test wise I am happy to submit a PR - if you think this would add value to your version - otherwise I will build my own :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant