Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Updated Annoy Tutorial with Text8 #1355

Merged
merged 2 commits into from
May 23, 2017
Merged

Updated Annoy Tutorial with Text8 #1355

merged 2 commits into from
May 23, 2017

Conversation

pmbaumgartner
Copy link
Contributor

Updates:

  • Runs in python 3
  • Added required packages (in addition to gensim) to top of notebook
  • Cleaned up some descriptive language throughout the notebook.
  • Used Text8 Corpus rather than Lee corpus, since I wasn't seeing an improvement using the Lee corpus. My hypothesis is that it's too small to see any benefit from approximating.
  • Added watermark info for benchmarking comparisons
  • Reorganized and restructured notebook
    • Added an outline according to the logic of the notebook
    • separated notebook cells for import, function definition, and executing code
    • Added optional logging
    • Combined num_trees performance analysis into one loop & plots into one figure and upped num_trees to 300

@tmylk

@tmylk
Copy link
Contributor

tmylk commented May 22, 2017

Thanks a lot! Let's keep the old name and replace the Lee corpus ipynb with this time.

@pmbaumgartner
Copy link
Contributor Author

Okay, should be good now, just renamed it.

@tmylk tmylk merged commit b5131ab into piskvorky:develop May 23, 2017
@tmylk
Copy link
Contributor

tmylk commented May 23, 2017

awesome thx

@piskvorky
Copy link
Owner

Beautiful!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants