doc2vec-lee.ipynb results ... not even close #1088

johncleveland · 2017-01-12T21:46:27Z

For this github tutorial: gensim/docs/notebooks/doc2vec-lee.ipynb
I have copied the code verbatim and have been unable to reproduce anything near the 95% rate.
collections.Counter(ranks) #96% accuracy
Counter({0: 292, 1: 8})

I have used Python 2.7.12, 2.7.13, and 3.5 on both Windows 10 and Ubuntu 16.10.
I have also had a friend try it on his Windows system. My results are all over the place.
What could possibly be the problem? I am just copy-pasting.
Thanks

gojomo · 2017-01-12T23:46:58Z

Open-ended questions/discussion that are not bug-reports or feature-requests should go to the project discussion list at https://groups.google.com/forum/#!forum/gensim rather than this issues-tracker.

So please post your question there. (When you do so, it'd be helpful to make clear whether you've tried running the code in a Jupyter notebook itself and had the same problem, and what gensim version you're using, and what exact results or logged output you are seeing rather than what you expect.)

piskvorky · 2017-01-13T01:39:42Z

Looks like a (slightly incomplete) bug report to me.

gojomo · 2017-01-16T21:16:42Z

Reopening, as it does seem that our updating of Doc2Vec defaults made the examples in this notebook less effective and stable - see discussion thread at https://groups.google.com/d/msg/gensim/bs77ke1Zun0/9lrMo_w0CAAJ

I believe raising `iter` to 50 restores the intent of the example, without changing other defaults. Some text could be added alongside the related cells to the effect of: (1) small datasets with short documents can benefit from more training passes; (2) checking an inferred vector against a training vector is a sort of 'sanity check' as to whether the model is behaving in a usefully consistent manner, though not a real 'accuracy' value.
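As a rough illustration of the sanity check in point (2), here is a minimal sketch that does not depend on gensim. It assumes you already have one trained vector and one inferred vector per document (in the notebook, the inferred vectors would come from `model.infer_vector(...)`). The helper names `cosine` and `self_similarity_ranks` and the toy vectors are hypothetical, made up for this example only.

```python
import collections
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def self_similarity_ranks(trained_vectors, inferred_vectors):
    """For each document, sort all trained vectors by similarity to that
    document's inferred vector and return the position of the document's
    own trained vector (0 means it was the closest match)."""
    ranks = []
    for doc_id, inferred in enumerate(inferred_vectors):
        order = sorted(
            range(len(trained_vectors)),
            key=lambda other: cosine(inferred, trained_vectors[other]),
            reverse=True,
        )
        ranks.append(order.index(doc_id))
    return ranks


# Toy stand-ins (hypothetical data, not from the Lee corpus): in the notebook
# these would be the model's trained document vectors and re-inferred vectors.
trained = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
inferred = [[0.9, 0.1], [0.1, 0.9], [1.0, 0.2]]

ranks = self_similarity_ranks(trained, inferred)
counts = collections.Counter(ranks)

# Share of documents whose own trained vector is the top match. This is a
# consistency check on the model, not a real accuracy figure.
self_match_rate = counts[0] / len(ranks)
print(counts, self_match_rate)
```

A count concentrated at rank 0 suggests the model maps a document's text back near its own trained vector. With too few training passes on a small corpus, the ranks tend to scatter, which matches the "all over the place" results reported above.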

Thanks, @johncleveland, for catching and reporting this!

ELind77 · 2017-01-28T03:47:55Z

This may not be the right place for this, but if this relates to the original paragraph vectors paper, I believe there have been some serious problems with the reproducibility of those findings. In "Ensemble of Generative and Discriminative Techniques for Sentiment Analysis of Movie Reviews", Mikolov even has a footnote explaining that the results were not reproducible.

gojomo · 2017-01-28T08:36:41Z

Yes, you can find posts across the net in a bunch of places from people who've been frustrated trying to reproduce the PV paper's error rates on the same original datasets, and a few comments by Mikolov (like that footnote) implying Le made a mistake in result-reporting.

Here, it's just a matter of our demo, on a different, much smaller dataset, not behaving the same across some other code changes.

gojomo closed this as completed Jan 12, 2017

gojomo reopened this Jan 16, 2017

tmylk added the documentation and difficulty easy labels Jan 25, 2017

bahbbc mentioned this issue Jan 29, 2017

Fix doc2vec-lee.ipynb results to match previous behavior #1119

Merged

johncleveland closed this as completed Jul 27, 2017
