Lev Konstantinovskiy edited this page Sep 25, 2016 · 6 revisions

May - July 2016:

  • Beginner doc2vec tutorial. See tutorial
  • Intermediate word2vec and doc2vec tutorials based on common mailing list questions. Fixed via code warnings.
  • Visualisation for comparing two LDA models, based on pyLDAvis. Maybe rewrite in Bokeh so we can keep developing in Python.

Jul - Nov 2016:

  • Update website on radimrehurek.com.
  • Running Gensim on Spark or Hadoop.
  • Reading the output of a Spark MLlib LDA model into gensim.
  • GPU integration. Keras integration. Evaluation of word2veckeras.
  • SpaCy collaboration: sparsetools, common API.
  • Sklearn integration (see how Keras does its sklearn wrappers; add module to the sklearn_contrib GitHub repo).
  • GPU part 2. TensorFlow SVD for LSI, word2vec and glove integration. Replacement of existing word2vec code.