Releases: explosion/sense2vec

v1.0.0a7

19 Nov 15:45
Pre-release
Increment version

v1.0.0a6

03 Nov 17:17
Pre-release
Increment version [ci skip]

v1.0.0a5

02 Nov 16:46
Pre-release
Increment version [ci skip]

v1.0.0a4

02 Nov 16:45
Pre-release
Update version [ci skip]

v1.0.0a3

02 Nov 16:45
Pre-release
Update README.md [ci skip]

v1.0.0a2: Refactor and modernize, spaCy v2.2 support, more features, Prodigy recipes

31 Oct 21:17
d11dbef

⚠️ This is an alpha release and not yet ready for production. You can install sense2vec via pip by specifying the exact version.

pip install sense2vec==1.0.0a2
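
The exact pin matters because pip skips pre-releases by default unless the requirement names one explicitly or `--pre` is passed. As a rough illustration, the sketch below checks whether a version string carries an alpha, beta, or release-candidate suffix. The helper name `is_prerelease` is hypothetical, and the pattern is a simplified subset of PEP 440 (it ignores `.dev`, `.post`, and epoch segments), so treat it as an approximation rather than how pip itself decides.

```python
import re

# Simplified pattern: dotted numeric release followed by aN, bN, or rcN.
# Assumption: the version is written in normalized form, e.g. "1.0.0a2".
_PRE_RELEASE = re.compile(r"^\d+(\.\d+)*(a|b|rc)\d+$")


def is_prerelease(version: str) -> bool:
    """Return True if the version string ends in an a/b/rc pre-release tag."""
    return _PRE_RELEASE.match(version) is not None


for candidate in ("1.0.0a2", "1.0.0rc1", "1.0.0"):
    label = "pre-release" if is_prerelease(candidate) else "final release"
    print(f"{candidate}: {label}")
```

A plain `pip install sense2vec` would therefore not be expected to pick up an alpha such as 1.0.0a2.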

The converted Reddit vectors (trained on all comments of 2015) are attached to this release as a .tar.gz file. For more details and usage instructions, see the README.


✨ New features and improvements

  • Completely rewrite package from scratch.
  • Replace built-in vector storage with spaCy's Vectors, making this package a pure Python package and allowing easy out-of-the-box serialization of vectors.
  • Add fully serializable spaCy pipeline component and extension attributes.
  • Add new methods get_best_sense and get_other_senses and improve most_similar.
  • Add annotation recipes for Prodigy to easily create word lists and match patterns from similar phrases using sense2vec vectors (like the terms.teach recipe, just with multi-word expressions).
  • New and more efficient training and preprocessing scripts using GloVe.
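
To give a feel for what get_best_sense and get_other_senses do, here is a minimal pure-Python sketch of the idea: keys combine a word and a sense tag as `word|SENSE`, and the "best" sense is the candidate key with the highest frequency count. This is not the library's implementation; the helper names `best_sense` and `other_senses`, the plain dict of frequencies, and the counts themselves are made up for illustration.

```python
def make_key(word, sense):
    """Build a 'word|SENSE' key; spaces in multi-word phrases become underscores."""
    return word.replace(" ", "_") + "|" + sense


def best_sense(word, senses, freqs):
    """Return the existing key for `word` with the highest frequency, or None."""
    candidates = []
    for sense in senses:
        key = make_key(word, sense)
        if key in freqs:
            candidates.append((freqs[key], key))
    return max(candidates)[1] if candidates else None


def other_senses(key, senses, freqs):
    """Return keys for the same word that exist under a different sense tag."""
    word, current = key.rsplit("|", 1)
    return [
        make_key(word, sense)
        for sense in senses
        if sense != current and make_key(word, sense) in freqs
    ]


# Hypothetical frequency table; real counts would come from the trained vectors.
freqs = {"duck|NOUN": 120, "duck|VERB": 15, "natural_language|NOUN": 40}
senses = ["NOUN", "VERB", "ADJ"]

print(best_sense("duck", senses, freqs))
print(other_senses("duck|NOUN", senses, freqs))
```

In the package itself these lookups are methods on the Sense2Vec object and read from its stored vectors and frequencies; see the README for the actual signatures.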

⚠️ Backwards incompatibilities

  • The sense2vec.load method has been removed. Use Sense2Vec.from_disk instead.
  • The previous VectorMap and VectorStorage have been removed.
  • This package now requires Python 3.6+.
  • This update requires a new vectors format (see attached .tar.gz).

📖 Documentation and examples

  • Rewrite README from scratch and include full API docs.

👥 Contributors

Thanks to @kabirkhan for contributing the Prodigy recipes!

v1.0.0a1: Update sense2vec for spaCy v2.1.x or standalone use

12 Sep 14:12

⚠️ This is an alpha release and not yet ready for production. You can install sense2vec via pip by specifying the exact version.

pip install sense2vec==1.0.0a1

Note that the library doesn't depend on spaCy anymore, so you might have to install spaCy and the English model separately. The Reddit vectors (trained on all comments of 2015) are attached to this release as a .tar.gz file. For more details and usage instructions, see the README.


✨ New features and improvements

  • NEW: Remove spaCy dependency and allow standalone use of the sense2vec library.
  • NEW: Include spaCy v2.x pipeline component to add sense2vec-compatible token merging and token attributes and methods.
  • Attach reddit_vectors model to release and make it easier to download and load models.

📖 Documentation and examples

  • Rewrite README from scratch and include full API docs.

🚧 Todo

  • Replace VectorMap implementation with spaCy's Vectors class.
  • Don't merge tokens at runtime and adjust extension attributes accordingly.
  • Update training and pre-processing scripts for spaCy v2.x.
  • Retrain vectors on more data.

v1.0.0a0: Update sense2vec for spaCy v2.x or standalone use

08 Apr 15:31

⚠️ This is an alpha release and not yet ready for production. You can install sense2vec via pip by specifying the exact version.

pip install sense2vec==1.0.0a0

Note that the library doesn't depend on spaCy anymore, so you might have to install spaCy and the English model separately. The Reddit vectors (trained on all comments of 2015) are attached to this release as a .tar.gz file. For more details and usage instructions, see the README.


✨ New features and improvements

  • NEW: Remove spaCy dependency and allow standalone use of the sense2vec library.
  • NEW: Include spaCy v2.x pipeline component to add sense2vec-compatible token merging and token attributes and methods.
  • Attach reddit_vectors model to release and make it easier to download and load models.

📖 Documentation and examples

  • Rewrite README from scratch and include full API docs.

🚧 Todo

  • Update training and pre-processing scripts for spaCy v2.x.