Releases: explosion/sense2vec
v1.0.0a7
v1.0.0a6
v1.0.0a5
v1.0.0a4
v1.0.0a3
v1.0.0a2: Refactor and modernize, spaCy v2.2 support, more features, Prodigy recipes
⚠️ This is an alpha release and not yet ready for production. You can download sense2vec via pip by specifying the exact version.pip install sense2vec==1.0.0a2The converted Reddit vectors (trained on all comments of 2015) are attached to this release as a
.tar.gz
file. For more details and usage instructions, see theREADME
.
✨ New features and improvements
- Completely rewrite package from scratch.
- Replace built-in vector storage with spaCy's
Vectors
, making this package a pure Python package and allowing easy out-of-the-box serialization of vectors. - Add fully serializable spaCy pipeline component and extension attributes.
- Add new methods
get_best_sense
andget_other_senses
and improvemost_similar
. - Add annotation recipes for Prodigy to easily create word lists and match patterns from similar phrases using sense2vec vectors (like the
terms.teach
recipe, just with multi-word expressions). - New and more efficient training and preprocessing scripts using GloVe.
⚠️ Backwards incompatibilities
- The
sense2vec.load
method has been removed. UseSense2Vec.from_disk
instead. - The previous
VectorMap
andVectorStorage
have been removed. - This package now requires Python 3.6+.
- This update requires a new vectors format (see attached
.tar.gz
).
📖 Documentation and examples
- Rewrite
README
from scratch and include full API docs.
👥 Contributors
Thanks to @kabirkhan for contributing the Prodigy recipes!
v1.0.0a1: Update sense2vec for spaCy v2.1.x or standalone use
⚠️ This is an alpha release and not yet ready for production. You can download sense2vec via pip by specifying the exact version.pip install sense2vec==1.0.0a1Note that the library doesn't depend on spaCy anymore, so you might have to install spaCy and the English model separately. The Reddit vectors (trained on all comments of 2015) are attached to this release as a
.tar.gz
file. For more details and usage instructions, see theREADME
.
✨ New features and improvements
- NEW: Remove spaCy dependency and allow standalone use of the
sense2vec
library. - NEW: Include spaCy v2.x pipeline component to add sense2vec-compatible token merging and token attributes and methods.
- Attach
reddit_vectors
model to release and make it easier to download and load in models.
📖 Documentation and examples
- Rewrite
README
from scratch and include full API docs.
🚧 Todo
- Replace
VectorMap
implementation with spaCy'sVectors
class. - Don't merge tokens at runtime and adjust extension attributes accordingly.
- Update training and pre-processing scripts for spaCy v2.x.
- Retrain vectors on more data.
v1.0.0a0: Update sense2vec for spaCy v2.x or standalone use
⚠️ This is an alpha release and not yet ready for production. You can download sense2vec via pip by specifying the exact version.pip install sense2vec==1.0.0a0Note that the library doesn't depend on spaCy anymore, so you might have to install spaCy and the English model separately. The Reddit vectors (trained on all comments of 2015) are attached to this release as a
.tar.gz
file. For more details and usage instructions, see theREADME
.
✨ New features and improvements
- NEW: Remove spaCy dependency and allow standalone use of the
sense2vec
library. - NEW: Include spaCy v2.x pipeline component to add sense2vec-compatible token merging and token attributes and methods.
- Attach
reddit_vectors
model to release and make it easier to download and load in models.
📖 Documentation and examples
- Rewrite
README
from scratch and include full API docs.
🚧 Todo
- Update training and pre-processing scripts for spaCy v2.x.