Releases: GRAAL-Research/deepparse
Releases · GRAAL-Research/deepparse
0.9.3
- Improve error handling.
- Bug-fix FastText error not handled in test API.
- Add feature to allow new_prediction_tags to retrain CLI.
0.9.2
- Improve Deepparse server error handling and error output
- Remove deprecated argument saving_dir in download_fasttext_magnitude_embeddings and download_fasttext_embeddings functions
- Add offline argument to remove verification of the latest version
- Bug-fix cache handling in download model
- Add download_models CLI function
- #156
0.9.1
Hotfix cli.download_model
attention model bug
0.9
- Add
save_model_weights
method toAddressParser
to save model weights (PyTorch state dictionary) - Improve CI
- Added verbose flag for the test to activate or deactivate the test verbosity (it overrides the AddressParser verbosity)
- Add Docker image
- Add
val_dataset
to retrain API to allow the use of a specific val dataset for training - Remove deprecated
download_from_url
function - Remove deprecated
dataset_container
argument - Fixed error and docs
- Added the UK retrain example
Create Zenodo DOI
Create Zenodo DOI
0.8.2
- Bug-fix retrain attention model naming parsing
- Improve error handling when not a DatasetContainer is used in retrain and test API
- Add DOI
0.8.1
- Refactored function
download_from_url
todownload_from_public_repository
. - Add error management when retrain a FastText like model on Windows with a number of workers (
num_workers
) greater than 0. - Improve dev tooling
- Improve CI
- Improve code coverage and pylint
- Add Codacy
0.8
- Improve SEO.
- Add cache_dir arg in all CLI functions.
- Improve handling of HTTP error in models version verification.
- Improve doc.
- Add a note for parsing data cleaning (i.e. lowercase, commas removal, and hyphen replacing).
- Add hyphen parsing cleaning step (with a bool flag to activate or not) to improve some country address parsing (see issue 137).
- Add ListDatasetContainer for Python list dataset.
0.7.6
- Re-release the version 0.7.5 into 0.7.6 due to manipulation error and change in PyPi (now delete does not delete release by yank does).
0.7.5
- Bug-fix Poutyne version handling that causes a print error when a version is 1.11 when retraining
- Add the option to create a named retrain parsing model using by default the architecture setting or using the user-given name
- Hot-fix missing raise for DataError validation of address to parse when address is tuple
- Bug-fix handling of string column name for CSVDatasetContainer that raised ValueError
- Improve parse CLI doc and fix error in doc stating JSON format is supported as input data
- Add batch_size to parse CLI
- Add minimum version to Gensim 4.0.0.
- Add a new CLI function, retrain, to retrain from the command line
- Improve doc
- Add
cache_dir
to the BPEmb embedding model and toAddressParser
to change the embeddings cache directory and models weights cache directory - Change the
saving_dir
argument ofdownload_fastext_embeddings
anddownload_fasttext_magnitude_embeddings
function
tocache_dir
.saving_dir
is now deprecated and will be remove in version 0.8. - Add a new CLI function, test, to test from the command line