DJL v0.22.1 release
Key Features
- Upgrades and enhancements for Engines
Enhancement
- Introduces several enhancements for HuggingFace tokenizer:
- Introduces several enhancements for Spark extension:
  - Adds audio predictors (#2466)
  - Adds more image predictors and changes some APIs (#2456)
  - Adds more text predictors (#2443)
  - Adds np_util (#2419)
  - Adds pyspark TextEmbedder and updates ImageClassifier (#2414)
  - Adds text generation in pyspark (#2477)
  - Adds text2text generation (#2506)
  - Adds Whisper Python code (#2513)
  - Upgrades Spark version to 3.3.2 (#2523)
- DJL API improvements:
  - Adds support for unique, bmm, xlogy (#2415)
  - Fixes NDArray.toByteArray() bug (#2436)
  - Adds NDArray.copyTo() support for NDArrayAdapter (#2437)
  - Improves Classifications.toString() print out (#2439)
  - Makes Batchifier serializable (#2441)
  - Loads inputShapes in the loadMetadata method of Linear block (#2448)
  - Adds chunked output support (#2453)
  - Makes audio and cv translator factory serializable (#2455)
  - Adds NamedEntity.toString() function (#2468)
  - Adds streaming predict and streamable BytesSupplier (#2470)
  - Mitigates ZipInputStream CVE (#2473)
  - Adds getProperties() to Model interface (#2476)
  - Adds non-blocking poll() for BytesSupplier (#2478)
  - Makes PassthroughNDManager aware of engine and device (#2484)
  - Fixes telemetry opt out (#2490)
  - Uses sha-256 to avoid security warning (#2495)
  - Moves NeuronUtils to api package (#2496)
  - Adds encode and decode to Input and Output (#2502)
  - Fails model loading if specified translator not found (#2515)
  - Adds a way to check if streaming is supported (#2518)
  - Fixes platform detection for different CUDA versions (#2527)
  - Fixes neuron core detection in docker container (#2536)
- PyTorch engine improvements:
- OnnxRuntime engine improvements:
  - Adds support for OnnxRuntime Profiler (#2472)
- MXNet engine improvements:
  - Enables boolean indexing on MXNet (#2427)
Breaking change
Bug fixes
- Fixes pytorch-native-cu118 package in BOM (#2535)
- Fixes spark package name in BOM (#2534)
- Fixes OnnxRuntime version (#2524)
- Fixes memory leak in get with and to long, double, float, ... (#2428)
Documentation and Examples
- Adds timeseries examples document (#2411)
- Fixes link to doc Mask detection with YOLOv5 (#2529)
- Adds DeferredTranslatorFactory to tokenizers example (#2511)
- Updates dependency management for Spark extension (#2531)
- Updates FAQ and troubleshooting documents (#2454)
- Cleans inference performance optimization doc (#2519)
- Adds Yolov5 on Face Mask Detection (#2452)
CI improvements
- Simplifies BOM build script (#2438)
- Avoids re-publish serving tarball (#2479)
- Fixes gradle 8.0 native publish issue (#2457)
- Fixes gradle 8.0 publish to release issue (#2460)
- Upgrades gradle to 8.0.2 (#2449)
- Uses recommended way to create task in build.gradle (#2451)
Contributors
@frankfliu
@KexinFeng
@lanking520
@nezda
@tipame
@xyang16
@zachgk
New Contributors
Full Changelog: v0.21.0...v0.22.1