Skip to content

DJL v0.22.1 release

Compare
Choose a tag to compare
@frankfliu frankfliu released this 27 Apr 14:57
· 571 commits to master since this release

Key Features

  • Upgrades and enhancements for Engines
    • Upgrades PyTorch to 1.13.1 (#2245)
    • Upgrades TensorFlow engine to 2.10.1 (#2440)
    • Upgrades XGBoost to 1.7.5 (#2522)
    • DJLServing release 0.22.1

Enhancement

  • Introduces several enhancement for HuggingFace tokenizer:

    • Allows tokenizer native library load from different classloader (#2465)
    • Makes Huggingface model zoo lazy load (#2469)
    • Make Huggingface tokenizers translator factory serializable (#2442)
  • Introduces several enhancement for Spark extension:

    • Adds audio predictors (#2466)
    • Adds more image predictors and change some APIs (#2456)
    • Adds more text predictors (#2443)
    • Adds np_util (#2419)
    • Adds pyspark TextEmbedder and update ImageClassifier (#2414)
    • Adds text generation in pyspark (#2477)
    • Adds text2text generation (#2506)
    • Adds whisper python code (#2513)
    • Upgrades spark version to 3.3.2 (#2523)
  • DJL API improvements:

    • Adds support for unique, bmm, xlogy (#2415)
    • Fixes NDArray.toByteArray() bug (#2436)
    • Adds NDArray.copyTo() support for NDArrayAdapter (#2437)
    • Improves Classifications.toString() print out (#2439)
    • Makes Batchifier serializable (#2441)
    • Loads inputShapes in the loadMetadata method of Linear block (#2448)
    • Adds chunked output support (#2453)
    • Makes audio and cv translator factory serializable (#2455)
    • Adds NamedEntity.toString() function (#2468)
    • Streaming Predict and streamable BytesSupplier (#2470)
    • Mitigates ZipInputStream CVE. (#2473)
    • Adds getProperties() to Model interface (#2476)
    • Adds non-blocking poll() for BytesSupplier (#2478)
    • Makes PassthroughNDManager aware of engine and device (#2484)
    • Fixes telemetry opt out (#2490)
    • Uses sha-256 to avoid security warning (#2495)
    • Moves NeuronUtils to api package (#2496)
    • Adds encode and decode to Input and Output (#2502)
    • Fails model loading if specified translator not found (#2515)
    • Adds a way to check if streaming is supported (#2518)
    • Fixed detect platform for different CUDA version (#2527)
    • Fixes neuron core detection in docker container (#2536)
  • PyTorch engine improvements:

    • Upgrades PyTorch engine to 2.0.0 (#2525)
    • Implements unique operator for PyTorch engine (#2417)
    • Adds yolov5s to pytorch model zoo (#2433)
    • Respect PYTORCH_FLAVOR override to download libtorch (#2486)
    • Print log if graph optimizer is enabled (#2501)
  • OnnxRuntime engine improvements:

    • Adds support for OnnxRuntime Profiler (#2472)
  • MXNet engine improvements:

    • Enables boolean index on mxnet (#2427)

Breaking change

Bug fixes

  • Fixes pytorch-native-cu118 package in BOM (#2535)
  • Fixes spark package name in BOM (#2534)
  • Fixes OnnxRuntime version (#2524)
  • Fixes memory leak in get with and to long, double, float, ... (#2428)

Documentation and Examples

  • Adds timeseries examples document (#2411)
  • Fixes link to doc Mask detection with YOLOv5 (#2529)
  • Adds DeferredTranslatorFactory to tokenizers example (#2511)
  • Updates depednency manage for spark extension (#2531)
  • Updates FAQ and troubleshooting documents (#2454)
  • Cleans inference performance optimization doc (#2519)
  • Adds Yolov5 on Face Mask Detection (#2452)

CI improvements:

  • Simplifies BOM build script (#2438)
  • Avoids re-publish serving tarball (#2479)
  • Fixes gradle 8.0 native publish issue (#2457)
  • Fixes gradle 8.0 publish to release issue (#2460)
  • Upgrades gradle to 8.0.2 (#2449)
  • Uses recommended way to create task in build.gradle (#2451)

Contributors

@frankfliu
@KexinFeng
@lanking520
@nezda
@tipame
@xyang16
@zachgk

New Contributors

Full Changelog: v0.21.0...v0.22.1