Skip to content

add GPU quantization support

Compare
Choose a tag to compare
@pommedeterresautee pommedeterresautee released this 08 Dec 22:46
· 161 commits to main since this release
ad837a9
  • support int-8 GPU quantization
  • add a tuto to perform quantization end to end
  • add QDQRoberta model
  • switch to ONNX opset 13
  • refactoring in the TensorRT engine creation
  • fix bugs
  • add auth token (for private HF repo)

What's Changed

New Contributors

Full Changelog: v0.1.1...v0.2.0