
v0.1.0

@SimJeg SimJeg released this 12 Dec 09:22
2b350b0

#24 by @maxjeblick and #29 by @SimJeg introduce a non-breaking refactoring:

  • a press no longer requires the compression_ratio input argument, since some presses do not explicitly need it (e.g. ThinKPress, SimLayerKVPress). However, every press must have a compression_ratio attribute after any forward pass (an assertion was added in the tests) so that the average compression ratio can be measured on a benchmark
  • the core compression logic has moved from BasePress.forward_hook to BasePress.compress. BasePress.forward_hook now only checks whether compress must be called (pre-filling vs decoding), de-quantizes the cache before compress, and re-quantizes it afterwards
  • BasePress no longer implements a score method; it has moved to ScorerPress, together with the associated ScorerPress.compress method
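
The split of responsibilities described above can be pictured with a small pure-Python sketch. It is not the library's code: the Toy* classes, the list-based cache, the scoring rule, and the `is_prefilling` and `max_tokens` parameters are all hypothetical names chosen for illustration, and the de-quantize / re-quantize steps are identity placeholders.

```python
class ToyBasePress:
    """Hypothetical stand-in for a base press (not the real BasePress)."""

    def compress(self, keys, values):
        # Core compression logic lives here; subclasses override it.
        raise NotImplementedError

    def forward_hook(self, keys, values, is_prefilling):
        # The hook only decides whether compress must run.
        if not is_prefilling:
            return keys, values  # decoding step: leave the cache untouched
        keys, values = self._dequantize(keys, values)
        keys, values = self.compress(keys, values)
        return self._requantize(keys, values)

    # Identity placeholders: this sketch does not model quantization.
    def _dequantize(self, keys, values):
        return keys, values

    def _requantize(self, keys, values):
        return keys, values


class ToyScorerPress(ToyBasePress):
    """Score-based press: score and compress both live on this subclass."""

    def __init__(self, compression_ratio=0.0):
        self.compression_ratio = compression_ratio

    def score(self, keys, values):
        # Assumed scoring rule for illustration: L1 norm of each key vector.
        return [sum(abs(x) for x in key) for key in keys]

    def compress(self, keys, values):
        scores = self.score(keys, values)
        n_kept = int(len(keys) * (1 - self.compression_ratio))
        ranked = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)
        kept = sorted(ranked[:n_kept])  # keep original token order
        return [keys[i] for i in kept], [values[i] for i in kept]


class ToyBudgetPress(ToyBasePress):
    """Press with no compression_ratio argument; the attribute is set in compress."""

    def __init__(self, max_tokens):
        self.max_tokens = max_tokens
        self.compression_ratio = None  # filled in after a forward pass

    def compress(self, keys, values):
        total = len(keys)
        n_kept = min(self.max_tokens, total)
        self.compression_ratio = 1 - n_kept / total if total else 0.0
        start = total - n_kept  # keep the most recent tokens
        return keys[start:], values[start:]


keys = [[1.0], [3.0], [2.0], [4.0]]
values = ["a", "b", "c", "d"]

scorer = ToyScorerPress(compression_ratio=0.5)
kept_keys, kept_values = scorer.forward_hook(keys, values, is_prefilling=True)

budget = ToyBudgetPress(max_tokens=1)
budget_keys, budget_values = budget.forward_hook(keys, values, is_prefilling=True)
```

In this sketch both presses carry a compression_ratio attribute after the hook has run, even though only ToyScorerPress takes it as a constructor argument, which is the property the added test assertion is described as checking.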

Other features: