github-actions released this 13 Aug 09:17
What's Changed
- Feature(MInference): support LLaMA-3-70B-1M and multi-gpu PP by @iofu728 in #59
- Fix(MInference): fix e2e benchmark guideline & fix A-shape multi gpu by @iofu728 in #66
- Fix(MInference): fix the vs pattern loss / sqrt(dk) by @PiotrNawrot in #70
Full Changelog: v0.1.5...v0.1.5.post1