v0.5.0
What's New
Several improvements to avoid CPU <> GPU device synchronizations, GLU support, and support for some new models 👀
What's Changed
- Update version by @mvpatel2000 in #36
- Avoid duplicate
.cpu()
call by @mvpatel2000 in #37 - Have megablocks rely on torch default precision by @mvpatel2000 in #39
- Add GLU support by @sashaDoubov in #38
- Enable generic dimentionality for input by @vchiley in #41
- Removing an extra size call by @bcui19 in #43
- Fix bug in topology kernel for ffn_hidden_size>4096. by @tgale96 in #47
New Contributors
- @sashaDoubov made their first contribution in #38
- @bcui19 made their first contribution in #43
Full Changelog: v0.4.0...v0.5.0