[Bug fixes] fix generation cache bug #5535
Thanks for your contribution!
Codecov Report
@@ Coverage Diff @@
## develop #5535 +/- ##
===========================================
+ Coverage 59.23% 59.31% +0.07%
===========================================
Files 478 481 +3
Lines 67558 68097 +539
===========================================
+ Hits 40021 40392 +371
- Misses 27537 27705 +168
... and 24 files with indirect coverage changes
LGTM
Please pay particular attention to whether OPT, GPT, GLM, and Bloom can accommodate the corresponding changes.
The same change needs to be added for beam search as well.
@gongel I took a look on my end,
This issue was introduced by #5025. Jingjing, could you check whether the cache is still being cleared after the fix?
PR types
Bug fixes
PR changes
Models
Description
Fixes the bug where the cache is cleared in the generation API.