Bot repeating itself #1

Open
zhangce opened this issue Feb 15, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@zhangce
Contributor

zhangce commented Feb 15, 2023

Problem

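The bot often repeats itself: once a phrase appears in the conversation, later replies loop on it verbatim, even when the user explicitly asks it not to repeat words.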

Examples

User: hi
Bot: Hello, how may I help you?
User: how many legs does a cat has?
Bot: Let me check the information for you. A cat has four legs.
User: could you write a story
Bot: Sure, a story about how a cat has four legs. A cat has four legs. A cat has four legs. A cat has four legs.
User: that was not a story
Bot: Let me try this. A cat has four legs. A cat has four legs. A cat has four legs. A cat has four legs. A cat has four legs. A cat has four legs. A cat has four legs. A cat has four legs.
User: please do not repeat words, tell a story about bear and honey
Bot: Sure, a story about bear and honey. Bear and honey. Bear and honey. Bear and honey. Bear and honey. Bear and honey. Bear and honey. Bear and honey. Bear and honey. Bear and honey. Bear and honey. Bear and honey.

Potential Solutions

https://arxiv.org/abs/2210.00660

zhangce added the enhancement label Feb 15, 2023
@LorrinWWW
Contributor

Another solution without touching the model:
https://github.com/huggingface/transformers/blob/ae54e3c3b18bac0832ad62ea9b896dfd52a09850/src/transformers/generation/logits_process.py#L469
It should be trivial to add to HF-based bots. FT does not support it natively, but it should be possible to add.
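
For illustration, here is a minimal pure-Python sketch of the kind of decoding-time fix that file contains, assuming the linked line refers to n-gram blocking (the file also houses a repetition penalty processor, so this is a guess at the intent). The function names `banned_next_tokens` and `block_repeated_ngrams` are hypothetical and this is not the library's implementation; it only shows the idea of masking any next token that would complete an n-gram already present in the generated sequence.

```python
from typing import Dict, List, Sequence, Tuple


def banned_next_tokens(prev_tokens: Sequence[int], ngram_size: int) -> List[int]:
    """Return token ids that would complete an n-gram already seen in prev_tokens."""
    if ngram_size <= 0 or len(prev_tokens) + 1 < ngram_size:
        return []
    # Map each (n-1)-token prefix to the tokens that have followed it so far.
    followers: Dict[Tuple[int, ...], List[int]] = {}
    for i in range(len(prev_tokens) - ngram_size + 1):
        ngram = tuple(prev_tokens[i:i + ngram_size])
        followers.setdefault(ngram[:-1], []).append(ngram[-1])
    # The last (n-1) tokens form the prefix that the next token would extend.
    current_prefix = tuple(prev_tokens[len(prev_tokens) - ngram_size + 1:])
    return followers.get(current_prefix, [])


def block_repeated_ngrams(
    prev_tokens: Sequence[int], scores: Sequence[float], ngram_size: int
) -> List[float]:
    """Return a copy of scores with banned tokens set to -inf so they cannot be sampled."""
    masked = list(scores)
    for token_id in banned_next_tokens(prev_tokens, ngram_size):
        masked[token_id] = float("-inf")
    return masked


# Hypothetical token ids: the sequence ends in (1, 2), and (1, 2, 3) already occurred,
# so token 3 is blocked for the next step when ngram_size is 3.
history = [1, 2, 3, 1, 2]
next_scores = block_repeated_ngrams(history, [0.0, 0.0, 0.0, 0.0, 0.0], ngram_size=3)
```

For HF-based bots, the built-in equivalents can be enabled by passing `no_repeat_ngram_size` (or `repetition_penalty`) to `generate`; suitable values would need to be tuned for the bot.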
