Nucleus sampling with beam? #4882

lenbrocki · 2022-11-15T15:43:26Z

lenbrocki
Nov 15, 2022

I am a bit confused about how exactly the inference works when I choose nucleus sampling, in particular why there is a beam size as parameter. My understanding was that nucleus sampling does not do beam search, but instead it is a version of random sampling(so randomly choose on each decoding step). Does the implementation of nucleus sampling in ParlAI somehow combine this with beam search?

Answered by klshuster

Nov 16, 2022

When using sampling, you can think of the beam_size parameter as more of a best_of_n parameter; if you specify e.g. --beam-size 5 with nucleus sampling, ParlAI will sample 5 generations in parallel and output the one with the highest score at the end.

View full answer

klshuster · 2022-11-16T15:09:15Z

klshuster
Nov 16, 2022

When using sampling, you can think of the beam_size parameter as more of a best_of_n parameter; if you specify e.g. --beam-size 5 with nucleus sampling, ParlAI will sample 5 generations in parallel and output the one with the highest score at the end.

1 reply

lenbrocki Nov 17, 2022
Author

I see, thanks for the answer!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nucleus sampling with beam? #4882

{{title}}

Replies: 1 comment 1 reply

{{title}}

{{title}}

Select a reply

Nucleus sampling with beam? #4882

lenbrocki Nov 15, 2022

Replies: 1 comment · 1 reply

klshuster Nov 16, 2022

lenbrocki Nov 17, 2022 Author

lenbrocki
Nov 15, 2022

Replies: 1 comment 1 reply

klshuster
Nov 16, 2022

lenbrocki Nov 17, 2022
Author