From d03c4c25a7406e1797062155bf9dbf15e68e95bd Mon Sep 17 00:00:00 2001 From: Shi Shuai <126407087+shuaills@users.noreply.github.com> Date: Mon, 17 Feb 2025 02:52:30 +0000 Subject: [PATCH] [docs] Update sampling_params.md (#3617) --- docs/references/sampling_params.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/references/sampling_params.md b/docs/references/sampling_params.md index ca7cc3c5f..279171eb8 100644 --- a/docs/references/sampling_params.md +++ b/docs/references/sampling_params.md @@ -49,7 +49,7 @@ Please refer to our dedicated guide on [constrained decoding](https://docs.sglan ### Other options -* `n`: ? +* `n`: Specifies the number of output sequences to generate per request. (Generating multiple outputs in one request (n > 1) is discouraged; separate requests offer better control and efficiency.) * `spaces_between_special_tokens`: Whether or not to add spaces between special tokens during detokenization. * `no_stop_trim`: Don't trim stop words or EOS token from the generated text. * `ignore_eos`: Don't stop generation when EOS token is sampled.