Rename sglang.bench_latency to sglang.bench_one_batch (#2118)
This commit is contained in:
@@ -59,7 +59,7 @@ For interactive debugging, you can compare the outputs of huggingface/transforme
|
||||
The following two commands should give the same text output and very similar prefill logits.
|
||||
|
||||
- Get the reference output by `python3 scripts/playground/reference_hf.py --model [new model]`
|
||||
- Get the SGLang output by `python3 -m sglang.bench_latency --correct --model [new model]`
|
||||
- Get the SGLang output by `python3 -m sglang.bench_one_batch --correct --model [new model]`
|
||||
|
||||
#### Add the model to the test suite
|
||||
To make sure the new model is well maintained in the future, it is better to add it to the test suite.
|
||||
|
||||
Reference in New Issue
Block a user