update README (#3543)
.github/pull_request_template.md
@@ -15,3 +15,4 @@
- [ ] Update documentation / docstrings / example tutorials as needed, according to [Writing Documentation](https://docs.sglang.ai/references/contribution_guide.html#writing-documentation-running-docs-ci).
- [ ] Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to [Benchmark and Profiling](https://docs.sglang.ai/references/benchmark_and_profiling.html) and [Accuracy Results](https://docs.sglang.ai/references/accuracy_evaluation.html).
- [ ] For reviewers: If you haven't made any contributions to this PR and are only assisting with merging the main branch, please remove yourself as a co-author when merging the PR.
- [ ] Please feel free to join our Slack channel at https://slack.sglang.ai to discuss your PR.
@@ -169,7 +169,7 @@ python3 -m sglang.bench_one_batch_server --model None --base-url http://10.0.0.1
#### Troubleshooting
-If you see the following error:
+If you encounter the following error with fp16/bf16 checkpoint:
```bash
ValueError: Weight output_partition_size = 576 is not divisible by weight quantization block_n = 128.