update README (#3543)
This commit is contained in:
1
.github/pull_request_template.md
vendored
1
.github/pull_request_template.md
vendored
@@ -15,3 +15,4 @@
|
|||||||
- [ ] Update documentation / docstrings / example tutorials as needed, according to [Writing Documentation](https://docs.sglang.ai/references/contribution_guide.html#writing-documentation-running-docs-ci).
|
- [ ] Update documentation / docstrings / example tutorials as needed, according to [Writing Documentation](https://docs.sglang.ai/references/contribution_guide.html#writing-documentation-running-docs-ci).
|
||||||
- [ ] Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to [Benchmark and Profiling](https://docs.sglang.ai/references/benchmark_and_profiling.html) and [Accuracy Results](https://docs.sglang.ai/references/accuracy_evaluation.html).
|
- [ ] Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to [Benchmark and Profiling](https://docs.sglang.ai/references/benchmark_and_profiling.html) and [Accuracy Results](https://docs.sglang.ai/references/accuracy_evaluation.html).
|
||||||
- [ ] For reviewers: If you haven't made any contributions to this PR and are only assisting with merging the main branch, please remove yourself as a co-author when merging the PR.
|
- [ ] For reviewers: If you haven't made any contributions to this PR and are only assisting with merging the main branch, please remove yourself as a co-author when merging the PR.
|
||||||
|
- [ ] Please feel free to join our Slack channel at https://slack.sglang.ai to discuss your PR.
|
||||||
|
|||||||
@@ -169,7 +169,7 @@ python3 -m sglang.bench_one_batch_server --model None --base-url http://10.0.0.1
|
|||||||
|
|
||||||
#### Troubleshooting
|
#### Troubleshooting
|
||||||
|
|
||||||
If you see the following error:
|
If you encounter the following error with fp16/bf16 checkpoint:
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
ValueError: Weight output_partition_size = 576 is not divisible by weight quantization block_n = 128.
|
ValueError: Weight output_partition_size = 576 is not divisible by weight quantization block_n = 128.
|
||||||
|
|||||||
Reference in New Issue
Block a user