Rename sglang.bench_latency to sglang.bench_one_batch (#2118)

2024-11-21 20:07:48 -08:00
parent 8048c28c11
commit dfec7fca06
16 changed files with 521 additions and 599 deletions
--- a/python/sglang/README.md
+++ b/python/sglang/README.md
@@ -4,9 +4,11 @@
 - `srt`: The backend engine for running local models. (SRT = SGLang Runtime).
 - `test`: The test utilities.
 - `api.py`: The public APIs.
- `bench_latency.py`: Benchmark the latency of running a single static batch.
- `bench_server_latency.py`: Benchmark the latency of serving a single batch with a real server.
+- `bench_offline_throughput.py`: Benchmark the throughput in the offline mode.
+- `bench_one_batch.py`: Benchmark the latency of running a single static batch without a server.
+- `bench_one_batch_server.py`: Benchmark the latency of running a single batch with a server.
 - `bench_serving.py`: Benchmark online serving with dynamic requests.
+- `check_env.py`: Check the environment variables.
 - `global_config.py`: The global configs and constants.
 - `launch_server.py`: The entry point for launching the local server.
 - `utils.py`: Common utilities.