2024-08-02 03:12:20 +08:00
|
|
|
# Code Structures
|
|
|
|
|
|
2024-07-13 15:00:26 -07:00
|
|
|
- `lang`: The frontend language.
|
2024-07-19 09:54:01 -07:00
|
|
|
- `srt`: The backend engine for running local models. (SRT = SGLang Runtime).
|
2024-09-01 02:36:56 -07:00
|
|
|
- `test`: The test utilities.
|
|
|
|
|
- `api.py`: The public APIs.
|
2024-10-19 12:58:55 -07:00
|
|
|
- `bench_latency.py`: Benchmark the latency of running a single static batch.
|
|
|
|
|
- `bench_server_latency.py`: Benchmark the latency of serving a single batch with a real server.
|
2024-07-19 10:58:03 -07:00
|
|
|
- `bench_serving.py`: Benchmark online serving with dynamic requests.
|
2024-07-13 15:00:26 -07:00
|
|
|
- `global_config.py`: The global configs and constants.
|
2024-09-12 23:29:44 -07:00
|
|
|
- `launch_server.py`: The entry point for launching the local server.
|
2024-07-13 15:00:26 -07:00
|
|
|
- `utils.py`: Common utilities.
|