sglang

Author	SHA1	Message	Date
applesaucethebun	2ce8793519	Add typo checker in pre-commit (#6179 ) Co-authored-by: Brayden Zhong <b8zhong@uwaterloo.ca>	2025-05-11 12:55:00 +08:00
Lianmin Zheng	61f42b5732	Move sgl.Runtime under sglang/lang (#2990 )	2025-01-19 17:10:29 -08:00
Lianmin Zheng	f65c13b559	Remove normalized_prompt_logprobs from the engine to make code easier to maintain (#2902 )	2025-01-15 04:54:14 -08:00
Lianmin Zheng	835f8afc77	Migrate llama_classification to use the /classify interface (#2417 )	2024-12-08 23:30:51 -08:00
Yineng Zhang	3dbd73d319	minor: rm unused _grouped_size_compiled_for_decode_kernels (#2299 )	2024-12-01 19:24:12 +08:00
Yineng Zhang	118b6af35e	feat: add should_use_tensor_core (#2179 )	2024-12-01 18:01:16 +08:00
Liangsheng Yin	99ec439da4	Organize Attention Backends (#1547 )	2024-09-30 15:54:18 -07:00
Lianmin Zheng	3a6e8b6d78	[Minor] move triton attention kernels into a separate folder (#1379 )	2024-09-10 15:15:08 -07:00
Lianmin Zheng	f64eae3a29	[Fix] Reduce memory usage for loading llava model & Remove EntryClassRemapping (#1308 )	2024-09-02 21:44:45 -07:00
Lianmin Zheng	f6af3a6561	Cleanup readme, llava examples, usage examples and nccl init (#1194 )	2024-08-24 08:02:23 -07:00
foszto	c62d560c03	#590 Increase default , track changes in examples and documentation (#971 ) Co-authored-by: Ying Sheng <sqy1415@gmail.com>	2024-08-08 00:54:46 +00:00
Ying Sheng	3bc99e6fe4	Test openai vision api (#925 )	2024-08-05 13:51:55 +10:00
Ying Sheng	995af5a54b	Improve the structure of CI (#911 )	2024-08-03 23:09:21 -07:00