Logo
Explore Help
Register Sign In
EngineX-Hygon/sglang
5
0
Fork 0
You've already forked sglang
Code Issues Pull Requests Actions 7 Projects Releases Wiki Activity
Files
bbc07c4197fd2a40e0ba8aa53ce1d69c116e3081
sglang/python/sglang/srt
History
Lianmin Zheng bbc07c4197 Move sampling logits to float32 (#773)
2024-07-27 17:30:12 -07:00
..
constrained
Fix TransformerTokenizer init for chatglm2 & 3 (#761)
2024-07-27 02:44:46 -07:00
layers
Move sampling logits to float32 (#773)
2024-07-27 17:30:12 -07:00
managers
Move sampling logits to float32 (#773)
2024-07-27 17:30:12 -07:00
model_loader
refactor model loader: initial refactor (#664)
2024-07-20 02:18:22 -07:00
models
Deepseek v2 support (#693)
2024-07-26 17:10:07 -07:00
openai_api
Fix max_tokens for OpenAI chat completion API (#766)
2024-07-27 15:44:27 -07:00
conversation.py
Update OpenAI API (#667)
2024-07-19 23:20:54 -07:00
flush_cache.py
Improve doc strings (#518)
2024-06-08 02:39:32 -07:00
hf_transformers_utils.py
Fix context length (#757)
2024-07-26 18:13:13 -07:00
memory_pool.py
Fix prefill size (#711)
2024-07-24 03:41:15 -07:00
mm_utils.py
Handle grayscale images in expand2square (#97)
2024-01-24 16:23:11 -08:00
model_config.py
Deepseek v2 support (#693)
2024-07-26 17:10:07 -07:00
sampling_params.py
Fix max_tokens for OpenAI chat completion API (#766)
2024-07-27 15:44:27 -07:00
server_args.py
Deepseek v2 support (#693)
2024-07-26 17:10:07 -07:00
server.py
Fix bugs (fp8 checkpoints, triton cache manager) (#729)
2024-07-25 07:42:00 -07:00
utils.py
fix: small bug for llama-405b fp16 (#733)
2024-07-25 21:14:54 -07:00
Powered by Gitea Version: 1.24.3 Page: 88ms Template: 6ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API