This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Hygon
/
sglang
Watch
5
Star
0
Fork
0
You've already forked sglang
Code
Issues
Pull Requests
Actions
7
Projects
Releases
Wiki
Activity
Files
c49c1d9226ad0a380aa854f60ea7daf5db191477
sglang
/
python
/
sglang
/
srt
/
lora
History
Lifu Huang
021f76e4f4
[Perf] Refactor LoRAManager to eliminate stream syncs and redundant computations (
#6994
)
2025-06-11 16:18:57 -07:00
..
backend
Revert "fix some typos" (
#6244
)
2025-05-12 12:53:26 -07:00
triton_ops
Revert "fix some typos" (
#6244
)
2025-05-12 12:53:26 -07:00
layers.py
Eliminate stream sync to speed up LoRA batch init (
#6960
)
2025-06-09 00:22:45 -07:00
lora_config.py
[Fix] Fix bugs and refactor codes in lora for better scalability. (
#3652
)
2025-02-20 11:51:57 -08:00
lora_manager.py
[Perf] Refactor LoRAManager to eliminate stream syncs and redundant computations (
#6994
)
2025-06-11 16:18:57 -07:00
lora.py
Support LoRA in TestOpenAIVisionServer and fix fused kv_proj loading bug. (
#6861
)
2025-06-04 22:08:30 -07:00
mem_pool.py
[Perf] Refactor LoRAManager to eliminate stream syncs and redundant computations (
#6994
)
2025-06-11 16:18:57 -07:00
utils.py
Refactor LoRA handling to support adapter tensors in fused format (
#6585
)
2025-05-26 21:51:54 -07:00