| Lianmin Zheng | 14229ccf8f | Move mem_fraction_static adjustment for multimodal models to server_args.py & Fix session control & Other cleanups (#7748) | 2025-07-04 16:33:33 -07:00 |
| Yi Zhang | 264dc6e744 | [optimize] add two stream norm for qwen3 (#7740) Co-authored-by: ispobock <ispobaoke@gmail.com> | 2025-07-03 09:59:17 -07:00 |
| Yi Zhang | 646cef2e2e | support qwen3 dense model dp attention (#7681) | 2025-07-03 09:58:20 -07:00 |
| Pan Lyu | 451ffe74d9 | support qwen3 emebedding (#6990) | 2025-06-09 01:32:49 -07:00 |
| Shenggui Li | 3f23d8cdf1 | added support for tied weights in qwen pipeline parallelism (#6546) | 2025-05-25 00:00:56 -07:00 |
| libra | 11553c1a37 | Add pipeline parallelism for Qwen2 and Qwen3 Model (#6250) | 2025-05-18 00:42:55 -07:00 |
| yhyang201 | 4db463b1ad | [Model] Adding Qwen3 and Qwen3MoE (#4693) | 2025-04-18 09:51:29 -07:00 |