Chranos
374826c841
fixing kvcache bug
2026-02-06 16:25:54 +08:00
Chranos
ebdc6fed03
fix: pass lm_head to LogitsProcessor instead of calling forward()
...
In vLLM v0.6.2, ParallelLMHead.forward() raises RuntimeError since
its weights should be used through LogitsProcessor.linear_method.apply().
Pass lm_head as first arg to LogitsProcessor which handles the
hidden_states -> logits projection internally.
2026-02-06 14:21:14 +08:00
Chranos
b702adf015
testing dynamic register
2026-02-06 14:17:06 +08:00
Chranos
fba02652c8
testing dynamic register
2026-02-06 14:04:04 +08:00
Chranos
5d2f4000cc
testing dynamic register
2026-02-06 13:51:02 +08:00
Chranos
f088a6b45d
testing dynamic register
2026-02-06 13:39:13 +08:00
Chranos
d31ace279b
testing dynamic register
2026-02-05 18:57:04 +08:00
Chranos
ac2082ff36
testing dynamic register
2026-02-05 18:48:11 +08:00
Chranos
2068984bde
testing dynamic register
2026-02-05 18:36:03 +08:00
Chranos
df848b4284
testing dynamic register
2026-02-05 18:24:33 +08:00
Chranos
4d0da98b9e
testing dynamic register
2026-02-05 18:21:31 +08:00
Chranos
05605419e3
testing dynamic register
2026-02-05 18:08:05 +08:00
Chranos
332e5f71a6
testing dynamic register
2026-02-05 18:02:59 +08:00
Chranos
6e38461af6
testing dynamic register
2026-02-05 17:11:09 +08:00
Chranos
b399840b8d
testing dynamic register
2026-02-05 16:30:44 +08:00
Chranos
92f0016e6f
add dynamic register
2026-02-05 15:53:43 +08:00
Chranos
9563c9af0d
opt llama3
2026-02-05 11:53:52 +08:00
Chranos
8511fe8530
add qwen3
2026-02-04 17:22:39 +08:00