Chranos
dd221f3084
add llama4
2026-02-11 17:25:38 +08:00
Chranos
7b4f7d74c3
add llama4
2026-02-11 16:08:37 +08:00
Chranos
16d41a8fc1
add deepseekv3 and llama4
2026-02-11 16:03:06 +08:00
Chranos
633aa4db30
add deepseekv3 and llama4
2026-02-11 15:58:34 +08:00
Chranos
6eae065dd6
add deepseekv3 and llama4
2026-02-11 15:48:35 +08:00
Chranos
e752946445
add deepseekv3 and llama4
2026-02-11 15:44:44 +08:00
Chranos
7626238695
add deepseekv3 and llama4
2026-02-11 15:40:19 +08:00
Chranos
f3a4d10195
add deepseekv3 and llama4
2026-02-11 15:39:35 +08:00
Chranos
ed6a2aff91
add deepseekv3 and llama4
2026-02-11 15:37:19 +08:00
Chranos
6faa595799
add deepseekv3 and llama4
2026-02-11 15:32:07 +08:00
Chranos
50e02f2011
add deepseekv3 and llama4
2026-02-11 15:27:19 +08:00
Chranos
c584139543
add deepseekv3 and llama4
2026-02-11 15:24:13 +08:00
Chranos
2ad23aa8da
add deepseekv3 and llama4
2026-02-11 15:17:07 +08:00
Chranos
86fd3b5a92
add deepseekv3 and llama4
2026-02-11 15:13:14 +08:00
Chranos
eaeb5169e0
add deepseekv3 and llama4
2026-02-11 15:09:59 +08:00
Chranos
44ffd2094a
add deepseekv3 and llama4
2026-02-11 15:07:52 +08:00
Chranos
5132af6176
add deepseekv3 and llama4
2026-02-11 15:05:55 +08:00
Chranos
5c4c2222ba
add deepseekv3 and llama4
2026-02-11 15:03:30 +08:00
Chranos
026380fddb
add deepseekv3 and llama4
2026-02-11 14:56:40 +08:00
Chranos
d9d1f3a724
add deepseekv3 and llama4
2026-02-11 14:39:48 +08:00
Chranos
d93c740e4d
add deepseekv3 and llama4
2026-02-11 14:37:00 +08:00
Chranos
153bc4ec7b
add deepseekv3 and llama4
2026-02-11 14:32:37 +08:00
Chranos
96ed925486
add deepseekv3 and llama4
2026-02-11 14:30:01 +08:00
Chranos
8ac7afcbd3
add deepseekv3 and llama4
2026-02-11 14:26:59 +08:00
Chranos
128aed196c
add deepseekv3 and llama4
2026-02-11 14:19:17 +08:00
Chranos
659ef273c8
add deepseekv3
2026-02-11 13:18:03 +08:00
Chranos
98003e6f8b
add deepseekv3
2026-02-11 13:12:46 +08:00
Chranos
094541296e
add deepseekv3
2026-02-11 12:28:36 +08:00
Chranos
5a05c22162
add deepseekv3
2026-02-11 11:40:57 +08:00
Chranos
60f3a23d5f
add deepseekv3
2026-02-11 11:35:12 +08:00
Chranos
934ed88691
add qwen3_moe
2026-02-10 18:30:48 +08:00
Chranos
fa0219fbf8
add qwen3_moe
2026-02-10 18:22:13 +08:00
Chranos
efbb06147a
add qwen3_moe
2026-02-10 18:18:32 +08:00
Chranos
a26729bf7f
add qwen3_moe
2026-02-10 18:09:58 +08:00
Chranos
8a613d15bd
add qwen3_moe
2026-02-10 18:02:40 +08:00
Chranos
a6f39375e5
debugging
2026-02-10 16:10:28 +08:00
Chranos
afc34d988e
debugging
2026-02-10 15:47:48 +08:00
Chranos
5fbe8b20a7
add gemma3
2026-02-10 14:26:03 +08:00
Chranos
2dad4e71c5
add gemma3
2026-02-10 14:15:33 +08:00
Chranos
cb1846cd4f
add gemma3
2026-02-10 14:10:04 +08:00
Chranos
81fc273396
add gemma3
2026-02-10 14:06:26 +08:00
Chranos
3ef89630ab
add gemma3
2026-02-10 13:00:25 +08:00
Chranos
40dee08f7b
fix: handle missing tie_word_embeddings attr in MPTConfig
...
Use getattr with default True for MPTConfig.tie_word_embeddings,
as some MPT model configs lack this attribute.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com >
2026-02-09 17:47:18 +08:00
Chranos
1d70f93cfc
debugging
2026-02-09 15:24:55 +08:00
Chranos
8ecba6115e
fix: add logger import to llama.py for unknown weight skip warning
...
The previous commit added a warning log for skipping unknown weights
(e.g. embed_tokens.biases) but missed importing the logger.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com >
2026-02-09 13:13:56 +08:00
Chranos
65ad893ee7
debugging
2026-02-09 13:00:35 +08:00
Chranos
a095dede48
fixed kvcache bug
2026-02-06 17:10:36 +08:00
Chranos
374826c841
fixing kvcache bug
2026-02-06 16:25:54 +08:00
Chranos
ebdc6fed03
fix: pass lm_head to LogitsProcessor instead of calling forward()
...
In vLLM v0.6.2, ParallelLMHead.forward() raises RuntimeError since
its weights should be used through LogitsProcessor.linear_method.apply().
Pass lm_head as first arg to LogitsProcessor which handles the
hidden_states -> logits projection internally.
2026-02-06 14:21:14 +08:00
Chranos
b702adf015
testing dynamic register
2026-02-06 14:17:06 +08:00