Chranos
9acbda437e
add deepseekv3
2026-02-11 17:47:15 +08:00
Chranos
dfb4cff2fc
add deepseekv3
2026-02-11 17:47:15 +08:00
Chranos
6c222e8f14
add deepseekv3
2026-02-11 17:47:15 +08:00
Chranos
3ec228b6fa
add deepseekv3
2026-02-11 17:47:15 +08:00
Chranos
463fbf8cd1
add qwen3_moe
2026-02-11 17:47:15 +08:00
Chranos
6f6997bafb
add qwen3_moe
2026-02-11 17:47:14 +08:00
Chranos
6479429662
add qwen3_moe
2026-02-11 17:47:14 +08:00
Chranos
2a9f483af8
add qwen3_moe
2026-02-11 17:47:14 +08:00
Chranos
cf92e95688
add qwen3_moe
2026-02-11 17:47:14 +08:00
Chranos
d7f5ef1db9
add qwen3_moe
2026-02-11 17:47:14 +08:00
Chranos
de8fc97532
debugging
2026-02-11 17:47:14 +08:00
Chranos
893eeb2208
debugging
2026-02-11 17:47:14 +08:00
Chranos
8f2ae4f67e
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
89dc931222
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
a7028ae481
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
2e24d45668
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
5b9e02990a
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
ff94650fd1
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
464beead22
fix: handle missing tie_word_embeddings attr in MPTConfig
...
Use getattr with default True for MPTConfig.tie_word_embeddings,
as some MPT model configs lack this attribute.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com >
2026-02-11 17:47:14 +08:00
Chranos
ad087d5cf3
debugging
2026-02-11 17:47:14 +08:00
Chranos
6b708a43d8
fix: add logger import to llama.py for unknown weight skip warning
...
The previous commit added a warning log for skipping unknown weights
(e.g. embed_tokens.biases) but missed importing the logger.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com >
2026-02-11 17:47:14 +08:00
Chranos
8efce7c44c
debugging
2026-02-11 17:47:14 +08:00
Chranos
66d146dfad
update README
2026-02-11 17:47:14 +08:00
Chranos
c35d463486
update README
2026-02-11 17:47:14 +08:00
Chranos
7420866d4c
fixed kvcache bug
2026-02-11 17:47:14 +08:00
Chranos
3fed2190ad
fixing kvcache bug
2026-02-06 16:39:42 +08:00
Chranos
c1b6f39a11
fix: pass lm_head to LogitsProcessor instead of calling forward()
...
In vLLM v0.6.2, ParallelLMHead.forward() raises RuntimeError since
its weights should be used through LogitsProcessor.linear_method.apply().
Pass lm_head as first arg to LogitsProcessor which handles the
hidden_states -> logits projection internally.
2026-02-06 15:05:49 +08:00
Chranos
3e301ce158
testing dynamic register
2026-02-06 15:05:49 +08:00
Chranos
87f96e1001
testing dynamic register
2026-02-06 15:05:49 +08:00
Chranos
e1a2afd244
testing dynamic register
2026-02-06 15:05:49 +08:00
Chranos
63a1a05999
testing dynamic register
2026-02-06 15:05:49 +08:00
Chranos
6d814b0cd4
testing dynamic register
2026-02-06 15:05:48 +08:00
Chranos
dc239a740c
testing dynamic register
2026-02-06 15:05:48 +08:00
Chranos
a476b6458b
testing dynamic register
2026-02-06 15:05:48 +08:00
Chranos
80e9a636af
testing dynamic register
2026-02-06 15:05:48 +08:00
Chranos
16353d5d2a
testing dynamic register
2026-02-06 15:05:48 +08:00
Chranos
70bee4e3ec
testing dynamic register
2026-02-06 15:05:48 +08:00
Chranos
83c958a7c5
testing dynamic register
2026-02-06 15:05:48 +08:00
Chranos
9b84dd52be
testing dynamic register
2026-02-06 15:05:48 +08:00
Chranos
2cb9f6ce1d
testing dynamic register
2026-02-06 15:05:48 +08:00
31e7cd3bf9
删除 .DS_Store
2026-02-05 16:21:10 +08:00
Chranos
6b650ae280
add gitignore
2026-02-05 16:19:33 +08:00
Chranos
92f0016e6f
add dynamic register
2026-02-05 15:53:43 +08:00
Chranos
9563c9af0d
opt llama3
2026-02-05 11:53:52 +08:00
Chranos
3b3e614cb6
opt llama3
2026-02-05 11:42:01 +08:00
Chranos
3cf13dd8c5
add ops
2026-02-04 17:51:35 +08:00
Chranos
79dfc69789
add ops
v0.0.2
2026-02-04 17:39:32 +08:00
Chranos
8511fe8530
add qwen3
v0.0.1
2026-02-04 17:22:39 +08:00
d1c0f68ab4
add more test results
2025-10-17 14:22:33 +08:00
zhousha
5981370919
update Dockerfile
2025-09-19 11:20:33 +08:00