Chranos
934ed88691
add qwen3_moe
v0.0.5
2026-02-10 18:30:48 +08:00
Chranos
fa0219fbf8
add qwen3_moe
2026-02-10 18:22:13 +08:00
Chranos
efbb06147a
add qwen3_moe
2026-02-10 18:18:32 +08:00
Chranos
a26729bf7f
add qwen3_moe
2026-02-10 18:09:58 +08:00
Chranos
8a613d15bd
add qwen3_moe
2026-02-10 18:02:40 +08:00
Chranos
a6f39375e5
debugging
2026-02-10 16:10:28 +08:00
Chranos
afc34d988e
debugging
2026-02-10 15:47:48 +08:00
Chranos
fa194c215b
add gemma3
v0.0.4
2026-02-10 14:52:56 +08:00
Chranos
5fbe8b20a7
add gemma3
2026-02-10 14:26:03 +08:00
Chranos
2dad4e71c5
add gemma3
2026-02-10 14:15:33 +08:00
Chranos
cb1846cd4f
add gemma3
2026-02-10 14:10:04 +08:00
Chranos
81fc273396
add gemma3
2026-02-10 14:06:26 +08:00
Chranos
3ef89630ab
add gemma3
2026-02-10 13:00:25 +08:00
Chranos
40dee08f7b
fix: handle missing tie_word_embeddings attr in MPTConfig
...
Use getattr with default True for MPTConfig.tie_word_embeddings,
as some MPT model configs lack this attribute.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com >
2026-02-09 17:47:18 +08:00
Chranos
1d70f93cfc
debugging
2026-02-09 15:24:55 +08:00
Chranos
8ecba6115e
fix: add logger import to llama.py for unknown weight skip warning
...
The previous commit added a warning log for skipping unknown weights
(e.g. embed_tokens.biases) but missed importing the logger.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com >
2026-02-09 13:13:56 +08:00
Chranos
65ad893ee7
debugging
2026-02-09 13:00:35 +08:00
Chranos
d08217307d
update README
2026-02-09 11:46:04 +08:00
Chranos
8ac4215755
update README
2026-02-09 11:44:52 +08:00
Chranos
a095dede48
fixed kvcache bug
2026-02-06 17:10:36 +08:00
Chranos
374826c841
fixing kvcache bug
2026-02-06 16:25:54 +08:00
Chranos
ebdc6fed03
fix: pass lm_head to LogitsProcessor instead of calling forward()
...
In vLLM v0.6.2, ParallelLMHead.forward() raises RuntimeError since
its weights should be used through LogitsProcessor.linear_method.apply().
Pass lm_head as first arg to LogitsProcessor which handles the
hidden_states -> logits projection internally.
v0.0.3
2026-02-06 14:21:14 +08:00
Chranos
b702adf015
testing dynamic register
2026-02-06 14:17:06 +08:00
Chranos
fba02652c8
testing dynamic register
2026-02-06 14:04:04 +08:00
Chranos
5d2f4000cc
testing dynamic register
2026-02-06 13:51:02 +08:00
Chranos
f088a6b45d
testing dynamic register
2026-02-06 13:39:13 +08:00
Chranos
d31ace279b
testing dynamic register
2026-02-05 18:57:04 +08:00
Chranos
ac2082ff36
testing dynamic register
2026-02-05 18:48:11 +08:00
Chranos
2068984bde
testing dynamic register
2026-02-05 18:36:03 +08:00
Chranos
df848b4284
testing dynamic register
2026-02-05 18:24:33 +08:00
Chranos
4d0da98b9e
testing dynamic register
2026-02-05 18:21:31 +08:00
Chranos
05605419e3
testing dynamic register
2026-02-05 18:08:05 +08:00
Chranos
332e5f71a6
testing dynamic register
2026-02-05 18:02:59 +08:00
Chranos
6e38461af6
testing dynamic register
2026-02-05 17:11:09 +08:00
Chranos
b399840b8d
testing dynamic register
2026-02-05 16:30:44 +08:00
808b9b7c97
删除 .DS_Store
2026-02-05 16:20:54 +08:00
Chranos
6b650ae280
add gitignore
2026-02-05 16:19:33 +08:00
Chranos
92f0016e6f
add dynamic register
2026-02-05 15:53:43 +08:00
Chranos
9563c9af0d
opt llama3
2026-02-05 11:53:52 +08:00
Chranos
3b3e614cb6
opt llama3
2026-02-05 11:42:01 +08:00
Chranos
3cf13dd8c5
add ops
2026-02-04 17:51:35 +08:00
Chranos
79dfc69789
add ops
v0.0.2
2026-02-04 17:39:32 +08:00
Chranos
8511fe8530
add qwen3
2026-02-04 17:22:39 +08:00
d1c0f68ab4
add more test results
2025-10-17 14:22:33 +08:00
zhousha
5981370919
update Dockerfile
2025-09-19 11:20:33 +08:00
608fa14c31
feature: add
2025-08-27 16:55:00 +08:00
fdf948f401
feature: add
2025-08-27 16:54:23 +08:00
b9e9a99423
feature: add
2025-08-27 16:52:53 +08:00
62f765bf4b
feature: add
2025-08-27 16:49:09 +08:00
461c66ccc0
feature: add
2025-08-26 14:23:17 +08:00