50 Commits

Author SHA1 Message Date
Chranos
934ed88691 add qwen3_moe v0.0.5 2026-02-10 18:30:48 +08:00
Chranos
fa0219fbf8 add qwen3_moe 2026-02-10 18:22:13 +08:00
Chranos
efbb06147a add qwen3_moe 2026-02-10 18:18:32 +08:00
Chranos
a26729bf7f add qwen3_moe 2026-02-10 18:09:58 +08:00
Chranos
8a613d15bd add qwen3_moe 2026-02-10 18:02:40 +08:00
Chranos
a6f39375e5 debugging 2026-02-10 16:10:28 +08:00
Chranos
afc34d988e debugging 2026-02-10 15:47:48 +08:00
Chranos
fa194c215b add gemma3 v0.0.4 2026-02-10 14:52:56 +08:00
Chranos
5fbe8b20a7 add gemma3 2026-02-10 14:26:03 +08:00
Chranos
2dad4e71c5 add gemma3 2026-02-10 14:15:33 +08:00
Chranos
cb1846cd4f add gemma3 2026-02-10 14:10:04 +08:00
Chranos
81fc273396 add gemma3 2026-02-10 14:06:26 +08:00
Chranos
3ef89630ab add gemma3 2026-02-10 13:00:25 +08:00
Chranos
40dee08f7b fix: handle missing tie_word_embeddings attr in MPTConfig
Use getattr with default True for MPTConfig.tie_word_embeddings,
as some MPT model configs lack this attribute.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-02-09 17:47:18 +08:00
Chranos
1d70f93cfc debugging 2026-02-09 15:24:55 +08:00
Chranos
8ecba6115e fix: add logger import to llama.py for unknown weight skip warning
The previous commit added a warning log for skipping unknown weights
(e.g. embed_tokens.biases) but missed importing the logger.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-02-09 13:13:56 +08:00
Chranos
65ad893ee7 debugging 2026-02-09 13:00:35 +08:00
Chranos
d08217307d update README 2026-02-09 11:46:04 +08:00
Chranos
8ac4215755 update README 2026-02-09 11:44:52 +08:00
Chranos
a095dede48 fixed kvcache bug 2026-02-06 17:10:36 +08:00
Chranos
374826c841 fixing kvcache bug 2026-02-06 16:25:54 +08:00
Chranos
ebdc6fed03 fix: pass lm_head to LogitsProcessor instead of calling forward()
In vLLM v0.6.2, ParallelLMHead.forward() raises RuntimeError since
its weights should be used through LogitsProcessor.linear_method.apply().
Pass lm_head as first arg to LogitsProcessor which handles the
hidden_states -> logits projection internally.
v0.0.3
2026-02-06 14:21:14 +08:00
Chranos
b702adf015 testing dynamic register 2026-02-06 14:17:06 +08:00
Chranos
fba02652c8 testing dynamic register 2026-02-06 14:04:04 +08:00
Chranos
5d2f4000cc testing dynamic register 2026-02-06 13:51:02 +08:00
Chranos
f088a6b45d testing dynamic register 2026-02-06 13:39:13 +08:00
Chranos
d31ace279b testing dynamic register 2026-02-05 18:57:04 +08:00
Chranos
ac2082ff36 testing dynamic register 2026-02-05 18:48:11 +08:00
Chranos
2068984bde testing dynamic register 2026-02-05 18:36:03 +08:00
Chranos
df848b4284 testing dynamic register 2026-02-05 18:24:33 +08:00
Chranos
4d0da98b9e testing dynamic register 2026-02-05 18:21:31 +08:00
Chranos
05605419e3 testing dynamic register 2026-02-05 18:08:05 +08:00
Chranos
332e5f71a6 testing dynamic register 2026-02-05 18:02:59 +08:00
Chranos
6e38461af6 testing dynamic register 2026-02-05 17:11:09 +08:00
Chranos
b399840b8d testing dynamic register 2026-02-05 16:30:44 +08:00
808b9b7c97 删除 .DS_Store 2026-02-05 16:20:54 +08:00
Chranos
6b650ae280 add gitignore 2026-02-05 16:19:33 +08:00
Chranos
92f0016e6f add dynamic register 2026-02-05 15:53:43 +08:00
Chranos
9563c9af0d opt llama3 2026-02-05 11:53:52 +08:00
Chranos
3b3e614cb6 opt llama3 2026-02-05 11:42:01 +08:00
Chranos
3cf13dd8c5 add ops 2026-02-04 17:51:35 +08:00
Chranos
79dfc69789 add ops v0.0.2 2026-02-04 17:39:32 +08:00
Chranos
8511fe8530 add qwen3 2026-02-04 17:22:39 +08:00
d1c0f68ab4 add more test results 2025-10-17 14:22:33 +08:00
zhousha
5981370919 update Dockerfile 2025-09-19 11:20:33 +08:00
608fa14c31 feature: add 2025-08-27 16:55:00 +08:00
fdf948f401 feature: add 2025-08-27 16:54:23 +08:00
b9e9a99423 feature: add 2025-08-27 16:52:53 +08:00
62f765bf4b feature: add 2025-08-27 16:49:09 +08:00
461c66ccc0 feature: add 2025-08-26 14:23:17 +08:00