Commit Graph

37 Commits

Author SHA1 Message Date
Chranos
464beead22 fix: handle missing tie_word_embeddings attr in MPTConfig
Use getattr with default True for MPTConfig.tie_word_embeddings,
as some MPT model configs lack this attribute.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-02-11 17:47:14 +08:00
Chranos
ad087d5cf3 debugging 2026-02-11 17:47:14 +08:00
Chranos
6b708a43d8 fix: add logger import to llama.py for unknown weight skip warning
The previous commit added a warning log for skipping unknown weights
(e.g. embed_tokens.biases) but missed importing the logger.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-02-11 17:47:14 +08:00
Chranos
8efce7c44c debugging 2026-02-11 17:47:14 +08:00
Chranos
66d146dfad update README 2026-02-11 17:47:14 +08:00
Chranos
c35d463486 update README 2026-02-11 17:47:14 +08:00
Chranos
7420866d4c fixed kvcache bug 2026-02-11 17:47:14 +08:00
Chranos
3fed2190ad fixing kvcache bug 2026-02-06 16:39:42 +08:00
Chranos
c1b6f39a11 fix: pass lm_head to LogitsProcessor instead of calling forward()
In vLLM v0.6.2, ParallelLMHead.forward() raises RuntimeError since
its weights should be used through LogitsProcessor.linear_method.apply().
Pass lm_head as first arg to LogitsProcessor which handles the
hidden_states -> logits projection internally.
2026-02-06 15:05:49 +08:00
Chranos
3e301ce158 testing dynamic register 2026-02-06 15:05:49 +08:00
Chranos
87f96e1001 testing dynamic register 2026-02-06 15:05:49 +08:00
Chranos
e1a2afd244 testing dynamic register 2026-02-06 15:05:49 +08:00
Chranos
63a1a05999 testing dynamic register 2026-02-06 15:05:49 +08:00
Chranos
6d814b0cd4 testing dynamic register 2026-02-06 15:05:48 +08:00
Chranos
dc239a740c testing dynamic register 2026-02-06 15:05:48 +08:00
Chranos
a476b6458b testing dynamic register 2026-02-06 15:05:48 +08:00
Chranos
80e9a636af testing dynamic register 2026-02-06 15:05:48 +08:00
Chranos
16353d5d2a testing dynamic register 2026-02-06 15:05:48 +08:00
Chranos
70bee4e3ec testing dynamic register 2026-02-06 15:05:48 +08:00
Chranos
83c958a7c5 testing dynamic register 2026-02-06 15:05:48 +08:00
Chranos
9b84dd52be testing dynamic register 2026-02-06 15:05:48 +08:00
Chranos
2cb9f6ce1d testing dynamic register 2026-02-06 15:05:48 +08:00
31e7cd3bf9 删除 .DS_Store 2026-02-05 16:21:10 +08:00
Chranos
6b650ae280 add gitignore 2026-02-05 16:19:33 +08:00
Chranos
92f0016e6f add dynamic register 2026-02-05 15:53:43 +08:00
Chranos
9563c9af0d opt llama3 2026-02-05 11:53:52 +08:00
Chranos
3b3e614cb6 opt llama3 2026-02-05 11:42:01 +08:00
Chranos
3cf13dd8c5 add ops 2026-02-04 17:51:35 +08:00
Chranos
79dfc69789 add ops v0.0.2 2026-02-04 17:39:32 +08:00
Chranos
8511fe8530 add qwen3 v0.0.1 2026-02-04 17:22:39 +08:00
d1c0f68ab4 add more test results 2025-10-17 14:22:33 +08:00
zhousha
5981370919 update Dockerfile 2025-09-19 11:20:33 +08:00
608fa14c31 feature: add 2025-08-27 16:55:00 +08:00
fdf948f401 feature: add 2025-08-27 16:54:23 +08:00
b9e9a99423 feature: add 2025-08-27 16:52:53 +08:00
62f765bf4b feature: add 2025-08-27 16:49:09 +08:00
461c66ccc0 feature: add 2025-08-26 14:23:17 +08:00