85 Commits

Author SHA1 Message Date
Chranos
6831553caa update readme 2026-02-11 17:59:08 +08:00
Chranos
b0d16e0b70 update readme 2026-02-11 17:56:27 +08:00
Chranos
8f6fd0512d update readme 2026-02-11 17:44:53 +08:00
Chranos
29b36a6b61 add deepseekv3 and llama4 2026-02-11 17:28:54 +08:00
Chranos
dd221f3084 add llama4 v0.0.6 2026-02-11 17:25:38 +08:00
Chranos
7b4f7d74c3 add llama4 2026-02-11 16:08:37 +08:00
Chranos
16d41a8fc1 add deepseekv3 and llama4 2026-02-11 16:03:06 +08:00
Chranos
633aa4db30 add deepseekv3 and llama4 2026-02-11 15:58:34 +08:00
Chranos
6eae065dd6 add deepseekv3 and llama4 2026-02-11 15:48:35 +08:00
Chranos
e752946445 add deepseekv3 and llama4 2026-02-11 15:44:44 +08:00
Chranos
7626238695 add deepseekv3 and llama4 2026-02-11 15:40:19 +08:00
Chranos
f3a4d10195 add deepseekv3 and llama4 2026-02-11 15:39:35 +08:00
Chranos
ed6a2aff91 add deepseekv3 and llama4 2026-02-11 15:37:19 +08:00
Chranos
6faa595799 add deepseekv3 and llama4 2026-02-11 15:32:07 +08:00
Chranos
50e02f2011 add deepseekv3 and llama4 2026-02-11 15:27:19 +08:00
Chranos
c584139543 add deepseekv3 and llama4 2026-02-11 15:24:13 +08:00
Chranos
2ad23aa8da add deepseekv3 and llama4 2026-02-11 15:17:07 +08:00
Chranos
86fd3b5a92 add deepseekv3 and llama4 2026-02-11 15:13:14 +08:00
Chranos
eaeb5169e0 add deepseekv3 and llama4 2026-02-11 15:09:59 +08:00
Chranos
44ffd2094a add deepseekv3 and llama4 2026-02-11 15:07:52 +08:00
Chranos
5132af6176 add deepseekv3 and llama4 2026-02-11 15:05:55 +08:00
Chranos
5c4c2222ba add deepseekv3 and llama4 2026-02-11 15:03:30 +08:00
Chranos
026380fddb add deepseekv3 and llama4 2026-02-11 14:56:40 +08:00
Chranos
d9d1f3a724 add deepseekv3 and llama4 2026-02-11 14:39:48 +08:00
Chranos
d93c740e4d add deepseekv3 and llama4 2026-02-11 14:37:00 +08:00
Chranos
153bc4ec7b add deepseekv3 and llama4 2026-02-11 14:32:37 +08:00
Chranos
96ed925486 add deepseekv3 and llama4 2026-02-11 14:30:01 +08:00
Chranos
8ac7afcbd3 add deepseekv3 and llama4 2026-02-11 14:26:59 +08:00
Chranos
128aed196c add deepseekv3 and llama4 2026-02-11 14:19:17 +08:00
Chranos
659ef273c8 add deepseekv3 2026-02-11 13:18:03 +08:00
Chranos
98003e6f8b add deepseekv3 2026-02-11 13:12:46 +08:00
Chranos
094541296e add deepseekv3 2026-02-11 12:28:36 +08:00
Chranos
5a05c22162 add deepseekv3 2026-02-11 11:40:57 +08:00
Chranos
60f3a23d5f add deepseekv3 2026-02-11 11:35:12 +08:00
Chranos
9c1d7cc9ff add qwen3_moe 2026-02-10 18:55:35 +08:00
Chranos
934ed88691 add qwen3_moe v0.0.5 2026-02-10 18:30:48 +08:00
Chranos
fa0219fbf8 add qwen3_moe 2026-02-10 18:22:13 +08:00
Chranos
efbb06147a add qwen3_moe 2026-02-10 18:18:32 +08:00
Chranos
a26729bf7f add qwen3_moe 2026-02-10 18:09:58 +08:00
Chranos
8a613d15bd add qwen3_moe 2026-02-10 18:02:40 +08:00
Chranos
a6f39375e5 debugging 2026-02-10 16:10:28 +08:00
Chranos
afc34d988e debugging 2026-02-10 15:47:48 +08:00
Chranos
fa194c215b add gemma3 v0.0.4 2026-02-10 14:52:56 +08:00
Chranos
5fbe8b20a7 add gemma3 2026-02-10 14:26:03 +08:00
Chranos
2dad4e71c5 add gemma3 2026-02-10 14:15:33 +08:00
Chranos
cb1846cd4f add gemma3 2026-02-10 14:10:04 +08:00
Chranos
81fc273396 add gemma3 2026-02-10 14:06:26 +08:00
Chranos
3ef89630ab add gemma3 2026-02-10 13:00:25 +08:00
Chranos
40dee08f7b fix: handle missing tie_word_embeddings attr in MPTConfig
Use getattr with default True for MPTConfig.tie_word_embeddings,
as some MPT model configs lack this attribute.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-02-09 17:47:18 +08:00
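
The fix described in commit 40dee08f7b can be sketched roughly as follows. This is a minimal illustration, not the actual patch: the helper name `resolve_tie_word_embeddings` and the `SimpleNamespace` stand-ins for MPT config objects are assumptions made for the example. Only the use of `getattr` with a default of `True` comes from the commit message.

```python
from types import SimpleNamespace


def resolve_tie_word_embeddings(config, default=True):
    """Return config.tie_word_embeddings, or `default` if the attribute is missing.

    Hypothetical helper: the commit message only states that getattr with a
    default of True is used for MPTConfig.tie_word_embeddings.
    """
    return getattr(config, "tie_word_embeddings", default)


# Stand-ins for real config objects (assumed shapes, for illustration only).
legacy_config = SimpleNamespace(d_model=4096)  # lacks the attribute
explicit_config = SimpleNamespace(d_model=4096, tie_word_embeddings=False)

print(resolve_tie_word_embeddings(legacy_config))    # attribute missing, falls back to True
print(resolve_tie_word_embeddings(explicit_config))  # attribute present, value is respected
```

A direct attribute access (`config.tie_word_embeddings`) would raise `AttributeError` for configs that lack the field, which is the failure the commit message says it avoids.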
Chranos
1d70f93cfc debugging 2026-02-09 15:24:55 +08:00