85 Commits

Author SHA1 Message Date
Chranos
91cd25a8d1 update readme 2026-02-11 17:59:22 +08:00
Chranos
bc9ae6a58a update readme 2026-02-11 17:57:20 +08:00
Chranos
cfc0614191 update readme 2026-02-11 17:47:15 +08:00
Chranos
01560f8227 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
19c3cfb624 add llama4 2026-02-11 17:47:15 +08:00
Chranos
8a85f8580f add llama4 2026-02-11 17:47:15 +08:00
Chranos
5457f79dbb add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
1f77771852 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
e0bd67be53 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
d860f71e4d add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
8657cbec87 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
597187b7e5 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
72507b7703 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
a69129d5b5 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
f6d6f69abc add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
9b05d7285e add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
cba7ad6c59 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
db876765ed add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
78814aaa68 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
45e1fa8bb3 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
1a3e04b0e4 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
5ed7baa68e add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
a21eae79a1 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
5da783780d add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
5c980830a0 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
00083a1c76 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
4ed73b2ef6 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
386b7ec8c7 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
01c0b3d345 add deepseekv3 and llama4 2026-02-11 17:47:15 +08:00
Chranos
4a72c4c91a add deepseekv3 2026-02-11 17:47:15 +08:00
Chranos
9acbda437e add deepseekv3 2026-02-11 17:47:15 +08:00
Chranos
dfb4cff2fc add deepseekv3 2026-02-11 17:47:15 +08:00
Chranos
6c222e8f14 add deepseekv3 2026-02-11 17:47:15 +08:00
Chranos
3ec228b6fa add deepseekv3 2026-02-11 17:47:15 +08:00
Chranos
463fbf8cd1 add qwen3_moe 2026-02-11 17:47:15 +08:00
Chranos
6f6997bafb add qwen3_moe 2026-02-11 17:47:14 +08:00
Chranos
6479429662 add qwen3_moe 2026-02-11 17:47:14 +08:00
Chranos
2a9f483af8 add qwen3_moe 2026-02-11 17:47:14 +08:00
Chranos
cf92e95688 add qwen3_moe 2026-02-11 17:47:14 +08:00
Chranos
d7f5ef1db9 add qwen3_moe 2026-02-11 17:47:14 +08:00
Chranos
de8fc97532 debugging 2026-02-11 17:47:14 +08:00
Chranos
893eeb2208 debugging 2026-02-11 17:47:14 +08:00
Chranos
8f2ae4f67e add gemma3 2026-02-11 17:47:14 +08:00
Chranos
89dc931222 add gemma3 2026-02-11 17:47:14 +08:00
Chranos
a7028ae481 add gemma3 2026-02-11 17:47:14 +08:00
Chranos
2e24d45668 add gemma3 2026-02-11 17:47:14 +08:00
Chranos
5b9e02990a add gemma3 2026-02-11 17:47:14 +08:00
Chranos
ff94650fd1 add gemma3 2026-02-11 17:47:14 +08:00
Chranos
464beead22 fix: handle missing tie_word_embeddings attr in MPTConfig
Use getattr with default True for MPTConfig.tie_word_embeddings,
as some MPT model configs lack this attribute.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-02-11 17:47:14 +08:00
Chranos
ad087d5cf3 debugging 2026-02-11 17:47:14 +08:00