Chranos
91cd25a8d1
update readme
2026-02-11 17:59:22 +08:00
Chranos
bc9ae6a58a
update readme
2026-02-11 17:57:20 +08:00
Chranos
cfc0614191
update readme
2026-02-11 17:47:15 +08:00
Chranos
01560f8227
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
19c3cfb624
add llama4
2026-02-11 17:47:15 +08:00
Chranos
8a85f8580f
add llama4
2026-02-11 17:47:15 +08:00
Chranos
5457f79dbb
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
1f77771852
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
e0bd67be53
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
d860f71e4d
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
8657cbec87
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
597187b7e5
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
72507b7703
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
a69129d5b5
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
f6d6f69abc
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
9b05d7285e
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
cba7ad6c59
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
db876765ed
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
78814aaa68
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
45e1fa8bb3
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
1a3e04b0e4
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
5ed7baa68e
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
a21eae79a1
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
5da783780d
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
5c980830a0
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
00083a1c76
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
4ed73b2ef6
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
386b7ec8c7
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
01c0b3d345
add deepseekv3 and llama4
2026-02-11 17:47:15 +08:00
Chranos
4a72c4c91a
add deepseekv3
2026-02-11 17:47:15 +08:00
Chranos
9acbda437e
add deepseekv3
2026-02-11 17:47:15 +08:00
Chranos
dfb4cff2fc
add deepseekv3
2026-02-11 17:47:15 +08:00
Chranos
6c222e8f14
add deepseekv3
2026-02-11 17:47:15 +08:00
Chranos
3ec228b6fa
add deepseekv3
2026-02-11 17:47:15 +08:00
Chranos
463fbf8cd1
add qwen3_moe
2026-02-11 17:47:15 +08:00
Chranos
6f6997bafb
add qwen3_moe
2026-02-11 17:47:14 +08:00
Chranos
6479429662
add qwen3_moe
2026-02-11 17:47:14 +08:00
Chranos
2a9f483af8
add qwen3_moe
2026-02-11 17:47:14 +08:00
Chranos
cf92e95688
add qwen3_moe
2026-02-11 17:47:14 +08:00
Chranos
d7f5ef1db9
add qwen3_moe
2026-02-11 17:47:14 +08:00
Chranos
de8fc97532
debugging
2026-02-11 17:47:14 +08:00
Chranos
893eeb2208
debugging
2026-02-11 17:47:14 +08:00
Chranos
8f2ae4f67e
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
89dc931222
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
a7028ae481
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
2e24d45668
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
5b9e02990a
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
ff94650fd1
add gemma3
2026-02-11 17:47:14 +08:00
Chranos
464beead22
fix: handle missing tie_word_embeddings attr in MPTConfig
...
Use getattr with default True for MPTConfig.tie_word_embeddings,
as some MPT model configs lack this attribute.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com >
2026-02-11 17:47:14 +08:00
Chranos
ad087d5cf3
debugging
2026-02-11 17:47:14 +08:00