Xuan-Son Nguyen
8f22dc0a53
model : add hunyuan moe (#14425)
* model : add hunyuan moe
* tokenizer ok
* fix tensor name
* cgraph init
* chat template
* wip
* almost working
* skip embed, fix bos
* cleanup
* yarn scaling
* cleanup
* correct rope type
* failed token fix
* ntk alpha freq_base
* tokenization working
* cleanup and pr changes
* vocab_size sanity check
* ntk alpha generic
* Update convert_hf_to_gguf.py
* Apply suggestions from code review
* fix regression
* fix style
---------
Co-authored-by: kooshi <1934337+kooshi@users.noreply.github.com>
2025-07-08 11:24:06 +03:00