Bartowski
e74c92e842
model : support GLM 4.6 (make a few NextN/MTP tensors not required) ( #16359 )
...
* Make a few GLM tensors not required
layer.nextn.shared_head_head and layer.nextn.embed_tokens are both excluded from GLM 4.6 resulting in the model not loading after conversion/quantization, this marks those tensors as not required which makes it work
* Update llama-model.cpp
layer.nextn.shared_head_norm also not required in case of future models
2025-09-30 22:24:36 +02:00
..
2025-08-21 17:00:33 +03:00
2025-09-05 17:32:39 -06:00
2025-09-05 17:32:39 -06:00
2025-09-25 19:50:28 +02:00
2025-09-25 19:50:28 +02:00
2025-08-14 14:03:30 +03:00
2025-07-17 19:08:33 +03:00
2025-09-14 23:00:59 +02:00
2025-09-14 23:00:59 +02:00
2025-09-24 16:53:48 +02:00
2025-09-24 16:53:48 +02:00
2025-06-15 10:08:58 +03:00
2025-09-18 12:47:56 +03:00
2025-05-25 01:48:08 +01:00
2025-03-05 13:05:13 +00:00
2025-09-25 19:50:28 +02:00
2025-09-25 11:53:09 +03:00
2025-09-05 10:39:22 +03:00
2025-09-25 19:50:28 +02:00
2025-01-07 18:01:58 +01:00
2025-08-30 16:32:10 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-09-24 16:53:48 +02:00
2025-09-24 16:53:48 +02:00
2025-09-24 16:53:48 +02:00
2025-09-24 16:53:48 +02:00
2025-08-22 12:22:13 +03:00
2025-09-24 16:53:48 +02:00
2025-09-24 16:53:48 +02:00
2025-09-24 16:53:48 +02:00
2025-09-24 16:53:48 +02:00
2025-06-30 18:03:03 +03:00
2025-09-24 16:53:48 +02:00
2025-06-05 11:57:42 +02:00
2025-02-10 20:58:18 +02:00
2025-08-28 18:39:31 -06:00
2025-08-04 20:29:25 +02:00
2025-06-20 14:04:09 +02:00
2025-05-12 14:44:49 +02:00
2025-09-30 22:24:36 +02:00
2025-09-25 19:50:28 +02:00
2025-09-17 09:30:55 +02:00
2025-01-03 10:18:53 +02:00
2025-09-03 18:16:26 +03:00
2025-01-12 11:32:42 +02:00
2025-09-27 02:03:33 +08:00
2025-09-14 23:00:59 +02:00
2025-09-11 22:47:38 +02:00
2024-10-08 13:27:04 +02:00
2024-10-02 15:49:55 +02:00
2025-07-15 21:54:22 +02:00
2025-09-27 02:03:33 +08:00