leejet
0a1b3982cd
ggml: add ops for WAN video model (cuda && cpu) (#15669)
* add conv3d support
* add ggml_pad_ext for cpu & cuda backend
* cuda/cpu: add im2col_3d support
* cuda: make im2col a little faster
* fix cuda pad/scale/im2col3d
* make im2col_3d faster
* gguf: support loading tensors which n_dims > GGML_MAX_DIMS
* fix cuda get_rows
* avoid ggml_conv_3d conflict
* correct GGML_OP_COUNT assertion
* avoid build failure
* avoid build failure on MacOS
* cuda: remove unnecessary MIN define
* fix cpu im2col_3d
* adjust the code style
* cuda: use simpler loop in get_rows
* add test_im2col_3d to test-backend-ops
* test-backend-ops.cpp: remove trailing whitespace
* cpu: im2col_3d support non continuous src
Co-authored-by: Jeff Bolz <jbolz@nvidia.com>
* fix test_im2col_3d
* remove unused variables
* cuda: get_rows: dfloat2 -> float2
* add test_pad_ext to test-backend-ops.cpp
* add gguf_init_from_file_ext impl
* Revert "gguf: support loading tensors which n_dims > GGML_MAX_DIMS"
This reverts commit d8377a0a37f314bd3713fe043b4333ad661610c1.
* Revert "add gguf_init_from_file_ext impl"
This reverts commit d9f1d13208c68ef83b3538201ac7f31614fb1994.
* update ggml_backend_vk_device_supports_op
* fix ggml_backend_vk_device_supports_op
* update other backend supports op for ggml_pad_ext
* metal/opencl/sycl/vulkan: fix GGML_OP_PAD check in supports_op
---------
Co-authored-by: Jeff Bolz <jbolz@nvidia.com>
2025-09-04 10:38:49 +02:00
..
2025-02-28 14:41:47 +01:00
2025-08-31 15:49:03 +02:00
2024-11-14 18:04:35 +01:00
2024-11-14 18:04:35 +01:00
2025-05-01 09:58:44 +03:00
2025-06-27 16:41:40 +03:00
2024-11-14 18:04:35 +01:00
2025-02-15 16:40:57 +02:00
2024-12-13 12:23:52 -08:00
2025-08-14 12:03:57 +02:00
2025-04-25 10:08:08 +03:00
2024-11-14 18:04:35 +01:00
2025-02-10 07:17:21 +01:00
2025-07-16 18:18:51 +03:00
2025-08-15 21:11:22 +08:00
2025-09-04 10:38:49 +02:00
2025-01-07 18:01:58 +01:00