enginex-ascend-910-llama.cpp

acc.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

add.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

argmax.comp

vulkan: implement several ops relevant for ggml_opt (#11769)

2025-02-17 07:55:57 +01:00

argsort.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

clamp.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

CMakeLists.txt

fix: ggml: fix vulkan-shaders-gen build (#10448)

2025-01-15 14:17:42 +01:00

concat.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

contig_copy.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

copy_from_quant.comp

vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

2025-02-15 09:01:40 +01:00

copy_to_quant.comp

vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

2025-02-15 09:01:40 +01:00

copy.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

cos.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

count_equal.comp

vulkan: implement several ops relevant for ggml_opt (#11769)

2025-02-17 07:55:57 +01:00

dequant_f32.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

dequant_funcs_cm2.comp

vulkan: matmul dequantization improvements (#12015)

2025-02-28 08:20:08 +01:00

dequant_funcs.comp

vulkan: matmul dequantization improvements (#12015)

2025-02-28 08:20:08 +01:00

dequant_head.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

dequant_iq1_m.comp

vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

2025-02-15 09:01:40 +01:00

dequant_iq1_s.comp

vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

2025-02-15 09:01:40 +01:00

dequant_iq2_s.comp

vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360)

2025-01-29 18:29:39 +01:00

dequant_iq2_xs.comp

vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360)

2025-01-29 18:29:39 +01:00

dequant_iq2_xxs.comp

vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360)

2025-01-29 18:29:39 +01:00

dequant_iq3_s.comp

vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360)

2025-01-29 18:29:39 +01:00

dequant_iq3_xxs.comp

vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360)

2025-01-29 18:29:39 +01:00

dequant_iq4_nl.comp

vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360)

2025-01-29 18:29:39 +01:00

dequant_iq4_xs.comp

vulkan: initial support for IQ4_XS quantization (#11501)

2025-02-06 07:09:59 +01:00

dequant_q2_k.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

dequant_q3_k.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

dequant_q4_0.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

dequant_q4_1.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

dequant_q4_k.comp

Vulkan: Use improved q4_k and q5_k dequant code in dequant shaders (#10798)

2024-12-12 18:36:00 +01:00

dequant_q5_0.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

dequant_q5_1.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

dequant_q5_k.comp

Vulkan: Use improved q4_k and q5_k dequant code in dequant shaders (#10798)

2024-12-12 18:36:00 +01:00

dequant_q6_k.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

dequant_q8_0.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

diag_mask_inf.comp

vulkan: fix diag_mask_inf (#11323)

2025-01-23 08:01:17 +01:00

div.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

flash_attn_cm2.comp

vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

2025-02-15 09:01:40 +01:00

gelu_quick.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

gelu.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

generic_binary_head.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

generic_head.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

generic_unary_head.comp

vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl (#11166)

2025-01-16 22:47:10 +01:00

get_rows_quant.comp

vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (#11595)

2025-02-28 09:42:52 +01:00

get_rows.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

group_norm.comp

vulkan: fix group_norm (#10496)

2024-11-26 16:45:05 +01:00

im2col.comp

vulkan: improve im2col (#11826)

2025-02-28 07:52:51 +01:00

leaky_relu.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

mul_mat_split_k_reduce.comp

vulkan: optimize and reenable split_k (#10637)

2024-12-03 20:29:54 +01:00

mul_mat_vec_base.comp

vulkan: optimize mul_mat for small values of N (#10991)

2024-12-30 18:27:11 +01:00

mul_mat_vec_iq1_m.comp

vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

2025-02-15 09:01:40 +01:00

mul_mat_vec_iq1_s.comp

vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

2025-02-15 09:01:40 +01:00

mul_mat_vec_iq2_s.comp

vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (#11595)

2025-02-28 09:42:52 +01:00

mul_mat_vec_iq2_xs.comp

vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (#11595)

2025-02-28 09:42:52 +01:00

mul_mat_vec_iq2_xxs.comp

vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (#11595)

2025-02-28 09:42:52 +01:00

mul_mat_vec_iq3_s.comp

vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (#11595)

2025-02-28 09:42:52 +01:00

mul_mat_vec_iq3_xxs.comp

vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (#11595)

2025-02-28 09:42:52 +01:00

mul_mat_vec_nc.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

mul_mat_vec_p021.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

mul_mat_vec_q2_k.comp

vulkan: scale caching for k quants + misc fixes (#11081)

2025-01-15 19:50:13 +00:00

mul_mat_vec_q3_k.comp

vulkan: scale caching for k quants + misc fixes (#11081)

2025-01-15 19:50:13 +00:00

mul_mat_vec_q4_k.comp

vulkan: scale caching for k quants + misc fixes (#11081)

2025-01-15 19:50:13 +00:00

mul_mat_vec_q5_k.comp

vulkan: scale caching for k quants + misc fixes (#11081)

2025-01-15 19:50:13 +00:00

mul_mat_vec_q6_k.comp

vulkan: scale caching for k quants + misc fixes (#11081)

2025-01-15 19:50:13 +00:00

mul_mat_vec.comp

vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

2025-02-15 09:01:40 +01:00

mul_mm_cm2.comp

vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

2025-02-15 09:01:40 +01:00

mul_mm.comp

vulkan: matmul dequantization improvements (#12015)

2025-02-28 08:20:08 +01:00

mul.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

norm.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

opt_step_adamw.comp

vulkan: implement several ops relevant for ggml_opt (#11769)

2025-02-17 07:55:57 +01:00

pad.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

pool2d.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

relu.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

repeat_back.comp

vulkan: implement several ops relevant for ggml_opt (#11769)

2025-02-17 07:55:57 +01:00

repeat.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

rms_norm_back.comp

vulkan: implement more backpropagation operators (#11914)

2025-02-25 12:04:45 +01:00

rms_norm.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

rope_head.comp

vulkan: implement more backpropagation operators (#11914)

2025-02-25 12:04:45 +01:00

rope_multi.comp

vulkan: support multi/vision rope, and noncontiguous rope (#11902)

2025-02-16 08:52:23 +01:00

rope_neox.comp

vulkan: support multi/vision rope, and noncontiguous rope (#11902)

2025-02-16 08:52:23 +01:00

rope_norm.comp

vulkan: support multi/vision rope, and noncontiguous rope (#11902)

2025-02-16 08:52:23 +01:00

rope_vision.comp

vulkan: support multi/vision rope, and noncontiguous rope (#11902)

2025-02-16 08:52:23 +01:00

scale.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

sigmoid.comp

add OP sigmoid (#12056)

2025-02-25 12:32:20 +01:00

silu_back.comp

vulkan: implement more backpropagation operators (#11914)

2025-02-25 12:04:45 +01:00

silu.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

sin.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

soft_max_back.comp

vulkan: implement more backpropagation operators (#11914)

2025-02-25 12:04:45 +01:00

soft_max.comp

Vulkan: Fix float16 use on devices without float16 support + fix subgroup_size_control validation error (#11161)

2025-01-10 06:39:33 +01:00

square.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

sub.comp

vulkan: implement several ops relevant for ggml_opt (#11769)

2025-02-17 07:55:57 +01:00

sum_rows.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

tanh.comp

Vulkan: fix NaN in tanh.comp with AMD proprietary driver on Windows (#10723)

2024-12-08 19:19:19 +01:00

test_coopmat2_support.comp

vulkan: compile a test shader in cmake to check for coopmat2 support (#10713)

2024-12-08 09:05:55 +01:00

test_coopmat_support.comp

Disable GL_KHR_cooperative_matrix Vulkan extension if not available. (#11117)

2025-01-08 09:18:13 +01:00

timestep_embedding.comp

ggml : build backends as libraries (#10256)

2024-11-14 18:04:35 +01:00

types.comp

vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (#11595)

2025-02-28 09:42:52 +01:00

upscale.comp

vulkan: Use push constant offset to handle misaligned descriptors (#10987)

2024-12-29 09:35:11 +01:00

vulkan-shaders-gen.cpp

vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (#11595)

2025-02-28 09:42:52 +01:00

wkv6.comp

rwkv6: add wkv6 support for Vulkan backend (#10829)

2024-12-16 22:00:46 +01:00