Files
sglang/sgl-kernel/python/sgl_kernel/attention.py

8 lines
195 B
Python
Raw Normal View History

import torch
def lightning_attention_decode(q, k, v, past_kv, slope, output, new_kv):
torch.ops.sgl_kernel.lightning_attention_decode(
q, k, v, past_kv, slope, output, new_kv
)