Commit Graph

5 Commits

Author SHA1 Message Date
Yineng Zhang
3dbd73d319 minor: rm unused _grouped_size_compiled_for_decode_kernels (#2299) 2024-12-01 19:24:12 +08:00
Yineng Zhang
118b6af35e feat: add should_use_tensor_core (#2179) 2024-12-01 18:01:16 +08:00
Liangsheng Yin
99ec439da4 Organize Attention Backends (#1547) 2024-09-30 15:54:18 -07:00
Lianmin Zheng
3a6e8b6d78 [Minor] move triton attention kernels into a separate folder (#1379) 2024-09-10 15:15:08 -07:00
Ying Sheng
995af5a54b Improve the structure of CI (#911) 2024-08-03 23:09:21 -07:00