Commit Graph

8 Commits

Author SHA1 Message Date
Yineng Zhang
ebd9dbe71b fix: revert #8593 (#9581) 2025-08-25 01:29:06 -07:00
Pavani Majety
3cc3d9b950 Add Support for Page Size greater than 1 for Flashinfer MLA Backend (#8593)
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
2025-08-21 18:15:06 -07:00
fzyzcjy
15ddd84322 Add retry for flaky tests in CI (#4755) 2025-03-25 16:53:12 -07:00
lukec
21d485f835 Fix test_create_kvindices unit test (#4452) 2025-03-15 16:01:04 -07:00
Liangsheng Yin
99ec439da4 Organize Attention Backends (#1547) 2024-09-30 15:54:18 -07:00
Lianmin Zheng
fec185ce0c Refactor attention backend (#1381) 2024-09-11 11:44:26 -07:00
Liangsheng Yin
144bc70fcc Organize flashinfer indices update (#1378) 2024-09-10 17:38:59 -07:00
xiaobochen
d134c139a1 Optimize the update flashinfer indices (#1262) 2024-08-31 23:40:28 -07:00