Merge pull request #990 from InfiniTensor/demo131
Demo-131 CUDA graph with optimized paged attention
test/infiniop/w8a8int8.py
0 → 100644
xmake/ali.lua
0 → 100644