Commit 592296cd authored by Chenggang Zhao

Add some plans

parent 1553fc42
......
@@ -280,6 +280,13 @@ For two micro-batch overlapping, you can refer to the following figure. With our
![low-latency](figures/low-latency.png)
## Roadmap
- [ ] A100 support (intranode only)
- [ ] Support BF16 for the low-latency dispatch kernel
- [ ] Support NVLink protocol for intranode low-latency kernels
- [ ] SM-free normal kernels
## Notices
#### Easier potential overall design
......