Support 10-bit LogFMT (simulated version) (#284)
* Add LogFMT interface
* Update comments
* Add simulated code
* Fix comments
* Change to 128 channels
* Add notes
* Optimize performance
* Optimize simulated 10-bit LogFMT
* Minor fix
* Stronger low latency tests
* Minor fix
* Stronger low latency tests for logfmt
* Optimize LogFMT simulation: lg2/ex2 PTX, step_inv
* Minor fix
* Minor fix
* Add non-logfmt bench
* Fix value=0 for logfmt
* Optimize performance
* Refactor tests
---------
Co-authored-by: Zhean Xu <xza@deepseek.com>