"driver/include/device_tensor.hpp" did not exist on "a34977df3bb0dab6e3c08c5e56ab8b572238f209"
Support max3 in smoothquant and add + rmsnorm + rdquant (#1654)
* Fix cmake example build
* Support max3 in smoothquant one pass
* Support max3 in two pass
* Support max3 in add_rmsnorm_rdquant