"sgl-kernel/csrc/gemm/per_tensor_quant_fp8.cu" did not exist on "90bb2be27e498be472af40f5ace8b2d9cd817d1d"
toni057 authored
* Adding FLOPs and size to model metadata
* Adding weight size to quantization models
* Small refactor of rich metadata
* Removing unused code
* Fixing wrong entries
* Adding .DS_Store to gitignore
* Renaming _flops to _ops
* Adding number of operations to quantization models
* Reflecting _flops change to _ops
* Renamed ops and weight size in individual model doc pages
* Linter fixes
* Rounding ops to first decimal
* Rounding num ops and sizes to 3 decimals
* Change naming of columns.
* Update tables

Co-authored-by: Toni Blaslov <tblaslov@fb.com>
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
deba0562