Add fp8 inference
Showing
inference-fp8/config.json
0 → 100644
inference-fp8/convert.py
0 → 100644
inference-fp8/generate.py
0 → 100644
inference-fp8/kernel.py
0 → 100644
inference-fp8/model.py
0 → 100644
This diff is collapsed.
Please register or sign in to comment