Commits · e6f39bce8f3f122c9b7f9d444c8ea49f274accfe · OpenDAS / ollama

04 Aug, 2025 2 commits

cuda graph
Michael Yang authored Jul 31, 2025

e6f39bce

Daniel Hiltgen authored Jul 16, 2025

This implements the Open Compute Microscaling (MX) FP4 format
as a tensor type with backend implementations focusing
on mulmat and mulmatid on CPU, CUDA, and Metal.

4fb47ed3
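
For readers unfamiliar with the MX FP4 format named in the commit message above, the following is a minimal sketch of how one block could be decoded, not the code from this commit. It assumes the layout described in the OCP Microscaling specification: 4-bit E2M1 elements (1 sign bit, 2 exponent bits, 1 mantissa bit) that share one 8-bit E8M0 power-of-two scale per block of 32 elements. The function names are hypothetical, and the sketch takes already-unpacked 4-bit codes, since the byte packing used by this repository is not shown here.

```python
import math

# Magnitudes representable by E2M1, indexed by the low 3 bits of the code.
# Bit 3 of the code is the sign bit.
E2M1_MAGNITUDES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]


def decode_e2m1(code):
    """Decode one 4-bit E2M1 code (0..15) to a float."""
    magnitude = E2M1_MAGNITUDES[code & 0x7]
    return -magnitude if code & 0x8 else magnitude


def decode_e8m0(scale_byte):
    """Decode the shared E8M0 scale: 2**(e - 127), with 0xFF reserved for NaN."""
    if scale_byte == 0xFF:
        return math.nan
    return 2.0 ** (scale_byte - 127)


def dequantize_block(scale_byte, codes):
    """Dequantize one block: every element is multiplied by the shared scale.

    `codes` is a sequence of unpacked 4-bit values (hypothetical input layout;
    an MX block holds 32 of them).
    """
    scale = decode_e8m0(scale_byte)
    return [scale * decode_e2m1(code) for code in codes]


if __name__ == "__main__":
    # Scale byte 128 encodes 2**1, so every element value is doubled.
    print(dequantize_block(128, [0x1, 0x7, 0x9, 0xF]))
```

Because the scale is a pure power of two, applying it only shifts the exponent of each element, which is part of why matrix-multiply kernels such as the mulmat paths mentioned above can handle the scaling once per block rather than once per element.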