- 20 Mar, 2026 1 commit
-
-
Martin Hickey authored
Signed-off-by:Martin Hickey <martin.hickey@ie.ibm.com>
-
- 18 Mar, 2026 1 commit
-
-
Thillai Chithambaram authored
Signed-off-by:Thillai Chithambaram <thillaichithambaram.a@gmail.com>
-
- 23 Feb, 2026 1 commit
-
-
Mark McLoughlin authored
Export the existing Model FLOPs Utilization (MFU) metrics via Prometheus. `--enable-mfu-metrics` is required for these to be exposed. Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
Mark McLoughlin <markmc@redhat.com>
-
- 27 Jan, 2026 1 commit
-
-
omkhalil authored
Fix UnembedMetrics to correctly count FLOPs for the unembedding (LM head) layer. The bug: UnembedMetrics used total_num_tokens() which counts all tokens in the batch for projection flops, vocab projections are run on just the last token for the autoregressive use case. Co-authored-by:Omar Mohamed Khalil <omarkhalil@meta.com>
-
- 18 Dec, 2025 1 commit
-
-
SungMinCho authored
Signed-off-by:
SungMinCho <tjdals4565@gmail.com> Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Mark McLoughlin <markmc@redhat.com>
-