"megatron/legacy/mpu/tests/test_random.py" did not exist on "7abd3e90d09cf02a3a019c9a2af67ee31d5b1bd4"
- 02 Dec, 2025 1 commit
-
-
Patrick Devine authored
This change: * fixes rope scaling in the mistral converter * updates ministral to include llama4 scaling * includes a new ministral parser for parsing reasoning and tool calling --------- Co-authored-by:jmorganca <jmorganca@gmail.com>
-
- 20 Nov, 2025 1 commit
-
-
Grace authored
-
- 14 Oct, 2025 1 commit
-
-
Devon Rifkin authored
-