- 04 Aug, 2025 1 commit
-
-
Michael Yang authored
-
- 26 Jun, 2025 1 commit
-
-
Michael Yang authored
* update patches
* cherry pick metal mean kernel
* cherry pick cuda mean kernel
* gemma3n
-
- 21 May, 2025 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
* feat: qwen3 dense
* feat: qwen3moe
* fix llama4 moe
-
- 14 May, 2025 1 commit
-
-
Bruce MacDonald authored
-
- 25 Apr, 2025 1 commit
-
-
Michael Yang authored
-
- 03 Apr, 2025 1 commit
-
-
Bruce MacDonald authored
Mistral is a popular research lab making open source models. This updates the forward pass of llama-architecture models to support both Llama and Mistral models. It accounts for the additional metadata present in Mistral models and finds the correct dimensions for the output projection.
-
- 11 Mar, 2025 1 commit
-
-
Patrick Devine authored
-
- 14 Feb, 2025 1 commit
-
-
Jesse Gross authored
This allows the list of models to live in its own file, separate from the runner code.
-