"tests/vscode:/vscode.git/clone" did not exist on "53ffe40e5d8e7892b2454336746ad7ff0064c762"
  1. 21 May, 2025 2 commits
  2. 20 May, 2025 1 commit
  3. 19 May, 2025 2 commits
  4. 15 May, 2025 2 commits
  5. 14 May, 2025 2 commits
  6. 09 May, 2025 4 commits
  7. 08 May, 2025 1 commit
    • Graham King's avatar
      feat: Qwen3, Gemma3 and Llama4 support (#1002) · ceaeba3e
      Graham King authored
      . New mistralrs and llamacpp version
      . mistralrs: Handle Gemma 3 and Llama 4 as vision models
      . Update the dynamo-run docs to use Qwen 3
      . Our pre-processor now supports Llama 4's newer multi-modal `config.json`
      . Upgrade minijinja to handle Qwen 3's prompt template
      
      For Llama 4 we'll need to limit the max seq len. vllm says:
      > To serve at least one request with the models's max seq len (10485760), (240.00 GiB KV cache is needed,...
      
      I was able to run Llama 4 with llamacpp and a quantized GGUF, with Dynamo doing the pre-processing.
      ceaeba3e
  8. 07 May, 2025 3 commits
  9. 06 May, 2025 3 commits
  10. 05 May, 2025 1 commit
  11. 29 Apr, 2025 2 commits
  12. 28 Apr, 2025 3 commits
  13. 26 Apr, 2025 2 commits
  14. 25 Apr, 2025 2 commits
  15. 24 Apr, 2025 1 commit
  16. 23 Apr, 2025 2 commits
  17. 22 Apr, 2025 1 commit
  18. 21 Apr, 2025 1 commit
  19. 18 Apr, 2025 4 commits
  20. 15 Apr, 2025 1 commit