"router/vscode:/vscode.git/clone" did not exist on "142cdabed377772b763fc8d79a131b16ed991718"
  1. 05 May, 2025 1 commit
  2. 21 Mar, 2025 1 commit
  3. 20 Mar, 2025 1 commit
  4. 14 Feb, 2025 1 commit
    • Michael Yang's avatar
      next ollama runner (#7913) · 58245413
      Michael Yang authored
      
      
      feat: add new Ollama engine using ggml through cgo
      
      This change introduces a new way to run pretrained models. It introduces 3 high level interfaces and a bunch of smaller helper interfaces to facilitate this.
      
      - `model.Model` defines the interface for a model architecture. Models such as `llama` and `mllama`, which are provided as examples, can implement the model's forward propagation in the `Forward` method. This method will be called to generate completions. This interface can be found in `model/model.go`
      - `ml.Backend` defines the interface for a backend tensor library, in this case `ggml`. Among other things, a Backend is responsible for loading a pretrained model into hardware (GPU, CPU, etc) and providing an interface for Models to access loaded tensors. This interface can be found in `ml/backend.go`
      - `ml.Tensor` defines the interface for a tensor and tensor operations
      
      This is the first implementation of the new engine. Follow up PRs will implement more features:
      
      - non-greedy sampling (#8410)
      - integration with Ollama and KV caching (#8301)
      - more model support (#9080) with more coming soon
      Co-authored-by: default avatarBruce MacDonald <brucewmacdonald@gmail.com>
      58245413
  5. 21 Jan, 2025 1 commit
  6. 16 Jan, 2025 1 commit
  7. 11 Jan, 2025 1 commit
  8. 01 Jan, 2025 1 commit
  9. 10 Dec, 2024 1 commit
  10. 14 Nov, 2024 1 commit
  11. 06 Nov, 2024 1 commit
  12. 02 Aug, 2024 1 commit
  13. 27 Jul, 2024 1 commit
  14. 01 Jul, 2024 2 commits
  15. 27 Jun, 2024 2 commits
  16. 13 Jun, 2024 1 commit
  17. 04 Jun, 2024 1 commit
  18. 20 May, 2024 1 commit
  19. 07 May, 2024 1 commit
  20. 01 May, 2024 6 commits
  21. 25 Jan, 2024 1 commit
  22. 05 Jan, 2024 1 commit