- 03 Dec, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 02 Dec, 2023 2 commits
-
-
Michael Yang authored
handle ctrl+z
-
Michael Yang authored
-
- 01 Dec, 2023 4 commits
-
-
Michael Yang authored
Fix adapter loading from SHA hash
-
Joshua Pham authored
-
Patrick Devine authored
-
Michael Yang authored
* docker: set PATH, LD_LIBRARY_PATH, and capabilities
* example: update k8s gpu manifest
-
- 30 Nov, 2023 6 commits
-
-
Michael Yang authored
revert checksum calculation to calculate-as-you-go
-
Jeffrey Morgan authored
-
James Radtke authored
-
Bruce MacDonald authored
-
Michael Yang authored
-
Michael Yang authored
-
- 29 Nov, 2023 9 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Alec Hammond authored
* Add OllamaEmbeddings to python LangChain example
* typo
Co-authored-by: Alec Hammond <alechammond@fb.com>
-
Bruce MacDonald authored
-
jeremiahbuckley authored
Co-authored-by: Cloud User <azureuser@testgpu2.hqzwom21okjenksna4y3c4ymjd.phxx.internal.cloudapp.net>
-
Timothy Jaeryang Baek authored
Co-authored-by: rootedbox
-
Patrick Devine authored
-
ToasterUwU authored
-
Michael authored
add new recent models as examples
-
- 28 Nov, 2023 3 commits
-
-
Michael Yang authored
progress: fix bar rate
-
Michael Yang authored
-
ftorto authored
Fix a typo in the CA update command
-
- 27 Nov, 2023 3 commits
-
-
Jason Jacobs authored
-
Bruce MacDonald authored
* add remote create to python example client
-
Kasumi authored
-
- 26 Nov, 2023 4 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
Co-authored-by: Wen Sun <iwendellsun@gmail.com>
-
- 24 Nov, 2023 2 commits
-
-
Jing Zhang authored
* Support cuda build in Windows
* Enable dynamic NumGPU allocation for Windows
-
Jongwook Choi authored
When CUDA peer access is enabled, multi-GPU inference produces garbage output. This is a known bug in llama.cpp (or NVIDIA). Until the upstream bug is fixed, we can temporarily disable CUDA peer access to ensure correct output. See #961.
-
- 22 Nov, 2023 5 commits
-
-
Jeffrey Morgan authored
-
Michael Yang authored
fix: gguf int type
-
Michael Yang authored
-
Long Huynh authored
-
Jeffrey Morgan authored
-
- 21 Nov, 2023 1 commit
-
-
Bruce MacDonald authored
-