Commit aeabd783 authored by Azure-Tang

update git action env, add BALANCE_SERVE=1

parent 31677181
@@ -163,6 +163,8 @@ jobs:
       - name: build for cuda
         if: matrix.cuda != ''
+        env:
+          BALANCE_SERVE: "1"
         run: |
           git submodule init
           git submodule update
...
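The hunk above exports `BALANCE_SERVE=1` into the CUDA build step's environment. As a minimal sketch of how a build script might gate a feature on such a flag (the `-DUSE_BALANCE_SERVE` CMake option below is illustrative, not necessarily what the project's install.sh actually checks):

```shell
#!/bin/sh
# Gate an optional feature on the BALANCE_SERVE environment variable.
# Default to "0" when the variable is unset.
if [ "${BALANCE_SERVE:-0}" = "1" ]; then
    CMAKE_ARGS="-DUSE_BALANCE_SERVE=ON"   # hypothetical flag name
else
    CMAKE_ARGS=""
fi
echo "cmake args: ${CMAKE_ARGS}"
```

Setting the variable under the step's `env:` key (rather than inline in `run:`) keeps it visible to every command in that step.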
@@ -2,11 +2,12 @@
 # How to Run DeepSeek-R1
-- [Preparation](#preparation)
-- [Installation](#installation)
-- [Attention](#attention)
-- [Supported models include:](#supported-models-include)
-- [Support quantize format:](#support-quantize-format)
+- [How to Run DeepSeek-R1](#how-to-run-deepseek-r1)
+  - [Preparation](#preparation)
+  - [Installation](#installation)
+  - [Attention](#attention)
+  - [Supported models include](#supported-models-include)
+  - [Support quantize format](#support-quantize-format)

 In this document, we will show you how to install and run KTransformers on your local machine. There are two versions:
@@ -87,7 +88,7 @@ sudo apt install libtbb-dev libssl-dev libcurl4-openssl-dev libaio1 libaio-dev l
 for windows we prepare a pre compiled whl package on [ktransformers-0.2.0+cu125torch24avx2-cp312-cp312-win_amd64.whl](https://github.com/kvcache-ai/ktransformers/releases/download/v0.2.0/ktransformers-0.2.0+cu125torch24avx2-cp312-cp312-win_amd64.whl), which require cuda-12.5, torch-2.4, python-3.11, more pre compiled package are being produced. -->
-* Download source code and compile:
+Download source code and compile:
 - init source code
@@ -122,7 +123,7 @@ sudo apt install libtbb-dev libssl-dev libcurl4-openssl-dev libaio1 libaio-dev l
 ```shell
 sudo env USE_BALANCE_SERVE=1 USE_NUMA=1 PYTHONPATH="\$(which python)" PATH="\$(dirname \$(which python)):\$PATH" bash ./install.sh
 ```
-- For Windows
+- For Windows (Windows native temporarily deprecated, please try WSL)
 ```shell
 install.bat
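The `sudo env USE_BALANCE_SERVE=1 ... PYTHONPATH=... PATH=...` invocation quoted above passes variables explicitly because sudo normally resets the environment (see `secure_path` in sudoers), which would otherwise drop `PYTHONPATH` and the caller's Python from `PATH`. A small demonstration of the principle, using `env -i` to simulate a stripped environment (the `/usr/bin/python` value is just an example):

```shell
#!/bin/sh
# A stripped environment has no PYTHONPATH at all:
env -i sh -c 'echo "PYTHONPATH inside stripped env: [$PYTHONPATH]"'
# Passing the variable explicitly restores it, which is what
# `sudo env USE_BALANCE_SERVE=1 PYTHONPATH=... bash ./install.sh` does:
env -i PYTHONPATH="/usr/bin/python" sh -c 'echo "PYTHONPATH passed through: [$PYTHONPATH]"'
```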
@@ -183,7 +184,7 @@ It features the following arguments:
 <details>
 <summary>Supported Models/quantization</summary>
-### Supported models include:
+### Supported models include
 | ✅**Supported Models** | ❌**Deprecated Models** |
@@ -197,12 +198,14 @@ It features the following arguments:
 | Mixtral-8x7B | |
 | Mixtral-8x22B | |
-### Support quantize format:
+### Support quantize format
 | ✅**Supported Formats** | ❌**Deprecated Formats** |
 | ----------------------- | ------------------------ |
-| Q2_K_L | ~~IQ2_XXS~~ |
+| IQ1_S | ~~IQ2_XXS~~ |
+| IQ2_XXS | |
+| Q2_K_L | |
 | Q2_K_XS | |
 | Q3_K_M | |
 | Q4_K_M | |
...