# Open WebUI

[Open WebUI](https://github.com/open-webui/open-webui) is an extensible, feature-rich,
and user-friendly self-hosted AI platform designed to operate entirely offline.
It supports various LLM runners like Ollama and OpenAI-compatible APIs,
with built-in RAG capabilities, making it a powerful AI deployment solution.

To get started with Open WebUI using vLLM, follow these steps:

1. Install [Docker](https://docs.docker.com/engine/install/).

2. Start the vLLM server with a supported chat completion model:

    ```console
    vllm serve Qwen/Qwen3-0.6B-Chat
    ```

    !!! note
        When starting the vLLM server, be sure to specify the host and port using the `--host` and `--port` flags.
        For example:

        ```console
        vllm serve Qwen/Qwen3-0.6B-Chat --host 0.0.0.0 --port 8000
        ```

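3. Start the Open WebUI Docker container, pointing `OPENAI_API_BASE_URL` at the vLLM server.
   Inside the container, `0.0.0.0` and `localhost` refer to the container itself,
   so the command below reaches a vLLM server running on the Docker host through `host.docker.internal`:

    ```console
    docker run -d \
        --name open-webui \
        -p 3000:8080 \
        --add-host=host.docker.internal:host-gateway \
        -v open-webui:/app/backend/data \
        -e OPENAI_API_BASE_URL=http://host.docker.internal:8000/v1 \
        --restart always \
        ghcr.io/open-webui/open-webui:main
    ```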

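4. Open the web UI in a browser at <http://open-webui-host:3000/>, replacing `open-webui-host` with the hostname or IP address of the machine running the container.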

    At the top of the page, you should see the model `Qwen/Qwen3-0.6B-Chat`.

    ![Web portal of model Qwen/Qwen3-0.6B-Chat](../../assets/deployment/open_webui.png)
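
If the model does not appear in Open WebUI, one way to narrow down the problem is to query the vLLM server's OpenAI-compatible `/v1/models` endpoint directly and check that it lists the model from step 2. The following is a minimal sketch, assuming the server is reachable at `http://localhost:8000` from the machine running the script. The helper names `parse_model_ids` and `list_model_ids` are illustrative and not part of vLLM or Open WebUI.

```python
import json
import urllib.request
from typing import List

# Assumption: vLLM was started with --port 8000 and is reachable from this
# machine as localhost. Adjust the URL if your setup differs.
VLLM_BASE_URL = "http://localhost:8000/v1"


def parse_model_ids(payload: str) -> List[str]:
    """Extract model IDs from an OpenAI-style `GET /v1/models` JSON body."""
    body = json.loads(payload)
    return [entry["id"] for entry in body.get("data", [])]


def list_model_ids(base_url: str = VLLM_BASE_URL, timeout: float = 5.0) -> List[str]:
    """Fetch the model list from the server.

    Raises urllib.error.URLError if the server cannot be reached.
    """
    url = base_url.rstrip("/") + "/models"
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return parse_model_ids(response.read().decode("utf-8"))
```

Calling `list_model_ids()` while the server is running should return a list that includes the served model name. A connection error suggests checking the `--host` and `--port` values or the network path between the container and the server.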