streamlit.md 1.24 KB
Newer Older
1
# Streamlit
2
3
4
5
6
7
8

[Streamlit](https://github.com/streamlit/streamlit) lets you transform Python scripts into interactive web apps in minutes, instead of weeks. Build dashboards, generate reports, or create chat apps.

It can be quickly integrated with vLLM as a backend API server, enabling powerful LLM inference via API calls.

## Prerequisites

9
Set up the vLLM environment by installing all required packages:
10

11
```bash
12
pip install vllm streamlit openai
13
14
```

15
## Deploy
16

17
1. Start the vLLM server with a supported chat completion model, e.g.
18

19
20
21
    ```bash
    vllm serve Qwen/Qwen1.5-0.5B-Chat
    ```
22

23
1. Use the script: [examples/online_serving/streamlit_openai_chatbot_webserver.py](../../../examples/online_serving/streamlit_openai_chatbot_webserver.py)
24

25
1. Start the streamlit web UI and start to chat:
26

27
    ```bash
Reid's avatar
Reid committed
28
    streamlit run streamlit_openai_chatbot_webserver.py
29

30
31
32
33
34
35
36
    # or specify the VLLM_API_BASE or VLLM_API_KEY
    VLLM_API_BASE="http://vllm-server-host:vllm-server-port/v1" \
        streamlit run streamlit_openai_chatbot_webserver.py

    # start with debug mode to view more details
    streamlit run streamlit_openai_chatbot_webserver.py --logger.level=debug
    ```
37

38
    ![Chat with vLLM assistant in Streamlit](../../assets/deployment/streamlit-chat.png)