release initial code

Co-authored-by: Ying Sheng <sqy1415@gmail.com>
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
Co-authored-by: Zhiqiang Xie <xiezhq@stanford.edu>
Co-authored-by: parasol-aser <3848358+parasol-aser@users.noreply.github.com>
Co-authored-by: LiviaSun <33578456+ChuyueSun@users.noreply.github.com>
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>

Commit 22085081 authored Jan 08, 2024 by Lianmin Zheng
20 changed files
--- a/docs/test_process.md
+++ b/docs/test_process.md
--- a/examples/quick_start/anthropic_example_chat.py
+++ b/examples/quick_start/anthropic_example_chat.py
--- a/examples/quick_start/anthropic_example_complete.py
+++ b/examples/quick_start/anthropic_example_complete.py
--- a/examples/quick_start/anthropic_example_stream.py
+++ b/examples/quick_start/anthropic_example_stream.py
--- a/examples/quick_start/more_stream_methods.py
+++ b/examples/quick_start/more_stream_methods.py
--- a/examples/quick_start/openai_example_chat.py
+++ b/examples/quick_start/openai_example_chat.py
--- a/examples/quick_start/openai_example_complete.py
+++ b/examples/quick_start/openai_example_complete.py
--- a/examples/quick_start/openai_example_stream.py
+++ b/examples/quick_start/openai_example_stream.py
--- a/examples/quick_start/srt_example_chat.py
+++ b/examples/quick_start/srt_example_chat.py
--- a/examples/quick_start/srt_example_complete.py
+++ b/examples/quick_start/srt_example_complete.py
--- a/examples/quick_start/srt_example_regex.py
+++ b/examples/quick_start/srt_example_regex.py
--- a/examples/quick_start/srt_example_stream.py
+++ b/examples/quick_start/srt_example_stream.py
--- a/format.sh
+++ b/format.sh
+isort python
+black python
+isort test
+black test
--- a/playground/launch_tgi.sh
+++ b/playground/launch_tgi.sh
+# Assuming the model is downloaded at /home/ubuntu/model_weights/Llama-2-7b-chat-hf
+docker run --name tgi --rm -ti --gpus all --network host \
+  -v /home/ubuntu/model_weights/Llama-2-7b-chat-hf:/Llama-2-7b-chat-hf \
+  ghcr.io/huggingface/text-generation-inference:1.1.0 \
+  --model-id /Llama-2-7b-chat-hf --num-shard 1 --trust-remote-code \
+  --max-input-length 2048 --max-total-tokens 4096 \
+  --port 24000
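Note on the launch flags above: with `--network host` and `--port 24000`, the server should be reachable on the host at port 24000, and the two length flags bound every request. The sketch below is not part of this commit. It is a minimal illustration, assuming the server is running locally and exposes TGI's `/generate` route. The helper names `build_generate_request` and `generate` are hypothetical, and `input_tokens` is a count the caller supplies (in practice it would come from the model's tokenizer).

```python
import json
import urllib.request

# Values copied from the launch command above.
TGI_PORT = 24000
MAX_INPUT_LENGTH = 2048   # --max-input-length
MAX_TOTAL_TOKENS = 4096   # --max-total-tokens


def build_generate_request(prompt, input_tokens, max_new_tokens):
    """Build the JSON body for /generate after checking the token budget.

    Hypothetical helper: input_tokens is supplied by the caller and is
    not computed here.
    """
    if input_tokens > MAX_INPUT_LENGTH:
        raise ValueError("prompt exceeds --max-input-length")
    if input_tokens + max_new_tokens > MAX_TOTAL_TOKENS:
        raise ValueError("prompt plus generation exceeds --max-total-tokens")
    return {"inputs": prompt, "parameters": {"max_new_tokens": max_new_tokens}}


def generate(prompt, input_tokens, max_new_tokens, host="127.0.0.1"):
    """Send one request to the server; the host default is an assumption."""
    body = build_generate_request(prompt, input_tokens, max_new_tokens)
    req = urllib.request.Request(
        f"http://{host}:{TGI_PORT}/generate",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["generated_text"]
```

With these limits, a prompt at the full 2048 input tokens leaves room for at most 4096 - 2048 = 2048 generated tokens. The check runs on the client only to fail early; the server is expected to enforce the same bounds.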
--- a/playground/load_tokenizer.py
+++ b/playground/load_tokenizer.py
--- a/python/pyproject.toml
+++ b/python/pyproject.toml
--- a/python/sglang/__init__.py
+++ b/python/sglang/__init__.py
+from sglang.api import *
+from sglang.global_config import global_config
--- a/python/sglang/api.py
+++ b/python/sglang/api.py
--- a/python/sglang/backend/__init__.py
+++ b/python/sglang/backend/__init__.py
--- a/python/sglang/backend/anthropic.py
+++ b/python/sglang/backend/anthropic.py