"vscode:/vscode.git/clone" did not exist on "70eecbd6b0cf2808c1e5c5f74eee3a78456d6a2e"
Commit 4caca1ba authored by Lianmin Zheng, committed by GitHub

Clean up server args & Add CI scripts (#12124)

parent ceb105a7
@@ -3,7 +3,7 @@
 You can install SGLang using one of the methods below.
 This page primarily applies to common NVIDIA GPU platforms.
-For other or newer platforms, please refer to the dedicated pages for [AMD GPUs](../platforms/amd_gpu.md), [Intel Xeon CPUs](../platforms/cpu_server.md), [NVIDIA Jetson](../platforms/nvidia_jetson.md), [Ascend NPUs](../platforms/ascend_npu.md).
+For other or newer platforms, please refer to the dedicated pages for [AMD GPUs](../platforms/amd_gpu.md), [Intel Xeon CPUs](../platforms/cpu_server.md), [TPU](../platforms/tpu.md), [NVIDIA DGX Spark](https://lmsys.org/blog/2025-10-13-nvidia-dgx-spark/), [NVIDIA Jetson](../platforms/nvidia_jetson.md), [Ascend NPUs](../platforms/ascend_npu.md).
 ## Method 1: With pip or uv
......
@@ -12,10 +12,12 @@ if __name__ == "__main__":
     try:
         if server_args.grpc_mode:
+            # Handle gRPC server
             from sglang.srt.entrypoints.grpc_server import serve_grpc
             asyncio.run(serve_grpc(server_args))
         else:
+            # Handle HTTP server
             from sglang.srt.entrypoints.http_server import launch_server
             launch_server(server_args)
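The hunk above selects an entrypoint from `server_args.grpc_mode`: the gRPC path is a coroutine driven by `asyncio.run`, while the HTTP path is called directly. A minimal, self-contained sketch of that dispatch pattern follows. `DemoArgs`, `demo_serve_grpc`, `demo_launch_http`, and `run` are hypothetical stand-ins for illustration only, not SGLang APIs.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class DemoArgs:
    # Hypothetical stand-in for server_args; only the flag used here.
    grpc_mode: bool = False


async def demo_serve_grpc(args: DemoArgs) -> str:
    # Stand-in for an async entrypoint such as serve_grpc.
    await asyncio.sleep(0)
    return "grpc"


def demo_launch_http(args: DemoArgs) -> str:
    # Stand-in for a blocking entrypoint such as launch_server.
    return "http"


def run(args: DemoArgs) -> str:
    if args.grpc_mode:
        # A coroutine needs an event loop, hence asyncio.run.
        return asyncio.run(demo_serve_grpc(args))
    # The synchronous entrypoint is called directly.
    return demo_launch_http(args)
```

In the real script the imports sit inside each branch, so only the module for the selected mode is loaded.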
......
This diff is collapsed.
@@ -138,6 +138,7 @@ def is_xpu() -> bool:
     return hasattr(torch, "xpu") and torch.xpu.is_available()
+@lru_cache(maxsize=1)
 def is_npu() -> bool:
     return hasattr(torch, "npu") and torch.npu.is_available()
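The added `@lru_cache(maxsize=1)` decorator means the body of a zero-argument check like `is_npu()` runs once per process, with later calls returning the stored result. The sketch below shows that behaviour with a hypothetical `is_demo_device` function and a `probe_calls` counter, both introduced here purely for illustration.

```python
from functools import lru_cache

probe_calls = 0  # counts how often the underlying check actually runs


@lru_cache(maxsize=1)
def is_demo_device() -> bool:
    # Hypothetical stand-in for a platform check such as is_npu().
    global probe_calls
    probe_calls += 1
    return False


is_demo_device()
is_demo_device()
# The body ran once; the second call was served from the cache.
```

One consequence of this design is that a change in device availability later in the same process is not observed unless `is_demo_device.cache_clear()` is called.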
......