Unverified Commit 5062fda4 authored by Nicolas Patry's avatar Nicolas Patry Committed by GitHub
Browse files

Push users to streaming in the readme. (#1698)

parent c7e570e5
...@@ -82,7 +82,7 @@ docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingf ...@@ -82,7 +82,7 @@ docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingf
And then you can make requests like And then you can make requests like
```bash ```bash
curl 127.0.0.1:8080/generate \ curl 127.0.0.1:8080/generate_stream \
-X POST \ -X POST \
-d '{"inputs":"What is Deep Learning?","parameters":{"max_new_tokens":20}}' \ -d '{"inputs":"What is Deep Learning?","parameters":{"max_new_tokens":20}}' \
-H 'Content-Type: application/json' -H 'Content-Type: application/json'
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment