To see all available deployment flags and options, pass the `--help` flag to the container. You can configure the number of shards, quantization, generation parameters, and more.
```bash
docker run ghcr.io/huggingface/text-generation-inference:2.0.3 --help
docker run ghcr.io/huggingface/text-generation-inference:2.0.4 --help
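
# Illustrative sketch only: passing a few launcher options after the image name.
# The model ID, port mapping, volume path, and flag values are assumptions chosen
# for illustration; confirm the exact flags against the --help output before use.
model=HuggingFaceH4/zephyr-7b-beta  # assumed example model, not from this page
docker run --gpus all --shm-size 1g -p 8080:80 -v "$PWD/data:/data" \
    ghcr.io/huggingface/text-generation-inference:2.0.4 \
    --model-id "$model" --num-shard 2 --quantize bitsandbytes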