<small> Chameleon incorporates a vector quantizer module to transform images into discrete tokens. That also enables image geenration using an auto-regressive transformer. Taken from the <ahref="https://arxiv.org/abs/2405.09818v1">original paper.</a></small>
<small> Chameleon incorporates a vector quantizer module to transform images into discrete tokens. That also enables image generation using an auto-regressive transformer. Taken from the <ahref="https://arxiv.org/abs/2405.09818v1">original paper.</a></small>
This model was contributed by [joaogante](https://huggingface.co/joaogante) and [RaushanTurganbay](https://huggingface.co/RaushanTurganbay).
This model was contributed by [joaogante](https://huggingface.co/joaogante) and [RaushanTurganbay](https://huggingface.co/RaushanTurganbay).
The original code can be found [here](https://github.com/facebookresearch/chameleon).
The original code can be found [here](https://github.com/facebookresearch/chameleon).
...
@@ -61,6 +61,7 @@ The original code can be found [here](https://github.com/facebookresearch/chamel
...
@@ -61,6 +61,7 @@ The original code can be found [here](https://github.com/facebookresearch/chamel
### Single image inference
### Single image inference
Chameleon is a gated model so make sure to have access and login to Hugging Face Hub using a token.
Here's how to load the model and perform inference in half-precision (`torch.float16`):
Here's how to load the model and perform inference in half-precision (`torch.float16`):