prompt='best quality, high quality, wearing sunglasses',
prompt='best quality, high quality, wearing sunglasses',
ip_adapter_image=image,
ip_adapter_image=image,
negative_prompt="monochrome, lowres, bad anatomy, worst quality, low quality",
negative_prompt="monochrome, lowres, bad anatomy, worst quality, low quality",
num_inference_steps=50,
num_inference_steps=50,
generator=generator,
generator=generator,
).images[0]
).images[0]
...
@@ -355,11 +355,13 @@ images
...
@@ -355,11 +355,13 @@ images
### IP-Adapter Plus
### IP-Adapter Plus
IP-Adapter relies on an image encoder to generate image features. If the IP-Adapter repository contains a `image_encoder` subfolder, the image encoder is automatically loaded and registed to the pipeline. Otherwise, you'll need to explicitly load the image encoder with a [`~transformers.CLIPVisionModelWithProjection`] model and pass it to the pipeline.
IP-Adapter relies on an image encoder to generate image features. If the IP-Adapter repository contains an`image_encoder` subfolder, the image encoder is automatically loaded and registered to the pipeline. Otherwise, you'll need to explicitly load the image encoder with a [`~transformers.CLIPVisionModelWithProjection`] model and pass it to the pipeline.
This is the case for *IP-Adapter Plus* checkpoints which use the ViT-H image encoder.
This is the case for *IP-Adapter Plus* checkpoints which use the ViT-H image encoder.