Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
renzhc
diffusers_dcu
Commits
a74f02fb
Unverified
Commit
a74f02fb
authored
Mar 05, 2025
by
Yuxuan Zhang
Committed by
GitHub
Mar 04, 2025
Browse files
[Docs] CogView4 comment fix (#10957)
* Update pipeline_cogview4.py * Use GLM instead of T5 in doc
parent
66bf7ea5
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
6 deletions
+4
-6
src/diffusers/pipelines/cogview4/pipeline_cogview4.py
src/diffusers/pipelines/cogview4/pipeline_cogview4.py
+4
-6
No files found.
src/diffusers/pipelines/cogview4/pipeline_cogview4.py
View file @
a74f02fb
...
@@ -143,13 +143,11 @@ class CogView4Pipeline(DiffusionPipeline):
...
@@ -143,13 +143,11 @@ class CogView4Pipeline(DiffusionPipeline):
Args:
Args:
vae ([`AutoencoderKL`]):
vae ([`AutoencoderKL`]):
Variational Auto-Encoder (VAE) Model to encode and decode images to and from latent representations.
Variational Auto-Encoder (VAE) Model to encode and decode images to and from latent representations.
text_encoder ([`T5EncoderModel`]):
text_encoder ([`GLMModel`]):
Frozen text-encoder. CogView4 uses
Frozen text-encoder. CogView4 uses [glm-4-9b-hf](https://huggingface.co/THUDM/glm-4-9b-hf).
[T5](https://huggingface.co/docs/transformers/model_doc/t5#transformers.T5EncoderModel); specifically the
tokenizer (`PreTrainedTokenizer`):
[t5-v1_1-xxl](https://huggingface.co/PixArt-alpha/PixArt-alpha/tree/main/t5-v1_1-xxl) variant.
tokenizer (`T5Tokenizer`):
Tokenizer of class
Tokenizer of class
[
T5
Tokenizer](https://huggingface.co/docs/transformers/m
odel_doc/t5
#transformers.
T5
Tokenizer).
[
PreTrained
Tokenizer](https://huggingface.co/docs/transformers/m
ain/en/main_classes/tokenizer
#transformers.
PreTrained
Tokenizer).
transformer ([`CogView4Transformer2DModel`]):
transformer ([`CogView4Transformer2DModel`]):
A text conditioned `CogView4Transformer2DModel` to denoise the encoded image latents.
A text conditioned `CogView4Transformer2DModel` to denoise the encoded image latents.
scheduler ([`SchedulerMixin`]):
scheduler ([`SchedulerMixin`]):
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment