OpenDAS
vllm_cscc
Commit 41f17bf2 (Unverified)
Authored Sep 12, 2025 by Hyogeun Oh (오효근)
Committed by GitHub, Sep 12, 2025
[Docs] Fix warnings in mkdocs build (continued) (#24740)
Signed-off-by: Zerohertz <ohg3417@gmail.com>
Parent: bcb06d7b
Showing 10 changed files with 121 additions and 176 deletions

vllm/model_executor/layers/quantization/torchao.py (+5 -5)
vllm/model_executor/layers/quantization/utils/int8_utils.py (+1 -1)
vllm/model_executor/layers/rotary_embedding/mrope.py (+2 -2)
vllm/model_executor/model_loader/tensorizer.py (+45 -44)
vllm/model_executor/models/aria.py (+4 -12)
vllm/model_executor/models/bart.py (+40 -64)
vllm/model_executor/models/blip2.py (+0 -1)
vllm/model_executor/models/donut.py (+6 -12)
vllm/model_executor/models/florence2.py (+14 -24)
vllm/model_executor/models/glm4_1v.py (+4 -11)
vllm/model_executor/layers/quantization/torchao.py
@@ -144,8 +144,8 @@ def torchao_quantize_param_data(param: torch.Tensor,
     """Quantize a Tensor with torchao quantization specified by torchao_config
     Args:
-        `param`: weight parameter of the linear module
-        `torchao_config`: type of quantization and their arguments we want to
+        param: weight parameter of the linear module
+        torchao_config: type of quantization and their arguments we want to
          use to quantize the Tensor
     """
     from torchao.core.config import AOBaseConfig
@@ -172,8 +172,8 @@ class TorchAOLinearMethod(LinearMethodBase):
     """Linear method for torchao.
     Args:
-        torchao_config: The torchao quantization config, a string
-        that encodes the type of quantization and all relevant arguments.
+        quant_config: The torchao quantization config, a string that encodes
+            the type of quantization and all relevant arguments.
     """
     def __init__(self, quant_config: TorchAOConfig):
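The rename from `torchao_config` to `quant_config` matters because documentation tooling typically compares the names in a Google-style `Args:` section with the real signature. The sketch below is a simplified, hypothetical checker, not the parser the mkdocs build actually uses. The names `documented_params`, `stale_doc_params`, `old_style`, and `new_style` are illustrative, and the docstrings are attached to plain functions for brevity.

```python
import inspect
import re

# Matches "name:" or "name (type):" at the start of a stripped line.
_ENTRY = re.compile(r"\*{0,2}(\w+)(?:\s*\([^)]*\))?:")


def documented_params(doc: str) -> list[str]:
    """Return parameter names listed in a Google-style ``Args:`` section."""
    names: list[str] = []
    in_args = False
    args_indent = 0
    entry_indent = None
    for line in doc.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        indent = len(line) - len(line.lstrip())
        if stripped == "Args:":
            in_args, args_indent, entry_indent = True, indent, None
            continue
        if not in_args:
            continue
        if indent <= args_indent:  # a new section such as "Returns:" began
            in_args = False
            continue
        if entry_indent is None:
            entry_indent = indent  # the first entry fixes the entry indentation
        if indent == entry_indent:
            match = _ENTRY.match(stripped)
            if match:
                names.append(match.group(1))
    return names


def stale_doc_params(func) -> list[str]:
    """Documented names that do not appear in the function signature."""
    signature = inspect.signature(func).parameters
    doc = inspect.getdoc(func) or ""
    return [name for name in documented_params(doc) if name not in signature]


def old_style(self, quant_config):
    """Linear method for torchao.

    Args:
        torchao_config: The torchao quantization config, a string
        that encodes the type of quantization and all relevant arguments.
    """


def new_style(self, quant_config):
    """Linear method for torchao.

    Args:
        quant_config: The torchao quantization config, a string that encodes
            the type of quantization and all relevant arguments.
    """


print(stale_doc_params(old_style))
print(stale_doc_params(new_style))
```

With the old docstring the checker reports `torchao_config` as a documented name missing from the signature; with the new docstring it reports nothing.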
vllm/model_executor/layers/quantization/utils/int8_utils.py
@@ -423,7 +423,7 @@ def w8a8_block_int8_matmul(
         Bs: The per-block quantization scale for `B`.
         block_size: The block size for per-block quantization. It should be
         2-dim, e.g., [128, 128].
-        output_dytpe: The dtype of the returned tensor.
+        output_dtype: The dtype of the returned tensor.
     Returns:
         torch.Tensor: The result of matmul.
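The `block_size` argument in this docstring is a two-element list such as `[128, 128]`. As a rough illustration of what per-block quantization implies, the sketch below counts how many scale entries a matrix needs when one scale is stored per block. The helper name `block_scale_shape` and the `(n, k)` layout are assumptions made for illustration and are not taken from `int8_utils.py`.

```python
def block_scale_shape(n: int, k: int, block_size: list[int]) -> tuple[int, int]:
    """Number of per-block scales for an (n, k) matrix, one scale per block."""
    if len(block_size) != 2:
        raise ValueError("block_size must be 2-dim, e.g. [128, 128]")
    block_n, block_k = block_size
    # Partial blocks at the edges still need their own scale, so round up.
    rows = (n + block_n - 1) // block_n
    cols = (k + block_k - 1) // block_k
    return rows, cols


print(block_scale_shape(4096, 11008, [128, 128]))
print(block_scale_shape(300, 130, [128, 128]))
```

Rounding up rather than truncating is the design choice worth noting: a matrix whose dimensions are not multiples of the block size still has a trailing partial block that must be scaled.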
vllm/model_executor/layers/rotary_embedding/mrope.py
@@ -135,8 +135,8 @@ def triton_mrope(
     """Qwen2VL mrope kernel.
     Args:
-        query: [num_tokens, num_heads * head_size]
-        key: [num_tokens, num_kv_heads * head_size]
+        q: [num_tokens, num_heads * head_size]
+        k: [num_tokens, num_kv_heads * head_size]
         cos: [3, num_tokens, head_size //2 ]
             (T/H/W positions with multimodal inputs)
         sin: [3, num_tokens, head_size //2 ]
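The shapes listed in this docstring can be spelled out with a small helper. This is only a sketch of the documented shapes: `mrope_input_shapes` is a hypothetical name, the example sizes are arbitrary, and no Triton kernel is invoked.

```python
def mrope_input_shapes(
    num_tokens: int, num_heads: int, num_kv_heads: int, head_size: int
) -> dict[str, tuple[int, ...]]:
    """Expected input shapes, following the docstring of triton_mrope."""
    return {
        "q": (num_tokens, num_heads * head_size),
        "k": (num_tokens, num_kv_heads * head_size),
        # The leading 3 holds the T/H/W positions for multimodal inputs.
        "cos": (3, num_tokens, head_size // 2),
        "sin": (3, num_tokens, head_size // 2),
    }


shapes = mrope_input_shapes(num_tokens=10, num_heads=28, num_kv_heads=4, head_size=128)
for name, shape in shapes.items():
    print(name, shape)
```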
vllm/model_executor/model_loader/tensorizer.py
@@ -171,22 +171,23 @@ class TensorizerConfig(MutableMapping):
     _is_sharded: bool = field(init=False, default=False)
     _fields: ClassVar[tuple[str, ...]]
     _keys: ClassVar[frozenset[str]]
-    """
-    Args for the TensorizerConfig class. These are used to configure the
-    behavior of model serialization and deserialization using Tensorizer.
+    """Configuration class for Tensorizer settings.

-    Args:
+    These settings configure the behavior of model serialization and
+    deserialization using Tensorizer.
+
+    Attributes:
         tensorizer_uri: Path to serialized model tensors. Can be a local file
             path or a S3 URI. This is a required field unless lora_dir is
             provided and the config is meant to be used for the
             `tensorize_lora_adapter` function. Unless a `tensorizer_dir` or
-            `lora_dir` is passed to this object's initializer, this is a required
-            argument.
+            `lora_dir` is passed to this object's initializer, this is
+            a required argument.
         tensorizer_dir: Path to a directory containing serialized model tensors,
             and all other potential model artifacts to load the model, such as
-            configs and tokenizer files. Can be passed instead of `tensorizer_uri`
-            where the `model.tensors` file will be assumed to be in this
-            directory.
+            configs and tokenizer files. Can be passed instead of
+            `tensorizer_uri` where the `model.tensors` file will be assumed
+            to be in this directory.
         vllm_tensorized: If True, indicates that the serialized model is a
             vLLM model. This is used to determine the behavior of the
             TensorDeserializer when loading tensors from a serialized model.
@@ -194,9 +195,9 @@ class TensorizerConfig(MutableMapping):
             tensorizer's optimized GPU loading. Note that this is now
             deprecated, as serialized vLLM models are now automatically
             inferred as vLLM models.
-        verify_hash: If True, the hashes of each tensor will be verified against
-            the hashes stored in the metadata. A `HashMismatchError` will be
-            raised if any of the hashes do not match.
+        verify_hash: If True, the hashes of each tensor will be verified
+            against the hashes stored in the metadata. A `HashMismatchError`
+            will be raised if any of the hashes do not match.
         num_readers: Controls how many threads are allowed to read concurrently
             from the source file. Default is `None`, which will dynamically set
             the number of readers based on the number of available
vllm/model_executor/models/aria.py
@@ -143,16 +143,8 @@ class AriaProjector(nn.Module):
     projects ViT's outputs into MoE's inputs.
     Args:
-        patch_to_query_dict (dict): Maps patch numbers to their corresponding
-        query numbers,
-            e.g., {1225: 128, 4900: 256}. This allows for different query sizes
-            based on image resolution.
-        embed_dim (int): Embedding dimension.
-        num_heads (int): Number of attention heads.
-        kv_dim (int): Dimension of key and value.
-        ff_dim (int): Hidden dimension of the feed-forward network.
-        output_dim (int): Output dimension.
-        norm_layer (nn.Module): Normalization layer. Default is nn.LayerNorm.
+        config: [AriaConfig](https://huggingface.co/docs/transformers/main/model_doc/aria#transformers.AriaConfig)
+            containing projector configuration parameters.
     Outputs:
         A tensor with the shape of (batch_size, query_number, output_dim)
@@ -282,8 +274,8 @@ class AriaTextMoELayer(nn.Module):
         Forward pass of the MoE Layer.
         Args:
-            hidden_states (torch.Tensor): Input tensor of shape (batch_size,
-            sequence_length, hidden_size).
+            hidden_states: Input tensor of shape
+                (batch_size, sequence_length, hidden_size).
         Returns:
             torch.Tensor: Output tensor after passing through the MoE layer.
vllm/model_executor/models/bart.py
@@ -401,8 +401,7 @@ class BartEncoderLayer(nn.Module):
     def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
         r"""
         Args:
-            hidden_states
-                torch.Tensor of *encoder* input embeddings.
+            hidden_states: torch.Tensor of *encoder* input embeddings.
         Returns:
             Encoder layer output torch.Tensor
         """
@@ -490,10 +489,8 @@ class BartDecoderLayer(nn.Module):
     ) -> torch.Tensor:
         r"""
         Args:
-            decoder_hidden_states
-                torch.Tensor of *decoder* input embeddings.
-            encoder_hidden_states
-                torch.Tensor of *encoder* input embeddings.
+            decoder_hidden_states: torch.Tensor of *decoder* input embeddings.
+            encoder_hidden_states: torch.Tensor of *encoder* input embeddings.
         Returns:
             Decoder layer output torch.Tensor
         """
@@ -584,12 +581,10 @@ class BartEncoder(nn.Module):
     ) -> torch.Tensor:
         r"""
         Args:
-            input_ids
-                Indices of *encoder* input sequence tokens in the vocabulary.
-                Padding will be ignored by default should you
-                provide it.
-            positions
-                Positions of *encoder* input sequence tokens.
+            input_ids: Indices of *encoder* input sequence tokens in the
+                vocabulary.
+                Padding will be ignored by default should you provide it.
+            positions: Positions of *encoder* input sequence tokens.
         Returns:
             Decoder output torch.Tensor
         """
@@ -663,14 +658,11 @@ class BartDecoder(nn.Module):
     ) -> torch.Tensor:
         r"""
         Args:
-            decoder_input_ids
-                Indices of *decoder* input sequence tokens in the vocabulary.
-                Padding will be ignored by default should you
-                provide it.
-            decoder_positions
-                Positions of *decoder* input sequence tokens.
-            encoder_hidden_states:
-                Tensor of encoder output embeddings
+            decoder_input_ids: Indices of *decoder* input sequence tokens
+                in the vocabulary.
+                Padding will be ignored by default should you provide it.
+            decoder_positions: Positions of *decoder* input sequence tokens.
+            encoder_hidden_states: Tensor of encoder output embeddings.
         Returns:
             Decoder output torch.Tensor
         """
@@ -732,16 +724,13 @@ class BartModel(nn.Module, SupportsQuant):
                 encoder_positions: torch.Tensor) -> torch.Tensor:
         r"""
         Args:
-            input_ids
-                Indices of *decoder* input sequence tokens in the vocabulary.
-                Padding will be ignored by default should you
-                provide it.
-            positions
-                Positions of *decoder* input sequence tokens.
-            encoder_input_ids
-                Indices of *encoder* input sequence tokens in the vocabulary.
-            encoder_positions:
-                Positions of *encoder* input sequence tokens.
+            input_ids: Indices of *decoder* input sequence tokens
+                in the vocabulary.
+                Padding will be ignored by default should you provide it.
+            positions: Positions of *decoder* input sequence tokens.
+            encoder_input_ids: Indices of *encoder* input sequence tokens
+                in the vocabulary.
+            encoder_positions: Positions of *encoder* input sequence tokens.
         Returns:
             Model output torch.Tensor
         """
@@ -848,14 +837,10 @@ class BartForConditionalGeneration(nn.Module, SupportsV0Only, SupportsQuant):
     ) -> torch.Tensor:
         r"""
         Args:
-            input_ids
-                torch.Tensor of *decoder* input token ids.
-            positions
-                torch.Tensor of *decoder* position indices.
-            encoder_input_ids
-                torch.Tensor of *encoder* input token ids.
-            encoder_positions
-                torch.Tensor of *encoder* position indices
+            input_ids: torch.Tensor of *decoder* input token ids.
+            positions: torch.Tensor of *decoder* position indices.
+            encoder_input_ids: torch.Tensor of *encoder* input token ids.
+            encoder_positions: torch.Tensor of *encoder* position indices.
         Returns:
             Output torch.Tensor
         """
@@ -912,8 +897,7 @@ class MBartEncoderLayer(BartEncoderLayer):
     def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
         r"""
         Args:
-            hidden_states
-                torch.Tensor of *encoder* input embeddings.
+            hidden_states: torch.Tensor of *encoder* input embeddings.
         Returns:
             Encoder layer output torch.Tensor
         """
@@ -1035,12 +1019,10 @@ class MBartEncoder(nn.Module):
     ) -> torch.Tensor:
         r"""
         Args:
-            input_ids
-                Indices of *encoder* input sequence tokens in the vocabulary.
-                Padding will be ignored by default should you
-                provide it.
-            positions
-                Positions of *encoder* input sequence tokens.
+            input_ids: Indices of *encoder* input sequence tokens in the
+                vocabulary.
+                Padding will be ignored by default should you provide it.
+            positions: Positions of *encoder* input sequence tokens.
         Returns:
             Decoder output torch.Tensor
         """
@@ -1116,14 +1098,11 @@ class MBartDecoder(nn.Module):
     ) -> torch.Tensor:
         r"""
         Args:
-            decoder_input_ids
-                Indices of *decoder* input sequence tokens in the vocabulary.
-                Padding will be ignored by default should you
-                provide it.
-            decoder_positions
-                Positions of *decoder* input sequence tokens.
-            encoder_hidden_states:
-                Tensor of encoder output embeddings
+            decoder_input_ids: Indices of *decoder* input sequence tokens
+                in the vocabulary.
+                Padding will be ignored by default should you provide it.
+            decoder_positions: Positions of *decoder* input sequence tokens.
+            encoder_hidden_states: Tensor of encoder output embeddings.
         Returns:
             Decoder output torch.Tensor
         """
@@ -1185,16 +1164,13 @@ class MBartModel(nn.Module, SupportsQuant):
                 encoder_positions: torch.Tensor) -> torch.Tensor:
         r"""
         Args:
-            input_ids
-                Indices of *decoder* input sequence tokens in the vocabulary.
-                Padding will be ignored by default should you
-                provide it.
-            positions
-                Positions of *decoder* input sequence tokens.
-            encoder_input_ids
-                Indices of *encoder* input sequence tokens in the vocabulary.
-            encoder_positions:
-                Positions of *encoder* input sequence tokens.
+            input_ids: Indices of *decoder* input sequence tokens
+                in the vocabulary.
+                Padding will be ignored by default should you provide it.
+            positions: Positions of *decoder* input sequence tokens.
+            encoder_input_ids: Indices of *encoder* input sequence tokens
+                in the vocabulary.
+            encoder_positions: Positions of *encoder* input sequence tokens.
         Returns:
             Model output torch.Tensor
         """
vllm/model_executor/models/blip2.py
@@ -678,7 +678,6 @@ class Blip2ForConditionalGeneration(nn.Module, SupportsMultiModal, SupportsPP,
         Args:
             input_ids: Flattened (concatenated) input_ids corresponding to a
                 batch.
-            pixel_values: The pixels in each input image.
         Info:
             [Blip2ImageInputs][]
vllm/model_executor/models/donut.py
@@ -79,10 +79,8 @@ class DonutLanguageForConditionalGeneration(nn.Module, SupportsV0Only):
     ) -> torch.Tensor:
         r"""
         Args:
-            input_ids
-                torch.Tensor of *decoder* input token ids.
-            positions
-                torch.Tensor of *decoder* position indices.
+            input_ids: torch.Tensor of *decoder* input token ids.
+            positions: torch.Tensor of *decoder* position indices.
         Returns:
             Output torch.Tensor
         """
@@ -351,14 +349,10 @@ class DonutForConditionalGeneration(nn.Module, SupportsMultiModal,
     ) -> torch.Tensor:
         r"""
         Args:
-            input_ids
-                torch.Tensor of *decoder* input token ids.
-            positions
-                torch.Tensor of *decoder* position indices.
-            encoder_input_ids
-                torch.Tensor of *encoder* input token ids.
-            encoder_positions
-                torch.Tensor of *encoder* position indices
+            input_ids: torch.Tensor of *decoder* input token ids.
+            positions: torch.Tensor of *decoder* position indices.
+            encoder_input_ids: torch.Tensor of *encoder* input token ids.
+            encoder_positions: torch.Tensor of *encoder* position indices
         Returns:
             Output torch.Tensor
         """
vllm/model_executor/models/florence2.py
@@ -631,16 +631,14 @@ class Florence2LanguageModel(nn.Module):
     ) -> torch.Tensor:
         r"""
         Args:
-            input_ids
-                Indices of *decoder* input sequence tokens in the vocabulary.
+            input_ids: Indices of *decoder* input sequence tokens
+                in the vocabulary.
                 Padding will be ignored by default should you
                 provide it.
-            positions
-                Positions of *decoder* input sequence tokens.
-            encoder_input_ids
-                Indices of *encoder* input sequence tokens in the vocabulary.
-            encoder_positions:
-                Positions of *encoder* input sequence tokens.
+            positions: Positions of *decoder* input sequence tokens.
+            encoder_input_ids: Indices of *encoder* input sequence tokens
+                in the vocabulary.
+            encoder_positions: Positions of *encoder* input sequence tokens.
         Returns:
             Model output torch.Tensor
         """
@@ -699,14 +697,10 @@ class Florence2LanguageForConditionalGeneration(nn.Module, SupportsV0Only):
     ) -> torch.Tensor:
         r"""
         Args:
-            input_ids
-                torch.Tensor of *decoder* input token ids.
-            positions
-                torch.Tensor of *decoder* position indices.
-            encoder_input_ids
-                torch.Tensor of *encoder* input token ids.
-            encoder_positions
-                torch.Tensor of *encoder* position indices
+            input_ids: torch.Tensor of *decoder* input token ids.
+            positions: torch.Tensor of *decoder* position indices.
+            encoder_input_ids: torch.Tensor of *encoder* input token ids.
+            encoder_positions: torch.Tensor of *encoder* position indices
         Returns:
             Output torch.Tensor
         """
@@ -1068,14 +1062,10 @@ class Florence2ForConditionalGeneration(nn.Module, SupportsMultiModal,
     ) -> torch.Tensor:
         r"""
         Args:
-            input_ids
-                torch.Tensor of *decoder* input token ids.
-            positions
-                torch.Tensor of *decoder* position indices.
-            encoder_input_ids
-                torch.Tensor of *encoder* input token ids.
-            encoder_positions
-                torch.Tensor of *encoder* position indices
+            input_ids: torch.Tensor of *decoder* input token ids.
+            positions: torch.Tensor of *decoder* position indices.
+            encoder_input_ids: torch.Tensor of *encoder* input token ids.
+            encoder_positions: torch.Tensor of *encoder* position indices
         Returns:
             Output torch.Tensor
         """
vllm/model_executor/models/glm4_1v.py
@@ -1599,17 +1599,10 @@ class Glm4vForConditionalGeneration(nn.Module, SupportsMultiModal,
                 **NOTE**: If mrope is enabled (default setting for GLM-4V
                 opensource models), the shape will be `(3, seq_len)`,
                 otherwise it will be `(seq_len,).
-            pixel_values: Pixel values to be fed to a model.
-                `None` if no images are passed.
-            image_grid_thw: Tensor `(n_images, 3)` of image 3D grid in LLM.
-                `None` if no images are passed.
-            pixel_values_videos: Pixel values of videos to be fed to a model.
-                `None` if no videos are passed.
-            video_grid_thw: Tensor `(n_videos, 3)` of video 3D grid in LLM.
-                `None` if no videos are passed.
-            second_per_grid_ts: Tensor `(num_videos)` of video time interval (
-                in seconds) for each grid along the temporal dimension in the
-                3D position IDs. `None` if no videos are passed.
+            intermediate_tensors: Optional intermediate tensors for pipeline
+                parallelism.
+            inputs_embeds: Optional pre-computed input embeddings.
+            **kwargs: Additional keyword arguments.
         """
         if intermediate_tensors is not None:
             inputs_embeds = None