Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
dynamo
Commits
bf1998f0
"...ssh:/git@developer.sourcefind.cn:2222/OpenDAS/dynamo.git" did not exist on "01bfbea15f010a6dfe448f09f9fcb5895d096067"
Unverified
Commit
bf1998f0
authored
Jul 18, 2025
by
jthomson04
Committed by
GitHub
Jul 19, 2025
Browse files
fix: Don't detokenize twice in TRT-LLM examples (#1955)
parent
343a4814
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
0 deletions
+3
-0
components/backends/trtllm/src/dynamo/trtllm/main.py
components/backends/trtllm/src/dynamo/trtllm/main.py
+3
-0
No files found.
components/backends/trtllm/src/dynamo/trtllm/main.py
View file @
bf1998f0
...
@@ -126,6 +126,9 @@ async def init(runtime: DistributedRuntime, config: Config):
...
@@ -126,6 +126,9 @@ async def init(runtime: DistributedRuntime, config: Config):
default_sampling_params
.
_setup
(
tokenizer
)
default_sampling_params
.
_setup
(
tokenizer
)
default_sampling_params
.
stop
=
None
default_sampling_params
.
stop
=
None
# We already detokenize inside HandlerBase. No need to also do it in TRTLLM.
default_sampling_params
.
detokenize
=
False
async
with
get_tensorrtllm_engine
(
engine_args
)
as
engine
:
async
with
get_tensorrtllm_engine
(
engine_args
)
as
engine
:
endpoint
=
component
.
endpoint
(
config
.
endpoint
)
endpoint
=
component
.
endpoint
(
config
.
endpoint
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment