Unverified Commit 6d70198b authored by Kazuhiro Serizawa's avatar Kazuhiro Serizawa Committed by GitHub
Browse files

[Doc] Fix typo (#11666)


Signed-off-by: default avatarKazuhiro Serizawa <nserihiro@gmail.com>
parent f962f426
......@@ -39,7 +39,7 @@ class RejectionSampler(SpecDecodeStochasticBaseSampler):
strict_mode: Whether or not to perform shape/device/dtype checks
during sampling. This catches correctness issues but adds
nontrivial latency.
use_falshinfer: We will use this parameter to determine whether
use_flashinfer: We will use this parameter to determine whether
to use the FlashInfer rejection sampling kernel or not. If it's
None, we will use the default value from the environment variable.
This parameter is only used for testing purposes.
......
......@@ -44,7 +44,7 @@ class TopKTopPSampler(nn.Module):
logger.warning(
"FlashInfer is not available. Falling back to the PyTorch-"
"native implementation of top-p & top-k sampling. For the "
"best performance, please install FalshInfer.")
"best performance, please install FlashInfer.")
self.forward = self.forward_native
else:
self.forward = self.forward_native
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment