[Doc] Fix typo (#11666)
Signed-off-by: Kazuhiro Serizawa <nserihiro@gmail.com>
parent f962f426bc
commit 6d70198b17
@@ -39,7 +39,7 @@ class RejectionSampler(SpecDecodeStochasticBaseSampler):
 strict_mode: Whether or not to perform shape/device/dtype checks
 during sampling. This catches correctness issues but adds
 nontrivial latency.
-use_falshinfer: We will use this parameter to determine whether
+use_flashinfer: We will use this parameter to determine whether
 to use the FlashInfer rejection sampling kernel or not. If it's
 None, we will use the default value from the environment variable.
 This parameter is only used for testing purposes.
@@ -44,7 +44,7 @@ class TopKTopPSampler(nn.Module):
 logger.warning(
 "FlashInfer is not available. Falling back to the PyTorch-"
 "native implementation of top-p & top-k sampling. For the "
-"best performance, please install FalshInfer.")
+"best performance, please install FlashInfer.")
 self.forward = self.forward_native
 else:
 self.forward = self.forward_native
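For readers unfamiliar with the pattern the two touched hunks describe, here is a minimal sketch of it: an optional flag that defers to an environment default when it is None, plus a fallback with a warning when FlashInfer cannot be imported. This is not the project's code. The variable name EXAMPLE_USE_FLASHINFER and the helpers resolve_use_flashinfer, flashinfer_installed, and pick_backend are hypothetical names invented for illustration.

```python
import importlib.util
import logging
import os
from typing import Optional

logger = logging.getLogger(__name__)

# Hypothetical environment variable name, used only in this sketch.
ENV_VAR = "EXAMPLE_USE_FLASHINFER"


def resolve_use_flashinfer(use_flashinfer: Optional[bool] = None) -> bool:
    """Return the explicit flag, or the environment default when it is None."""
    if use_flashinfer is not None:
        return use_flashinfer
    return os.environ.get(ENV_VAR, "0") == "1"


def flashinfer_installed() -> bool:
    """Report whether a top-level module named 'flashinfer' can be found."""
    return importlib.util.find_spec("flashinfer") is not None


def pick_backend(use_flashinfer: Optional[bool] = None,
                 installed: Optional[bool] = None) -> str:
    """Choose 'flashinfer' or 'native', warning when a fallback happens."""
    if installed is None:
        installed = flashinfer_installed()
    if not resolve_use_flashinfer(use_flashinfer):
        return "native"
    if not installed:
        # Requested but unavailable: warn and use the native path instead.
        logger.warning(
            "FlashInfer is not available. Falling back to the "
            "PyTorch-native implementation.")
        return "native"
    return "flashinfer"
```

Passing installed explicitly, as in pick_backend(True, installed=False), makes the fallback branch testable on a machine where the package is absent or present.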