Commit 8f840edd authored by Arthur, committed by GitHub

[`Whisper`] Nit converting the tokenizer (#27349)

* `nospeech` instead of `nocaption` for the no speech token

* oups
parent cc9f27bb
@@ -235,7 +235,7 @@ def convert_tiktoken_to_hf(
 "<|transcribe|>",
 "<|startoflm|>",
 "<|startofprev|>",
-"<|nocaptions|>",
+"<|nospeech|>",
 "<|notimestamps|>",
 ]
 # these are special tokens, not normalized
......
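To illustrate the effect of the change, here is a minimal sketch of the same rename applied to a token list. The helper `rename_special_tokens` and the `LEGACY_TO_CURRENT` mapping are hypothetical names introduced for this example; they are not part of the conversion script. Only the five tokens visible in the hunk are used, since the rest of the list is elided in the diff.

```python
# Hypothetical helper for illustration only; not part of convert_tiktoken_to_hf.
LEGACY_TO_CURRENT = {"<|nocaptions|>": "<|nospeech|>"}


def rename_special_tokens(tokens, mapping=LEGACY_TO_CURRENT):
    """Return a new list with legacy token names replaced, order preserved."""
    return [mapping.get(tok, tok) for tok in tokens]


# Tokens visible in the hunk (the full list is not shown in the diff).
old_tail = [
    "<|transcribe|>",
    "<|startoflm|>",
    "<|startofprev|>",
    "<|nocaptions|>",
    "<|notimestamps|>",
]

new_tail = rename_special_tokens(old_tail)
print(new_tail)
```

The rename is done in place rather than by removing and appending, so every token keeps its position in the list. This matters if, as is assumed here, token ids are assigned from list order.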