• moto's avatar
    Add merge_tokens / TokenSpan (#3535) · 30668afb
    moto authored
    Summary:
    This commit adds `merge_tokens` function which removes repeated tokens from CTC token sequences returned from `forced_align`.
    
    Resolving repeated tokens is a necessary step and almost universal, thus it makes sense to have such helper function in torchaudio.
    
    Pull Request resolved: https://github.com/pytorch/audio/pull/3535
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48111202
    
    Pulled By: mthrok
    
    fbshipit-source-id: 25354bfa210aa5c03f8c1d3e201f253ca3761b24
    30668afb
functional.rst 1.66 KB