"docs/source/en/internal/generation_utils.md" did not exist on "3f43d824b909ec92cde5311e9be016767c0fb11b"
  • uchuhimo's avatar
    fix: use bool instead of uint8/byte in Deberta/DebertaV2/SEW-D to make it... · b4698b7e
    uchuhimo authored
    
    fix: use bool instead of uint8/byte in Deberta/DebertaV2/SEW-D to make it compatible with TensorRT (#23683)
    
    * Use bool instead of uint8/byte in DebertaV2 to make it compatible with TensorRT
    
    TensorRT cannot accept onnx graph with uint8/byte intermediate tensors. This PR uses bool tensors instead of unit8/byte tensors to make the exported onnx file can work with TensorRT.
    
    * fix: use bool instead of uint8/byte in Deberta and SEW-D
    
    ---------
    Co-authored-by: default avatarYuxian Qiu <yuxianq@nvidia.com>
    b4698b7e
modeling_deberta.py 57.2 KB