• Patrick von Platen's avatar
    [T5] make decoder input ids optional for t5 training (#3521) · 75ec6c9e
    Patrick von Platen authored
    * make decoder input ids optional for t5 training
    
    * lm_lables should not be shifted in t5
    
    * add tests
    
    * finish shift right functionality for PT T5
    
    * move shift right to correct class
    
    * cleaner code
    
    * replace -100 values with pad token id
    
    * add assert statement
    
    * remove unnecessary for loop
    
    * make style
    75ec6c9e
test_modeling_tf_t5.py 6.67 KB