• VictorSanh's avatar
    Fix loss · 72ab1039
    VictorSanh authored
    Please review @thomwolf but i think this is equivqlent (and it mimics the loss computation of the original loss)
    72ab1039
modeling_pytorch.py 21.6 KB