- 04 Nov, 2018 2 commits
- 03 Nov, 2018 37 commits
-
-
thomwolf authored
-
thomwolf authored
-
-
thomwolf authored
-
Ubuntu authored
-
VictorSanh authored
-
Tim Rault authored
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
Tim Rault authored
-
-
thomwolf authored
-
VictorSanh authored
-
Ubuntu authored
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
-
thomwolf authored
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
Create DataParallel model if several GPUs
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
I seriously don't understand why they defined num_train_epochs as a float in the originial tf code. I Will change it at the end to avoir merge conflicts for now.
-
VictorSanh authored
-
Tim Rault authored
-
-
thomwolf authored
-
Tim Rault authored
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
-
thomwolf authored
-
- 02 Nov, 2018 1 commit
-
-
VictorSanh authored
Please review @thomwolf but i think this is equivqlent (and it mimics the loss computation of the original loss)
-