- 17 Nov, 2021 1 commit
-
-
Krzysztof Sadowski authored
* upload * cleanup of unused code * default gpu training/inference * layer norm instead of batch norm * fix for default inference mode * simplified embedding forward method Co-authored-by:
xiang song(charlie.song) <classicxsong@gmail.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com> Co-authored-by:
Mufei Li <mufeili1996@gmail.com>
-