- 09 Mar, 2020 2 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
-
- 08 Mar, 2020 5 commits
-
-
patrickvonplaten authored
-
patrickvonplaten authored
-
patrickvonplaten authored
-
patrickvonplaten authored
-
patrickvonplaten authored
-
- 06 Mar, 2020 1 commit
-
-
Sam Shleifer authored
-
- 05 Mar, 2020 9 commits
-
-
patrickvonplaten authored
-
Sam Shleifer authored
* improved documentation
-
Lysandre Debut authored
* Pass kwargs to configuration
* Setter
* test
-
Lysandre Debut authored
-
sshleifer authored
-
sshleifer authored
-
Julien Chaumond authored
-
Julien Chaumond authored
-
Lysandre authored
-
- 04 Mar, 2020 2 commits
-
-
Patrick von Platen authored
-
patrickvonplaten authored
-
- 03 Mar, 2020 7 commits
-
-
Gunnlaugur Thor Briem authored
-
Gunnlaugur Thor Briem authored
And only run the test on TF*MainLayer classes so marked.
-
Gunnlaugur Thor Briem authored
When supplied by Keras deserialization, the config parameter to initializers will be a dict. So intercept it and convert to PretrainedConfig object (and store in instance attribute for get_config to get at it) before passing to the actual initializer. To accomplish this, and repeat as little code as possible, use a class decorator on TF*MainLayer classes.
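The interception described above can be sketched without TensorFlow. Everything named here (`SketchConfig`, `config_serializable`, `SketchMainLayer`, `_stored_config`) is a hypothetical stand-in for illustration and not the library's actual code; it only assumes that the config class can round-trip through a dict.

```python
import functools


class SketchConfig:
    """Hypothetical stand-in for PretrainedConfig (assumption, not the real class)."""

    def __init__(self, **kwargs):
        self.__dict__.update(kwargs)

    @classmethod
    def from_dict(cls, values):
        return cls(**values)

    def to_dict(self):
        return dict(self.__dict__)


def config_serializable(cls):
    """Class decorator: accept a dict or a config object, and expose get_config."""
    original_init = cls.__init__

    @functools.wraps(original_init)
    def wrapped_init(self, config, *args, **kwargs):
        # Deserialization hands back a plain dict, so convert it first.
        if isinstance(config, dict):
            config = SketchConfig.from_dict(config)
        original_init(self, config, *args, **kwargs)
        # Keep the config on the instance so get_config can serialize it later.
        self._stored_config = config

    def get_config(self):
        return {"config": self._stored_config.to_dict()}

    cls.__init__ = wrapped_init
    cls.get_config = get_config
    return cls


@config_serializable
class SketchMainLayer:
    """Hypothetical layer whose initializer expects a config object."""

    def __init__(self, config):
        self.hidden_size = config.hidden_size


layer = SketchMainLayer(SketchConfig(hidden_size=8))
# Round trip: rebuild the layer from the dict produced by get_config.
restored = SketchMainLayer(**layer.get_config())
```

Wrapping the initializer in a decorator keeps the dict handling in one place, so each marked class only needs the decorator line.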
-
Sam Shleifer authored
-
Julien Chaumond authored
Adopted the best practice set by @patrickvonplaten of commenting lines run on fairseq, for easy comparison. Also see #3020.
-
Gunnlaugur Thor Briem authored
-
Patrick von Platen authored
* add first copy-paste test to tf 2 generate
* add tf top_k_top_p_filter fn
* add generate function for TF
* implemented generate for all models except transfoXL
* make style
* change permission of test file to correct ones
* delete ipdb
* fix bug and finish simple gpt2 integration test
* clean test file
* make style
* change import style
* make style
* add decorators
* fix tf ctrl bug dim => axis in TF
* make style
* refactored test file
* take out test_torch_tf_conversion if nothing is defined
* remove useless files
* fix conflicts
* solve conflicts
* fix conflicts
* merge conflicts
* delete ipdb
* exposed top_k_top_p_filtering fns
* delete weirdly created w! file
* add comment to test tf common modeling
* fix conflicts
* make style
* merge conflicts
* make style
* change tf.tensor.shape to shape_list(tensor)
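For readers unfamiliar with the filtering step named in this commit, here is a minimal pure-Python sketch of top-k and nucleus (top-p) filtering over a single list of logits. It illustrates the general technique only; the function name, signature and defaults are assumptions, and the TF function added in the commit operates on batched tensors and may differ in detail.

```python
import math


def top_k_top_p_filter(logits, top_k=0, top_p=1.0, filter_value=-math.inf):
    """Illustrative sketch: mask logits outside the top-k / top-p candidate set."""
    logits = list(logits)

    if top_k > 0:
        # Keep only the k largest logits (ties at the threshold are kept).
        k = min(top_k, len(logits))
        threshold = sorted(logits, reverse=True)[k - 1]
        logits = [x if x >= threshold else filter_value for x in logits]

    if top_p < 1.0:
        # Rank tokens by logit, then compute softmax probabilities in that order.
        order = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
        largest = logits[order[0]]
        exps = [math.exp(logits[i] - largest) for i in order]
        total = sum(exps)
        cumulative = 0.0
        for rank, index in enumerate(order):
            # Drop a token once the mass of the tokens ranked before it
            # already exceeds top_p; the best token is always kept.
            if rank > 0 and cumulative > top_p:
                logits[index] = filter_value
            cumulative += exps[rank] / total

    return logits


kept_by_k = top_k_top_p_filter([2.0, 1.0, 0.0, -1.0], top_k=2)
kept_by_p = top_k_top_p_filter([2.0, 1.0, 0.0, -1.0], top_p=0.5)
```

Masked positions are set to negative infinity so that a later softmax assigns them zero probability.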
-
- 02 Mar, 2020 6 commits
-
-
Julien Chaumond authored
* debug env
* Restrict TF GPU memory
* Fixup
* One more test
* rm debug logs
* Fixup
-
Lysandre Debut authored
* Pipeline doc initial commit
* pipeline abstraction
* Remove modelcard argument from pipeline
* Task-specific pipelines can be instantiated with no model or tokenizer
* All pipelines doc
-
Julien Chaumond authored
cc @patrickvonplaten
-
Patrick von Platen authored
* correct greedy generation when doing beam search
* improve comment
-
Patrick von Platen authored
* force pad_token_id to be set before padding
* fix tests and forbid padding without having a padding_token_id set
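The rule in this commit can be illustrated with a small sketch. `pad_batch` is a hypothetical helper, not the library's API, and it assumes the error is raised only when padding is actually needed.

```python
def pad_batch(sequences, pad_token_id=None):
    """Hypothetical helper: right-pad token id lists to a common length."""
    max_len = max(len(seq) for seq in sequences)
    needs_padding = any(len(seq) < max_len for seq in sequences)

    # Refuse to pad when no padding token has been configured.
    if needs_padding and pad_token_id is None:
        raise ValueError("pad_token_id must be set before padding")

    return [list(seq) + [pad_token_id] * (max_len - len(seq)) for seq in sequences]


padded = pad_batch([[1, 2, 3], [4]], pad_token_id=0)
```

Failing loudly here avoids silently filling batches with a placeholder value such as `None`.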
-
Sam Shleifer authored
`generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.
-
- 27 Feb, 2020 2 commits
-
-
Martin Malmsten authored
-
Martin Malmsten authored
-
- 26 Feb, 2020 5 commits
-
-
Julien Chaumond authored
-
Julien Chaumond authored
-
Patrick von Platen authored
* fix issue and add some tests
* updated doc string gpt2
-
Julien Chaumond authored
* Fix tests on GPU (torch)
* Fix bart slow tests

Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
Sam Shleifer authored
-
- 25 Feb, 2020 1 commit
-
-
Patrick von Platen authored
* add first files
* add xlm roberta integration tests
* make style
* flake 8 issues solved
-