- 11 Dec, 2021 1 commit
-
-
Stas Bekman authored
* document MoE model approach
* additional info from Samyam
* fix
-
- 01 Dec, 2021 1 commit
-
-
Stas Bekman authored
* [doc] bf16/tf32 guide
* expand
* expand
* Update docs/source/performance.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 19 Nov, 2021 1 commit
-
-
NielsRogge authored
* Add integration test
* Fix typo
-
- 18 Nov, 2021 1 commit
-
-
NielsRogge authored
* First draft
* More improvements
* Improve conversion script
* Fix init weights for layer norm
* Fix correct model for conversion script
* Don't tie input and output embeddings
* Add print statements for debugging
* Add print statements for debugging
* Fix vocab size of model
* Improve documentation, remove fast tokenizer
* Add ImageGPTForImageClassification, improve docs
* Fix docs issue
* Set verbosity level back to info
* Improve tests
* Fix tests and add figure
* Delete tokenizer file
* Remove ImageGPTTokenizer from init files
* Remove ImageGPTLayer from init files
* Remove ImageGPT tokenizer from docs
* First draft of ImageGPTFeatureExtractor
* Fix typo
* Fix bug
* More improvements
* Apply suggestions from code review, add tests for feature extractor
* Fix layernorm
* Update save_pretrained method
* Fix issue
* Make all tests of ImageGPTFeatureExtractor pass
* Update code examples
* Rename model inputs to pixel_values
* Improve code examples
* Update init_weights to post_init
* Fix post_init
-
- 22 Oct, 2021 1 commit
-
-
Kevin Ko authored
* Update TP parallel GEMM image
* Delete parallelism-tp-parallel_gemm.png
* Update parallelism-tp-parallel_gemm.png
-
- 10 Jul, 2021 1 commit
-
-
Stas Bekman authored
* wip
* complete the doc
* missing img
* improve
* correction
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 15 Jun, 2021 2 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Add course banner
* Update course banner
-
- 05 Feb, 2021 1 commit
-
-
Patrick von Platen authored
* add big bird
* change teacher to mentor
* add proposal template
* adapt template
* delete old template
* correct some links
* finish template
* create big bird from template
* add big bird
* improve boxes
* finish boxes
* add pointers for BigBird
* finish big bird
* up
* up
* up
* up
* apply lysandres and sylvains suggestions
* delete bogus file
* correct markdown
* try different style
* try different style
* finalize
-
- 01 Feb, 2021 1 commit
-
-
Patrick von Platen authored
* add new model logic
* fix docs
* change structure
* improve add_new_model
* push new changes
* up
* up
* correct spelling
* improve docstring
* correct line length
* update readme
* correct links
* correct typos
* only add rst file for now
* Apply suggestions from code review 1
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Bram Vanroy <Bram.Vanroy@UGent.be>
* Apply suggestions from code review
Co-authored-by: Bram Vanroy <Bram.Vanroy@UGent.be>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Bram Vanroy <Bram.Vanroy@UGent.be>
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Pierric Cistac <Pierrci@users.noreply.github.com>
* finish adding all suggestions
* make style
* apply Niels feedback
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* apply sylvains suggestions
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Bram Vanroy <Bram.Vanroy@UGent.be>
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Pierric Cistac <Pierrci@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 07 Jul, 2020 1 commit
-
-
Joe Davison authored
* add first draft ppl guide
* upload imgs
* expand on strides
* ref typo
* rm superfluous past var
* add tokenization disclaimer
-
- 05 Jun, 2020 1 commit
-
-
Sylvain Gugger authored
* Add model summary
* Add link to pretrained models
-
- 06 Jan, 2020 2 commits
-
-
alberduris authored
-
alberduris authored
-
- 26 Sep, 2019 1 commit
-
-
thomwolf authored
-
- 08 Jul, 2019 1 commit
-
-
LysandreJik authored
-