Commits · dfa7b580e9863c38c2f0e0dedf0958c2eab9cb48 · chenpangpang / transformers

26 Apr, 2024 1 commit

[`BERT`] Add support for sdpa (#28802) · dfa7b580

JB (Don) authored Apr 26, 2024

* Adding SDPA support for BERT

* Using the proper input name for testing model input in inference()

* Adding documentation for SDPA in BERT model page

* Use the stable link for the documentation

* Adding a gate to only call .contiguous() for torch < 2.2.0

* Additions and fixes to the documentation

* Minor updates to documentation

* Adding extra requirements needed for the contiguous() bug

* Adding "Adapted from" in place of the "Copied from"

* Add benchmark speedup tables to the documentation

* Minor fixes to the documentation

* Use ClapText as a replacement for Bert in the Copied-From

* Some more fixes for the fix-copies references

* Overriding the test_eager_matches_sdpa_generate in bert tests to not load with low_cpu_mem_usage

[test all]

* Undo changes to separate test

* Refactored SDPA self attention code for KV projections

* Change use_sdpa to attn_implementation

* Fix test_sdpa_can_dispatch_on_flash by preparing input (required for MultipleChoice models)

dfa7b580

11 Mar, 2024 1 commit

Fixed broken link (#29558) · b45c0f55

Amrit Gupta authored Mar 11, 2024

Fixed broken link for Resources -> Token Classification -> Finetuning BERT for named-entity

b45c0f55

03 Nov, 2023 1 commit

[Docs] Model_doc structure/clarity improvements (#26876) · 5964f820

Maria Khalusova authored Nov 03, 2023

* first batch of structure improvements for model_docs

* second batch of structure improvements for model_docs

* more structure improvements for model_docs

* more structure improvements for model_docs

* structure improvements for cv model_docs

* more structural refactoring

* addressed feedback about image processors

5964f820

20 Jun, 2023 1 commit

Migrate doc files to Markdown. (#24376) · eb849f66

Sylvain Gugger authored Jun 20, 2023

* Rename index.mdx to index.md

* With saved modifs

* Address review comment

* Treat all files

* .mdx -> .md

* Remove special char

* Update utils/tests_fetcher.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

---------
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

eb849f66

17 Mar, 2023 1 commit

fix(docs): fix task guide links in model docs (#22226) · 074490b2

Seb0 authored Mar 17, 2023

fix(docs): task guide links in model docs

074490b2

21 Feb, 2023 1 commit

Adding task guides to resources (#21704) · 78a53d59

Maria Khalusova authored Feb 21, 2023

* added resources: links to task guides that support these models

* minor polishing

* conflict resolved

* link fix

* Update docs/source/en/model_doc/vision-encoder-decoder.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

78a53d59

15 Feb, 2023 1 commit

Refactor model summary (#21408) · 7a5533b2

Steven Liu authored Feb 15, 2023

* first draft of model summary

* restructure docs

* finish first draft

* ✨minor reviews and edits

* apply feedback

* save important info, create new page for attention

* add attention doc to toctree

* ✨ few more minor fixes

7a5533b2

01 Nov, 2022 1 commit

Add BERT resources (#19852) · dec8578e

Steven Liu authored Nov 01, 2022

* add resources for bert

* add course chapters

* apply reviews

* add pipeline icons and community resource

* fix buttons

dec8578e

27 Jun, 2022 1 commit

Add a TF in-graph tokenizer for BERT (#17701) · ee0d001d

Matt authored Jun 27, 2022

* Add a TF in-graph tokenizer for BERT

* Add from_pretrained

* Add proper truncation, option handling to match other tokenizers

* Add proper imports and guards

* Add test, fix all the bugs exposed by said test

* Fix truncation of paired texts in graph mode, more test updates

* Small fixes, add a (very careful) test for savedmodel

* Add tensorflow-text dependency, make fixup

* Update documentation

* Update documentation

* make fixup

* Slight changes to tests

* Add some docstring examples

* Update tests

* Update tests and add proper lowercasing/normalization

* make fixup

* Add docstring for padding!

* Mark slow tests

* make fixup

* Fall back to BertTokenizerFast if BertTokenizer is unavailable

* Fall back to BertTokenizerFast if BertTokenizer is unavailable

* make fixup

* Properly handle tensorflow-text dummies

ee0d001d

03 May, 2022 1 commit

[FlaxBert] Add ForCausalLM (#16995) · cd9274d0

Sanchit Gandhi authored May 03, 2022

* [FlaxBert] Add ForCausalLM

* make style

* fix output attentions

* Add RobertaForCausalLM

* remove comment

* fix fx-to-pt model loading

* remove comment

* add modeling tests

* add enc-dec model tests

* add big_bird

* add electra

* make style

* make repo-consistency

* add to docs

* remove roberta test

* quality

* amend cookiecutter

* fix attention_mask bug in flax bert model tester

* tighten pt-fx thresholds to 1e-5

* add 'copied from' statements

* amend 'copied from' statements

* amend 'copied from' statements

* quality

cd9274d0

04 Apr, 2022 1 commit

Enable doc in Spanish (#16518) · b9a768b3

Sylvain Gugger authored Apr 04, 2022

* Reorganize doc for multilingual support

* Fix style

* Style

* Toc trees

* Adapt templates

b9a768b3

17 Dec, 2021 1 commit

Convert rst to mdx bert (#14806) · 77d6c826

Lysandre Debut authored Dec 17, 2021

* BERT to mdx
mdx :)
c

* Update docs/source/model_doc/bert.mdx
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Remove all
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Julien Chaumond <julien@huggingface.co>

77d6c826