1. 21 Jul, 2020 1 commit
  2. 08 Jul, 2020 1 commit
  3. 22 Jun, 2020 1 commit
  4. 19 Jun, 2020 1 commit
  5. 29 May, 2020 1 commit
  6. 28 May, 2020 1 commit
    • Use float32 activation in Transformer. · 94b1efc1
      Reed Wanderman-Milne authored
      Float32 is used if the model uses mixed precision with bfloat16. Float16 activations are unchanged.
      
      The motivation is that BERT with the LAMB optimizer and a gelu activation has an unstable loss when gelu is computed in bfloat16. Unfortunately, it is not easy to check whether the LAMB optimizer and gelu are used, and there may be other cases that work better with float32 activations than with bfloat16 activations, so we always compute the activation in float32 (a minimal sketch of the pattern follows at the end of this entry).
      
      PiperOrigin-RevId: 313618322
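
      A minimal sketch of the pattern this commit describes, assuming a standalone tf.keras layer; the FeedForward class, its arguments, and the use of the current tf.keras.mixed_precision / tf.nn.gelu APIs are illustrative assumptions, not the actual modeling code:

      import tensorflow as tf

      # Layers read the global dtype policy at construction time, so set it first.
      tf.keras.mixed_precision.set_global_policy("mixed_bfloat16")


      class FeedForward(tf.keras.layers.Layer):
          """Transformer-style feed-forward block with a float32 gelu (hypothetical)."""

          def __init__(self, hidden_size, intermediate_size, **kwargs):
              super().__init__(**kwargs)
              self._intermediate = tf.keras.layers.Dense(intermediate_size)
              self._output = tf.keras.layers.Dense(hidden_size)

          def call(self, inputs):
              x = self._intermediate(inputs)
              # gelu in bfloat16 can destabilize the loss (e.g. BERT + LAMB),
              # so compute the activation in float32 and cast back afterwards.
              x = tf.cast(tf.nn.gelu(tf.cast(x, tf.float32)), x.dtype)
              return self._output(x)


      layer = FeedForward(hidden_size=8, intermediate_size=32)
      print(layer(tf.zeros([2, 4, 8])).dtype)  # bfloat16; only the gelu ran in float32

      Casting unconditionally avoids having to detect the LAMB-plus-gelu combination at layer construction time.
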
  7. 12 May, 2020 2 commits
  8. 10 May, 2020 1 commit
  9. 05 May, 2020 1 commit
  10. 21 Apr, 2020 1 commit
  11. 01 Apr, 2020 1 commit
  12. 27 Mar, 2020 1 commit
  13. 09 Mar, 2020 1 commit
  14. 03 Mar, 2020 2 commits
  15. 26 Feb, 2020 2 commits
  16. 25 Feb, 2020 1 commit
  17. 21 Feb, 2020 1 commit
  18. 08 Feb, 2020 1 commit
  19. 21 Jan, 2020 1 commit
    • Remove compute_output_shape. · 0e0a94a6
      Hongkun Yu authored
      Keras: "manual" shape inference is only required if the layer is dynamic; otherwise we rely on TF's static shape inference capabilities (see the sketch at the end of this entry).
      
      PiperOrigin-RevId: 290821518
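
      A minimal sketch of the reasoning above, assuming a plain (non-dynamic) custom layer; the Projection class is hypothetical and unrelated to the actual modeling code. Keras infers the output shape by tracing call() on symbolic inputs, so compute_output_shape only matters for layers built with dynamic=True:

      import tensorflow as tf


      class Projection(tf.keras.layers.Layer):
          """Custom layer with no compute_output_shape override (hypothetical)."""

          def __init__(self, units, **kwargs):
              super().__init__(**kwargs)
              self._dense = tf.keras.layers.Dense(units)

          def call(self, inputs):
              # Static shape inference traces this call on a symbolic tensor,
              # so Keras already knows the output shape without a manual override.
              return self._dense(inputs)


      inputs = tf.keras.Input(shape=(32,))
      outputs = Projection(16)(inputs)
      print(outputs.shape)  # (None, 16), inferred without compute_output_shape
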
  20. 20 Nov, 2019 1 commit
  21. 13 Nov, 2019 1 commit
  22. 11 Nov, 2019 1 commit
    • Release keras bert: · f1d35b4e
      Hongkun Yu authored
      - Update classifier example.
      - Add new converted checkpoints.
      - Update benchmarks.
      
      PiperOrigin-RevId: 279762797