Yih-Dar authored
* Add cross attentions to TFGPT2Model
* change to is_pt_tf_cross_test
* A minor correction to a comment
* Remove n_ctx when creating self.crossattention

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
bd21ed40