Skip to content

Commit 59fe789

Browse files
authored
Add a note for TPU issues for deberta_v3 (#758)
1 parent 773c884 commit 59fe789

File tree

2 files changed

+6
-0
lines changed

2 files changed

+6
-0
lines changed

keras_nlp/models/deberta_v3/deberta_v3_backbone.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -47,6 +47,9 @@ class DebertaV3Backbone(Backbone):
4747
dimensions. To load preset architectures and weights, use the `from_preset`
4848
constructor.
4949
50+
Note: `DebertaV3Backbone` has a performance issue on TPUs, and we recommend
51+
other models for TPU training and inference.
52+
5053
Disclaimer: Pre-trained models are provided on an "as is" basis, without
5154
warranties or conditions of any kind. The underlying model is provided by a
5255
third party and subject to a separate license, available

keras_nlp/models/deberta_v3/deberta_v3_classifier.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -44,6 +44,9 @@ class DebertaV3Classifier(Task):
4444
`fit()`, `predict()`, and `evaluate()`. This is done by default when
4545
creating the model with `from_preset()`.
4646
47+
Note: `DebertaV3Backbone` has a performance issue on TPUs, and we recommend
48+
other models for TPU training and inference.
49+
4750
Disclaimer: Pre-trained models are provided on an "as is" basis, without
4851
warranties or conditions of any kind. The underlying model is provided by a
4952
third party and subject to a separate license, available

0 commit comments

Comments
 (0)