You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Jul 10, 2025. It is now read-only.
@@ -46,16 +46,16 @@ Some major differences between CPU and TPU Graph:
46
46
VarHandleOp, and consumed by ReadVariableOp.
47
47
48
48
Also for reducing the number of TPU compilation, serving platforms(For example,
49
-
[TensorFlow Serving](https://www.tensorflow.org/tfx/guide/serving)) prefers batching the inference requests with a few allowed batch
49
+
Servomatic) prefers batching the inference requests with a few allowed batch
50
50
sizes. This requires wrapping TPUPartitionedCall in another function, and called
51
51
by BatchFunction.
52
52
53
53
Below is an intuitive example of how a TPU graph is different from a CPU one:
54
54
55
-

55
+

0 commit comments