This repository was archived by the owner on Jul 10, 2025. It is now read-only.

Commit b83d545

Update rfcs/20191106-tf2-tpu-savedmodel.md
Co-Authored-By: Edd Wilder-James <[email protected]>
1 parent 4a6fa51 commit b83d545

1 file changed, +1 -1 lines changed

rfcs/20191106-tf2-tpu-savedmodel.md

Lines changed: 1 addition & 1 deletion
@@ -46,7 +46,7 @@ Some major differences between CPU and TPU Graph:
 VarHandleOp, and consumed by ReadVariableOp.
 
 Also for reducing the number of TPU compilation, serving platforms(For example,
-Servomatic) prefers batching the inference requests with a few allowed batch
+[TensorFlow Serving](https://www.tensorflow.org/tfx/guide/serving)) prefers batching the inference requests with a few allowed batch
 sizes. This requires wrapping TPUPartitionedCall in another function, and called
 by BatchFunction.

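The paragraph touched by this diff says that batching requests into a few allowed batch sizes reduces the number of TPU compilations. A minimal sketch of that idea in plain Python follows. It is not the TensorFlow Serving or BatchFunction implementation; `ALLOWED_BATCH_SIZES`, `pick_batch_size`, and `pad_batch` are hypothetical names chosen for illustration, and the sketch assumes that each distinct batch size would trigger its own compilation.

```python
from bisect import bisect_left

# Hypothetical set of batch sizes the serving platform allows (sorted ascending).
ALLOWED_BATCH_SIZES = [1, 2, 4, 8]


def pick_batch_size(num_requests, allowed=ALLOWED_BATCH_SIZES):
    """Return the smallest allowed batch size that can hold num_requests."""
    if num_requests < 1:
        raise ValueError("need at least one request")
    index = bisect_left(allowed, num_requests)
    if index == len(allowed):
        raise ValueError("too many requests for the largest allowed batch size")
    return allowed[index]


def pad_batch(requests, pad_value=None, allowed=ALLOWED_BATCH_SIZES):
    """Pad requests up to an allowed batch size.

    Returns the padded list and the number of real requests, so the caller
    can discard the outputs that correspond to padding.
    """
    size = pick_batch_size(len(requests), allowed)
    padded = list(requests) + [pad_value] * (size - len(requests))
    return padded, len(requests)


# Any request count from 1 to 8 maps onto one of only four batch sizes, so
# (under the assumption above) at most four compilations would be needed.
distinct_sizes = sorted({pick_batch_size(n) for n in range(1, 9)})
```

Under these assumptions, three incoming requests are padded to a batch of four, and the caller keeps only the first three outputs.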
0 commit comments