This repository was archived by the owner on Jul 10, 2025. It is now read-only.
Commit 98c7b77
committed
| Status | (Proposed / Accepted / Implemented / Obsolete) |
| :------------ | :------------------------------------------------------ |
| **RFC #** | [NNN](https://github.com/tensorflow/community/pull/NNN) |
: : (update when you have community PR #) :
| **Author(s)** | Andrew Audibert ([email protected]) Rohan Jain |
: : ([email protected]) :
| **Sponsor** | Jiri Simsa ([email protected]) |
| **Updated** | 2019-01-13
Provide an API and implementation of a tf.data service which can process tf.data
datasets in a distributed manner. The service can be run outside the TensorFlow
cluster or be exported as a gRPC service by TensorFlow servers.
Goals:
- Enable horizontal scaling of dataset computation to improve performance of
input-bound dataset pipelines.
- Improve tf.data integration with the tf.distribute API. In particular,
support dynamic sharding of data across multiple processes.
- Provide visitation guarantees for distributed training jobs.1 parent ca8039f commit 98c7b77
1 file changed
+591
-0
lines changed
0 commit comments