Problem:
Training on a single GPU takes significantly longer than on multiple GPUs. Customers would like to accelerate their training workflows by distributing training across multiple GPUs on a single node.
Goal:
Enable customers to run data-parallel training within the Merlin Models training pipeline.
Constraints:
- Single node
- Embedding tables fit within the memory of a single GPU
- Follow NVIDIA best practices; i.e., use Horovod
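
To make the data-parallel pattern concrete, here is a minimal single-process sketch (NumPy only, no Horovod dependency) of what Horovod does per step: each worker holds an identical model replica, computes gradients on its shard of the global batch, and an allreduce averages the gradients so every replica applies the same update. The linear-regression model and shard-splitting scheme here are illustrative assumptions, not part of the Merlin Models pipeline.

```python
import numpy as np

# Simulate data-parallel training on one node: each "worker" (standing in for
# one GPU) holds a replica of the model, processes a shard of the global
# batch, and gradients are averaged across workers (the allreduce step
# Horovod provides) before the weight update.
rng = np.random.default_rng(0)
X = rng.normal(size=(32, 4))                    # global batch: 32 samples, 4 features
y = X @ np.array([1.0, -2.0, 0.5, 3.0]) + 0.1 * rng.normal(size=32)
w = np.zeros(4)                                 # model replica (identical on all workers)

def grad(w, Xs, ys):
    # Gradient of mean squared error over one shard.
    return 2.0 * Xs.T @ (Xs @ w - ys) / len(ys)

n_workers = 4
shards = [(X[i::n_workers], y[i::n_workers]) for i in range(n_workers)]

# Each worker computes a local gradient on its shard...
local_grads = [grad(w, Xs, ys) for Xs, ys in shards]
# ...then an allreduce averages them so every replica sees the same gradient.
avg_grad = np.mean(local_grads, axis=0)

# With equal shard sizes, the averaged gradient equals the single-GPU
# gradient over the full batch, so convergence behavior is preserved.
assert np.allclose(avg_grad, grad(w, X, y))

w -= 0.1 * avg_grad                             # identical SGD step on every replica
```

In the real setup, Horovod launches one process per GPU (e.g. `horovodrun -np 4 python train.py`) and performs the gradient allreduce with NCCL instead of the in-process averaging shown here.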
Starting Point:
Example