
[RMP] Support Offline Batch processing of Recs Generation Pipelines #419

@jperez999


Problem:

As a user, I would like to run my Merlin Systems inference pipeline in an offline setting. This would let me produce a set of recommendations for all users, to be served from a data store, an email campaign, etc. It would also let me conduct rigorous testing and better compare behavior against other systems at both the operator and the system level.

Goal:

To do this, I need to be able to run my Merlin Systems inference graph without using Triton or the configs generated for it. This requires a new operator executor class that runs the ops in Python instead of on tritonserver. Execution should behave exactly as it does in the tritonserver setting: each operator should be provided the same inputs and return the same outputs. A minimal sketch of such an executor follows the list below.

  • Run an inference operator graph without tritonserver.
  • Requires no new user-facing API changes.
  • Execute the same graph that would be deployed to tritonserver.
  • Execute in a Python process.
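
As a minimal sketch of what this could look like (every name here, `Node`, `PythonExecutor`, and the callable ops, is invented for illustration, not the real Merlin Systems API):

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Node:
    """Illustrative stand-in for a graph node; not the real Merlin classes."""
    name: str
    inputs: List[str]  # names of upstream results this op consumes
    op: Callable[[Dict[str, object]], Dict[str, object]]


class PythonExecutor:
    """Runs an operator graph in-process instead of on tritonserver."""

    def transform(self, nodes: List[Node],
                  request: Dict[str, object]) -> Dict[str, object]:
        results = dict(request)
        pending = list(nodes)
        while pending:
            # A node is ready once all of its named inputs exist; this one
            # loop covers single chains and parallel branches alike.
            ready = [n for n in pending if all(i in results for i in n.inputs)]
            if not ready:
                raise ValueError("cycle or missing input in graph")
            for node in ready:
                # Same contract as the tritonserver runtime: each op gets
                # the same inputs and must return the same outputs.
                results.update(node.op({i: results[i] for i in node.inputs}))
                pending.remove(node)
        return results
```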

Constraints:

  • Use the same Merlin Systems graph/ops that were created for the inference pipeline and that would run on tritonserver.
  • Swap out the operator executor for a Python (non-Triton) version.
  • Allow for all types of graphs, supporting multiple chains and parallel execution of ALL available operators (see the graph sketch below).
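
Continuing the hypothetical sketch above, a diamond-shaped graph (two parallel chains merged by a ranking step) runs with no special casing; all ops here are invented stand-ins:

```python
# Invented ops only: candidate retrieval and a user feature lookup run as
# parallel chains, then merge at a ranking step, as in a multi-stage recs graph.
retrieve = Node("retrieve", ["user_ids"],
                lambda d: {"candidates": [["item0", "item1", "item2"]
                                          for _ in d["user_ids"]]})
features = Node("features", ["user_ids"],
                lambda d: {"features": [len(u) for u in d["user_ids"]]})
rank = Node("rank", ["candidates", "features"],
            lambda d: {"recs": [c[:f] for c, f in
                                zip(d["candidates"], d["features"])]})

# Node order does not matter; readiness is resolved from the named inputs.
out = PythonExecutor().transform([rank, retrieve, features],
                                 {"user_ids": ["u1", "u2"]})
print(out["recs"])  # [['item0', 'item1'], ['item0', 'item1']]
```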

TODO:


### Tasks
- [ ] Create an offline runtime that swaps operators according to usage, e.g. swap the Feast operator for a dataset merge operator.
- [ ] Ensure every operator returns batch-based results, e.g. FAISS should return a batched representation of its inputs: 2 users in should produce a (2, 100) shape, not (200,) (see the shape sketch after this list).
- [ ] Create an offline example from the current multi-stage example in Merlin.
- [ ] Ensure ensemble export does not prevent using non-Triton runtimes later.
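
As a toy illustration of the batch-shape requirement in the second task (plain NumPy, not the real FAISS operator):

```python
import numpy as np

# Hypothetical top-100 candidate lookup for a batch of 2 users.
candidate_ids = np.arange(200)           # flat (200,) loses user boundaries
batched = candidate_ids.reshape(2, 100)  # (2, 100): row i = user i's top-100

assert batched.shape == (2, 100)
assert batched[1][0] == 100              # first candidate for the second user
```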
