Description
Problem:
Merlin provides documentation and a number of example notebooks on how to use tools like NVTabular, Dataloader and Merlin Models. To build a training and evaluation pipeline, a Data Scientist needs to go through that material, copy and paste code snippets demonstrating the API, and glue them together into scripts for experimentation and benchmarking.
It might also not be clear to users which advanced API options of Merlin Models can be exposed as hyperparameters and potentially improve model accuracy.
Goal:
This RMP provides a Quick-start for building ranking model training pipelines.
It addresses the ranking models part of the larger RMP NVIDIA-Merlin/models#732, in particular steps 4-7 of the Data Scientist journey when experimenting with Merlin Models.
The Quick-start for ranking is composed of:
Template scripts
- Generic template script for preprocessing
- Generic template script for training ranking models, exposing their main hyperparameters.
It includes support for ranking models like DLRM, DCN-v2, DeepFM and Wide&Deep, and for multi-task learning (MTL) models like MMoE, CGC and PLE.
Documentation
- Documentation of the scripts command line arguments
- Documentation of best practices learned from our experimentation:
- Hyperparameter tuning: search space, most important hyperparameters, and the best hyperparameters found for STL and MTL models on the public TenRec dataset
- Intuitions on API options (building blocks, arguments) that can improve model accuracy
Constraints:
- Preprocessing - The preprocessing template notebook will perform some basic feature encoding for categorical variables (e.g. Categorify) and continuous variables (e.g. standardization). The customer can expand the template with the more advanced preprocessing ops demonstrated in our examples (see the sketch after this list).
- Training - The training and evaluation script for Merlin Models should be fully configurable, taking as input the parquet files and schema, plus a number of hyperparameters exposed via command line arguments. The output of this script should be the evaluation metrics, logged to a CSV file and also to Weights&Biases.
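For illustration, the kind of basic encoding described above can be expressed with NVTabular in a few lines. This is a minimal sketch, not the actual template; the column names, target and paths are placeholders:

```python
import nvtabular as nvt
from nvtabular import ops

# Placeholder feature/target names: replace with the dataset's actual columns.
cat_features = ["user_id", "item_id", "item_category"] >> ops.Categorify()
cont_features = ["item_price", "user_age"] >> ops.Normalize()
targets = ["click"]

workflow = nvt.Workflow(cat_features + cont_features + targets)

train = nvt.Dataset("train/*.parquet")
valid = nvt.Dataset("valid/*.parquet")

# Fit encoding statistics (category mappings, means/stds) on train,
# then transform both splits to parquet.
workflow.fit(train)
workflow.transform(train).to_parquet("preproc/train")
workflow.transform(valid).to_parquet("preproc/valid")

# The fitted workflow carries the schema consumed by the training script
# and is reused later when building the Triton ensemble.
workflow.save("preproc/workflow")
```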
Starting Point:
The ranking training scripts we have developed for the MTL research project.
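For context, the pattern such a generic training/eval script wraps is short in Merlin Models; the script essentially maps constructor and fit arguments like the ones below to command line hyperparameters. This is a minimal sketch assuming the preprocessed parquet files and schema from the previous step (and it uses the older `PredictionTask` API that the tasks below migrate to `OutputBlock`); it is not the actual script:

```python
import tensorflow as tf
import merlin.models.tf as mm
from merlin.io import Dataset

train = Dataset("preproc/train/*.parquet")
valid = Dataset("preproc/valid/*.parquet")

# DLRM is shown as one example; the script would let the user select DCN-v2,
# DeepFM, Wide&Deep, MMoE, etc. and expose these arguments as CLI flags.
model = mm.DLRMModel(
    train.schema,
    embedding_dim=64,
    bottom_block=mm.MLPBlock([128, 64]),
    top_block=mm.MLPBlock([128, 64, 32]),
    prediction_tasks=mm.BinaryClassificationTask("click"),
)

model.compile(optimizer="adam", metrics=[tf.keras.metrics.AUC()])
model.fit(train, validation_data=valid, batch_size=16 * 1024, epochs=1)

# The evaluation metrics are what the script would log to CSV and Weights&Biases.
metrics = model.evaluate(valid, batch_size=16 * 1024, return_dict=True)
print(metrics)
```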
Tasks:
- [Task] Create a generic dataset preprocessing template for ranking models #916
- [Task] Update multi-task learning models and tests to use `OutputBlock` instead of `PredictionTasks` models#914
- Update the research STL/MTL ranking training script to use the latest API (`PredictionBlock` instead of `PredictionTask`) #917
- Create a generic STL/MTL ranking training/eval script based on the research scripts #918
Tenrec dataset experiments
- Run hypertuning experiments for the STL ranking models: MLP, Wide&Deep, DCN-v2, DeepFM, DLRM for Tenrec dataset #680
- Run hypertuning experiments for the MTL ranking models: MMoE, PLE for Tenrec dataset #681
Documentation
- Create documentation for Quick-Start for Ranking (CLI args, best practices and tutorials) #666
- Create the main Quick-start for ranking documentation
- Create a document explaining all arguments for `preprocessing.py` and `ranking_train_eval.py`
- Create a document with the best practices for preprocessing
- Create a document with the best practices for building and training ranking models
- Create a document with the best practices for hypertuning and with the results of the TenRec experiments (in progress)
- Create a tutorial on using Quick-start scripts with W&B Sweeps for HPO
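As a rough idea of what the W&B Sweeps tutorial would cover, a sweep definition points W&B at the training script and maps a few of its command line hyperparameters to a search space. The argument names, metric name and ranges below are placeholders, not the final quick-start interface:

```python
import wandb

# Hypothetical sweep over the training script's CLI hyperparameters.
sweep_config = {
    "program": "ranking_train_eval.py",
    "method": "bayes",
    "metric": {"name": "auc", "goal": "maximize"},  # placeholder metric name
    "parameters": {
        "model": {"values": ["dlrm", "dcn", "deepfm", "wide_n_deep"]},
        "lr": {"distribution": "log_uniform_values", "min": 1e-4, "max": 1e-2},
        "embedding_dim": {"values": [32, 64, 128]},
        "dropout": {"values": [0.0, 0.1, 0.2, 0.3]},
    },
}

sweep_id = wandb.sweep(sweep_config, project="quickstart-ranking")
# Trials are then launched from the shell with: wandb agent <entity/project/sweep_id>
```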
Deployment and inference with Triton
- [TASK] Quick start for Ranking - Include support for Triton inference #919
- Create Quick-start script to build and export a Triton ensemble (NVT + Models) using Systems #912 (see the sketch after this list)
- Create a Quick-start notebook demonstrating how to prepare an inference request to Triton #913
- Create markdown documentation on how to use the quick-start deployment script #914
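The ensemble export task (#912) roughly corresponds to the following Merlin Systems pattern, which chains the fitted NVTabular workflow and the trained model into a single Triton ensemble. This is a minimal sketch, assuming the workflow and model artifacts saved in the earlier steps; the paths and label name are placeholders:

```python
import tensorflow as tf
import nvtabular as nvt
from merlin.systems.dag.ensemble import Ensemble
from merlin.systems.dag.ops.workflow import TransformWorkflow
from merlin.systems.dag.ops.tensorflow import PredictTensorflow

# Reload the artifacts produced by the preprocessing and training steps.
workflow = nvt.Workflow.load("preproc/workflow")
model = tf.keras.models.load_model("ranking_model")

# The label is not available at inference time, so drop it from the serving workflow.
workflow.remove_inputs(["click"])

# Raw request features flow through the NVTabular workflow, then into the model.
serving_ops = (
    workflow.input_schema.column_names
    >> TransformWorkflow(workflow)
    >> PredictTensorflow(model)
)

# Export a Triton ensemble (NVT + Models) that Triton Inference Server can load.
ensemble = Ensemble(serving_ops, workflow.input_schema)
ensemble.export("/workspace/ensemble")
```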