Open
Description
Problem:
Goal:
New Functionality
- Models
- ...
- Transformers4Rec
- ...
- NVTabular
- ...
- Systems
- ...
Constraints:
##Architectural consideration
NA
Starting Point:
Model Parallel Support
-
Evaluation of HugeCTR, TorchRec, Distributed Embeddings, TFRA, PersiaML for inclusion in Merlin - Distributed embedding table support in merlin-models (SOK Plugin, Distributed Embeddings)
- Model Parallel Training ( related to SOK integration in Merlin Models )
- Third Gen Embeddings
Feature engineering that reduces embedding size
- Mixed Dimension Embeddings
- Frequency Capping
- Frequency Hashing
- Bloom Embeddings
- TT-Rec
Reduced Precision Support
- Sparse Row-wise Optimizers (Facebook Research DLRM)
- Reduced Precision Optimizers
- Reduced Embedding Precision
Not storing user embeddings
- Represent user as item embedding aggregations (YouTube DNN)
Inference Support
- Hierarchical Parameter Server Support
Serving