Skip to content

[RMP] Performant large embedding table support #733

Open
@EvenOldridge

Description

@EvenOldridge

Problem:

Goal:

New Functionality

  • Models
    • ...
  • Transformers4Rec
    • ...
  • NVTabular
    • ...
  • Systems
    • ...

Constraints:

##Architectural consideration
NA

Starting Point:

Model Parallel Support

Feature engineering that reduces embedding size

  • Mixed Dimension Embeddings
  • Frequency Capping
  • Frequency Hashing
  • Bloom Embeddings
  • TT-Rec

Reduced Precision Support

  • Sparse Row-wise Optimizers (Facebook Research DLRM)
  • Reduced Precision Optimizers
  • Reduced Embedding Precision

Not storing user embeddings

  • Represent user as item embedding aggregations (YouTube DNN)

Inference Support

  • Hierarchical Parameter Server Support

Serving

Example

Metadata

Metadata

Labels

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions