Skip to content

Conversation

@shbiswas834
Copy link

@shbiswas834 shbiswas834 commented Sep 23, 2025

5% uplift from B200
28% uplift from MI350

YanXiong-Meta and others added 2 commits September 2, 2025 14:38
Summary:
Pull Request resolved: pytorch#4648

X-link: facebookresearch/FBGEMM#1682

Add TBE/VBE bench  (-- device-with-speclist) that takes dim lists for args under tbe_training.

#### Summary

This diff introduces a new feature to the TBE (Table Batched Embedding) bench, which allows for benchmarking with a list of devices and speclist. The changes include:

*   **tbe_data_config_loader.py:** Added a new class `TBEDataListConfig` to load a list of `TBEDataConfig` objects. Modified the `TBEDataConfigLoader` class to load a list of `TBEDataConfig` objects.
*   **bench_runs.py:** Added a new function `bench_warmup_with_spec` to perform warm-up runs with a specified list of devices and batch sizes.
*   **tbe_data_config_param_models.py:** Added a new class `BatchListParams` to represent a list of batch sizes and their corresponding standard deviations.
*   **__init__.py:** Imported the new `TBEDataListConfig` class and the `benchmark_requests_with_spec` function.
*   **tbe_training_benchmark.py:** Added a new function `benchmark_requests_with_spec` to benchmark requests with a specified list of devices and batch sizes.

These changes enable the TBE bench to support benchmarking with a list of devices and speclist, making it more flexible and efficient for testing and optimizing TBE performance.

Reviewed By: spcyppt

Differential Revision: D78390216
@shbiswas834 shbiswas834 changed the title R&R model benchmark and optimizations fwd optimizations Sep 23, 2025
@shbiswas834 shbiswas834 reopened this Sep 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants