[do not land] Example augmenting GPU profiler trace with model stack traces #1990

yushangdi · 2025-11-04T18:17:15Z

TORCH_ENRICH_RPOFILER_STACK_TRACE=1  NGPU=8 CONFIG_FILE=./torchtitan/models/llama3/train_configs/debug_model.toml ./run_train.sh --model.name compiler_toolkit.llama3 --parallelism.data_parallel_shard_degree=2 --parallelism.tensor_parallel_degree=4 --model.flavor=debugmodel_flex_attn

Requires pytorch/pytorch#167171 and pytorch/pytorch#167114

You can check the augmented trace in manifold/explorer/pytorch/tree/shangdiy/rank0_trace_augmented.json

cc @SherlockNoMad

example profiling llama

e86c4a2

yushangdi requested review from fegin, tianyu-l, wconstab and wwwjn as code owners November 4, 2025 18:17

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Nov 4, 2025

yushangdi marked this pull request as draft November 4, 2025 18:17

yushangdi changed the title ~~Example augmenting GPU profiler trace with model stack traces~~ [do not land] Example augmenting GPU profiler trace with model stack traces Nov 4, 2025

yushangdi mentioned this pull request Nov 4, 2025

Add model code stack trace to torch.profile pytorch/pytorch#166677

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[do not land] Example augmenting GPU profiler trace with model stack traces #1990

[do not land] Example augmenting GPU profiler trace with model stack traces #1990

Uh oh!

yushangdi commented Nov 4, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[do not land] Example augmenting GPU profiler trace with model stack traces #1990

Are you sure you want to change the base?

[do not land] Example augmenting GPU profiler trace with model stack traces #1990

Uh oh!

Conversation

yushangdi commented Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yushangdi commented Nov 4, 2025 •

edited

Loading