[DP] Functional DP for GPT-OSS #1137
base: main
Conversation
Force-pushed 1efb3dc to b10487a
kyuyeunk left a comment:
Please add torchax dp related unit tests.
kyuyeunk left a comment:
Please add torchax dp related unit tests.
Also, please address this comment.
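For reference, a minimal, self-contained sketch of the kind of DP-related check such a unit test might perform. This is an illustration only, assuming plain JAX sharding APIs rather than the repo's torchax test helpers; the test name and shapes below are hypothetical and not from this PR.

```python
import jax
import jax.numpy as jnp
import numpy as np
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P


def test_batch_shards_evenly_across_data_axis():
    # Build a 1-D mesh with a single "data" axis over all local devices.
    devices = np.array(jax.devices())
    mesh = Mesh(devices, axis_names=("data",))

    # Batch size is a multiple of the device count so DP sharding is even.
    ndev = len(devices)
    batch = jnp.arange(2 * ndev * 4).reshape(2 * ndev, 4)

    # Shard the batch dimension across the "data" axis; replicate the rest.
    sharded = jax.device_put(batch, NamedSharding(mesh, P("data", None)))

    # Each DP shard holds an equal slice of the batch, and the sharded
    # array reassembles to the original values.
    assert all(s.data.shape[0] == 2 for s in sharded.addressable_shards)
    np.testing.assert_array_equal(np.asarray(sharded), np.asarray(batch))
```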
Added an e2e model parallelism test for Llama3.1 1B for torchax.
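A rough sketch of what an e2e test along these lines could look like. The model id, prompts, and the `data_parallel_size` engine argument are assumptions, not taken from the PR's actual test, and a real test would manage engine lifetimes and TPU resources more carefully.

```python
from vllm import LLM, SamplingParams

# Placeholder ~1B checkpoint; the PR's test may use a different model.
MODEL = "meta-llama/Llama-3.2-1B"
PROMPTS = ["The capital of France is", "1 + 1 ="]


def _greedy_outputs(dp_size: int) -> list[str]:
    llm = LLM(model=MODEL, data_parallel_size=dp_size)
    params = SamplingParams(temperature=0.0, max_tokens=16)
    return [out.outputs[0].text for out in llm.generate(PROMPTS, params)]


def test_dp2_matches_dp1_greedy():
    # Greedy decoding should be invariant to the data-parallel degree.
    assert _greedy_outputs(dp_size=2) == _greedy_outputs(dp_size=1)
```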
Force-pushed e1f54cb to d10b6ce
Description
Add functional DP support for the GPT-OSS Torchax backend.
Verified that baseline throughput is unchanged (5037.82); with DP=2, throughput improves 1.54x (7781.92 / 5037.82 ≈ 1.54).
Validated numerical correctness with offline_inference.py.
Full details: https://paste.googleplex.com/5240826907197440
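For context, a minimal sketch of how DP=2 could be exercised offline, assuming vLLM's `data_parallel_size` engine argument and the public `openai/gpt-oss-20b` checkpoint; the exact script and flags behind the numbers above are in the linked paste, not reproduced here.

```python
from vllm import LLM, SamplingParams

# Assumed invocation: replicate the GPT-OSS model across two DP groups on
# the torchax backend; backend selection follows the repo's usual setup.
llm = LLM(model="openai/gpt-oss-20b", data_parallel_size=2)

params = SamplingParams(temperature=0.0, max_tokens=64)
outputs = llm.generate(["Explain data parallelism in one sentence."], params)
for out in outputs:
    print(out.outputs[0].text)
```

With two DP replicas each serving their own requests, aggregate throughput can approach 2x the baseline, consistent with the 1.54x reported above.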
Tests
https://buildkite.com/tpu-commons/tpu-inference-ci/builds/5712
https://buildkite.com/tpu-commons/tpu-inference-ci/builds/5851
Checklist
Before submitting this PR, please make sure: