Tensor parallel muon #1865

nagarajankarthik · 2025-10-15T04:21:57Z

This pull request attempts to add support for running the muon optimizer with tensor parallelism. It builds upon the code introduced in this pull request( Dist_Muon optimizer support #1813).

Signed-off-by: Boxiang Wang <[email protected]>

…_list.

copy-pr-bot · 2025-10-15T04:22:01Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

BoxiangW · 2025-10-22T05:58:24Z

Hi thanks for the contribution, we have already merged this https://github.com/NVIDIA/Megatron-LM/blob/dev/megatron/core/optimizer/muon.py in our new dev branch, please feel free to take a look.

BoxiangW and others added 15 commits September 15, 2025 14:52

Init commit for MCore with emergent-optimizer

d69d4d8

Signed-off-by: Boxiang Wang <[email protected]>

Format

c466bdc

Signed-off-by: Boxiang Wang <[email protected]>

Copyright

d97869e

Signed-off-by: Boxiang Wang <[email protected]>

Fix

1094261

Signed-off-by: Boxiang Wang <[email protected]>

change import name

a778f61

Signed-off-by: Boxiang Wang <[email protected]>

Switch to newer layerwise version

e38474a

Signed-off-by: Boxiang Wang <[email protected]>

Chanage to newer version of layerwisedistributed opt

d0542d5

Signed-off-by: Boxiang Wang <[email protected]>

fix

9612f2f

Signed-off-by: Boxiang Wang <[email protected]>

Working version

334b96d

Signed-off-by: Boxiang Wang <[email protected]>

Add launch script

c82a9f6

Signed-off-by: Boxiang Wang <[email protected]>

Improve example script

1cd42cb

Signed-off-by: Boxiang Wang <[email protected]>

Fix EP issue

dd7caa0

Signed-off-by: Boxiang Wang <[email protected]>

Fix EP issue

1ec8e27

Signed-off-by: Boxiang Wang <[email protected]>

Fix launch script

e747256

Signed-off-by: Boxiang Wang <[email protected]>

Updated condition for adding output layer parameters to linear_params…

632d73c

…_list.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Tensor parallel muon #1865

Tensor parallel muon #1865

Uh oh!

nagarajankarthik commented Oct 15, 2025

Uh oh!

copy-pr-bot bot commented Oct 15, 2025

Uh oh!

BoxiangW commented Oct 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Tensor parallel muon #1865

Are you sure you want to change the base?

Tensor parallel muon #1865

Uh oh!

Conversation

nagarajankarthik commented Oct 15, 2025

Uh oh!

copy-pr-bot bot commented Oct 15, 2025

Uh oh!

BoxiangW commented Oct 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants