ch4/shm: enable CMA by default #7639

yfguo · 2025-10-23T03:49:57Z

CMA should be enabled by default because test results show that CMA outperforms POSIX for mid-sized to large messages.

The CMA threshold is set to 8 KB to match the eager threshold. For 8 KB messages, the POSIX pipeline is about 10% faster than CMA in the intra-NUMA case on Cascade Lake (fig 1), but not in the other cases. Setting the CMA threshold equal to the eager threshold, and thereby bypassing the POSIX pipeline completely, therefore seems like a good idea.
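To make the effect of the threshold concrete, here is a minimal sketch of the protocol selection described above. It is not the MPICH implementation: the enum, the struct, the field names, and `select_protocol` are all hypothetical, and the boundary behavior (a message exactly at the threshold goes to CMA) is an assumption.

```c
#include <stddef.h>

/* Hypothetical names for illustration only; these are not MPICH identifiers. */
enum shm_protocol { SHM_EAGER, SHM_POSIX_PIPELINE, SHM_CMA };

struct shm_thresholds {
    size_t eager_threshold; /* messages smaller than this are sent eagerly */
    size_t cma_threshold;   /* messages at least this large use CMA */
    int cma_enabled;        /* nonzero when CMA is available and enabled */
};

/* Pick a transfer path for a message of msg_size bytes.
 * Assumption: sizes in [eager_threshold, cma_threshold) fall back to the
 * POSIX pipeline, so equal thresholds leave that range empty. */
static enum shm_protocol select_protocol(const struct shm_thresholds *t,
                                         size_t msg_size)
{
    if (msg_size < t->eager_threshold)
        return SHM_EAGER;
    if (t->cma_enabled && msg_size >= t->cma_threshold)
        return SHM_CMA;
    return SHM_POSIX_PIPELINE;
}
```

With both thresholds at 8192 and CMA enabled, no message size maps to `SHM_POSIX_PIPELINE` in this sketch. Raising `cma_threshold` to 16384 would send 8 KB messages through the pipeline again, which is the trade-off weighed above for the intra-NUMA Cascade Lake case.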

Also, CMA does not seem to be affected by cross-NUMA access, where it far outperforms the POSIX pipeline.

@intel may need to change this if using an eager module other than iqueue.

On Intel Xeon Gold 6226R (Cascade Lake)

On AMD, this seems to be an OK setting, although AMD shows an odd drop at the 16 KB message size in almost all cases.

TODO: add numbers for Aurora.

Pull Request Description

Author Checklist

Provide Description
Particularly focus on why, not what. Reference background, issues, test failures, xfail entries, etc.
Commits Follow Good Practice
Commits are self-contained and do not do two things at once.
Commit message is of the form: module: short description
Commit message explains what's in the commit.
Passes All Tests
Whitespace checker. Warnings test. Additional tests via comments.
Contribution Agreement
For non-Argonne authors, check contribution agreement.
If necessary, request an explicit comment from your company's PR approval manager.

CMA should be enabled by default because test results show that CMA outperforms POSIX for mid-sized to large messages. The default CMA threshold is set to 8 KB, which matches the eager threshold determined by the iqueue size.

hzhou

LGTM

hzhou · 2025-10-23T13:54:57Z

test:mpich/ch4/most

yfguo · 2025-10-23T14:28:11Z

Will merge this after getting the Aurora numbers and verifying the latency numbers as well.

ch4/shm: enable CMA by default

f9158ad


hzhou approved these changes Oct 23, 2025
