P2PCommunication Benchmark over Cuda IPC #5102

nsarka · 2025-09-02T13:26:59Z

Sample output, luna80g partition on dlcluster:

nsarkauskas@luna-prod-1315-au:/opt/pytorch/Fuser$ mpirun -np 2 -H luna-prod-1315-au:2 ./python/build/nvfuser_p2p_communication_bench
Starting P2P communication benchmark...
Repetitions per size: 100
Number of devices: 2
Testing tensor sizes from 2^10 to 2^26 elements

Message Size   Elements    Latency (μs)  Bandwidth (GB/s)
------------------------------------------------------------
4 KB           1024        123.50         0.03
8 KB           2048        125.11         0.06
16 KB          4096        124.52         0.12
32 KB          8192        125.16         0.24
64 KB          16384       163.03         0.37
128 KB         32768       121.25         1.01
256 KB         65536       121.82         2.00
512 KB         131072      137.84         3.54
1 MB           262144      121.56         8.03
2 MB           524288      123.86         15.77
4 MB           1048576     130.98         29.82
8 MB           2097152     167.01         46.78
16 MB          4194304     278.03         56.20
32 MB          8388608     504.67         61.92
64 MB          16777216    872.91         71.60
128 MB         33554432    1523.83        82.03
256 MB         67108864    2981.99        83.84

The test will create a P2PCommunication with the cuda ipc backend, put it inside a HostIrEvaluator, then run it. The timer measuring the latency is std::chrono::high_resolution_clock.

wujingyue

LGTM otherwise

CMakeLists.txt

Priya2698

Overall, LGTM.

benchmarks/cpp/p2p_communication.cpp

wujingyue

LGTM. Defer approval to @Priya2698

CMakeLists.txt

benchmarks/cpp/p2p_communication.cpp

nsarka · 2025-09-05T22:24:24Z

!test

Priya2698

LGTM.
Can you add the SOL bandwidth expected for the results as a reference to the PR description?

nsarka requested a review from wujingyue September 2, 2025 13:27

nsarka force-pushed the nsarka/cuda-ipc-benchmark branch from 51a90f2 to 3fd9af2 Compare September 2, 2025 13:31

wujingyue reviewed Sep 2, 2025

View reviewed changes

CMakeLists.txt Show resolved Hide resolved

wujingyue requested a review from Priya2698 September 2, 2025 13:37

Priya2698 reviewed Sep 2, 2025

View reviewed changes

benchmarks/cpp/p2p_communication.cpp Outdated Show resolved Hide resolved

benchmarks/cpp/p2p_communication.cpp Outdated Show resolved Hide resolved

benchmarks/cpp/p2p_communication.cpp Outdated Show resolved Hide resolved

benchmarks/cpp/p2p_communication.cpp Outdated Show resolved Hide resolved

nsarka force-pushed the nsarka/cuda-ipc-benchmark branch from 2805c45 to 88b9974 Compare September 2, 2025 20:35

wujingyue reviewed Sep 3, 2025

View reviewed changes

CMakeLists.txt Outdated Show resolved Hide resolved

CMakeLists.txt Outdated Show resolved Hide resolved

nsarka force-pushed the nsarka/cuda-ipc-benchmark branch from 88b9974 to fa71128 Compare September 3, 2025 14:51

Priya2698 reviewed Sep 3, 2025

View reviewed changes

benchmarks/cpp/p2p_communication.cpp Outdated Show resolved Hide resolved

nsarka and others added 10 commits September 5, 2025 18:23

Add cuda ipc benchmarks

0d167e1

Update

b9f57dc

Add basic P2PCommunication benchmark

5fbdf7d

Add powers of two message sizes

1290aa8

Minor fixes

bdd11fb

Lint fix

05d11c6

Review

dba7649

Remove extras

53b6e7b

Remove makeContigTensor

15085f7

Fix other sizeof

874eef9

nsarka force-pushed the nsarka/cuda-ipc-benchmark branch from fbab35a to 874eef9 Compare September 5, 2025 22:23

nsarka requested a review from Priya2698 September 5, 2025 22:24

Priya2698 approved these changes Sep 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

P2PCommunication Benchmark over Cuda IPC #5102

P2PCommunication Benchmark over Cuda IPC #5102

nsarka commented Sep 2, 2025 •

edited

Loading

Uh oh!

wujingyue left a comment

Uh oh!

Uh oh!

Priya2698 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wujingyue left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nsarka commented Sep 5, 2025

Uh oh!

Priya2698 left a comment

Uh oh!

Uh oh!

P2PCommunication Benchmark over Cuda IPC #5102

Are you sure you want to change the base?

P2PCommunication Benchmark over Cuda IPC #5102

Conversation

nsarka commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wujingyue left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Priya2698 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wujingyue left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nsarka commented Sep 5, 2025

Uh oh!

Priya2698 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

nsarka commented Sep 2, 2025 •

edited

Loading