Add Linux Aarch64 G3 runners to vLLM bms #93

ghost · 2025-10-09T14:06:37Z

PyTorch already has G3 runners (see https://hud.pytorch.org/runners/pytorch?search=m7g).
This patch lets us use them in the vLLM benchmark dashboard,
benchmarking the same model that is used to measure x86 performance.

This PR aims to fix vllm-project/vllm#26019

.github/workflows/vllm-benchmark.yml

huydhn · 2025-10-28T23:33:27Z

-          else
-            ON_CPU=0
-          fi
+          ON_ARM64_CPU=0


I'm not sure about this part. Setting DEVICE_NAME to arm64-cpu is fine, but I think we need to keep ON_CPU=1 here for both x86 and Arm. That's a flag from vLLM itself to run the benchmark on CPU: https://github.com/vllm-project/vllm/blob/94666612a938380cb643c1555ef9aa68b7ab1e53/docs/contributing/benchmarks.md?plain=1#L1177

ioghiban · 2025-11-04

Alright. I skipped setting ON_CPU for Arm because .github/scripts/run-sglang-performance-benchmarks.sh uses this env var and expects either NUMA nodes or GPUs, neither of which should be available on the existing Arm Neoverse runners. If we set ON_CPU to 1 in this case, I'd expect some warnings there.
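To make the NUMA concern concrete, here is a minimal sketch of how a script could count NUMA nodes before taking a NUMA-dependent path. It is not taken from either benchmark script: the function name count_numa_nodes is hypothetical, and the only assumption about the system is the standard Linux sysfs layout under /sys/devices/system/node.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the vLLM or pytorch-integration-testing scripts).
# Counts NUMA nodes by looking for nodeN directories in sysfs.
# The sysfs root can be overridden as the first argument, which makes the
# function easy to exercise against a fake directory tree.
count_numa_nodes() {
  local root="${1:-/sys/devices/system/node}"
  local count=0
  local d
  for d in "$root"/node[0-9]*; do
    # If the glob matches nothing, the literal pattern is tested and skipped.
    if [ -d "$d" ]; then
      count=$((count + 1))
    fi
  done
  echo "$count"
}
```

A caller could then skip NUMA-specific setup when the count is 0 instead of emitting warnings.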

Also, vllm/.buildkite/nightly-benchmarks/scripts/run-performance-benchmarks.sh checks these env vars and again expects NUMA nodes for CPUs, so the available Aarch64 runners need to be distinguished there as well; see this PR
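The distinction discussed in this thread could be sketched roughly as follows. This is an illustration only, not the code in the workflow: the function name select_cpu_flags is hypothetical, the x86 value DEVICE_NAME=cpu is an assumption, and the Arm branch follows the approach described in the replies (ON_CPU left off, a separate ON_ARM64_CPU flag) rather than the reviewer's suggestion of ON_CPU=1 for both.

```shell
#!/usr/bin/env bash
# Hypothetical helper: maps a machine architecture (as printed by `uname -m`)
# to the flags discussed in this thread. Values for the x86 branch are assumed.
select_cpu_flags() {
  local arch="${1:-$(uname -m)}"
  case "$arch" in
    aarch64|arm64)
      # Arm runners: no NUMA nodes or GPUs expected, so ON_CPU stays 0.
      echo "DEVICE_NAME=arm64-cpu ON_CPU=0 ON_ARM64_CPU=1"
      ;;
    x86_64)
      # x86 CPU runners keep the existing ON_CPU path.
      echo "DEVICE_NAME=cpu ON_CPU=1 ON_ARM64_CPU=0"
      ;;
    *)
      echo "unsupported architecture: $arch" >&2
      return 1
      ;;
  esac
}
```

Keeping the mapping in one function means the downstream scripts only need to branch on the flags, not on the architecture string itself.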

meta-cla · 2025-11-02T21:11:38Z

Hi @ghost!

Thank you for your pull request.

We require contributors to sign our Contributor License Agreement, and yours needs attention.

You currently have a record in our system, but the CLA is no longer valid, and will need to be resubmitted.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g. your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

meta-cla bot added the cla signed label Oct 9, 2025

cfRod requested a review from huydhn October 9, 2025 14:21

ghost mentioned this pull request Oct 9, 2025

Enable aarch64 CPU performance benchmarks vllm-project/vllm#26494

Open

4 tasks

ghost had a problem deploying to pytorch-x-vllm October 10, 2025 07:58 — with GitHub Actions Failure

huydhn reviewed Oct 10, 2025

.github/workflows/vllm-benchmark.yml (outdated)

ghost had a problem deploying to pytorch-x-vllm October 14, 2025 08:59 — with GitHub Actions Failure

huydhn reviewed Oct 28, 2025

ioghiban force-pushed the add-linux-aarch64-g3-runners branch 2 times, most recently from ca16a91 to 497b78b Compare November 4, 2025 14:15

Add Linux Aarch64 G3 runners to vLLM bms

ab9c822

ioghiban force-pushed the add-linux-aarch64-g3-runners branch from 497b78b to ab9c822 Compare November 4, 2025 14:43
