How to better apply different LJ potential energies to each pair of particles？ #2037

Boran-Zhao · 2025-04-03T05:17:37Z

Boran-Zhao
Apr 3, 2025

Dear all
I am using hoomd to simulate the protein Go model, in which each particle is CA bead but has different epsilon and sigma of LJ parameters. So, If I use hoomd.md.pair.LJ, I have to set each particle to a different type. For example, I have 526 CA beads in my simulation system, I set 526 types. When I run my simulation, a warning is occured:
_Warning: Systems with many particle types perform poorly or result in shared memory errors on the GPU._*

Does this warning have any impact on the actual simulation?

Also, my simulation runs very slowly. I used an NVIDIA GeForce RTX 4090 with 4 CPU cores to run a simulation (526 particles, with Yukawa and OPP potentials) and it only ran for 5,000,000 steps in 12 hours. Is this normal in your opinion?

I look forward to your help！

Answered by joaander

Apr 3, 2025

Your simulation will run more efficiently (and possibly even faster) on a single CPU core. Simulations generally need at least 10,000 particles before GPU throughput breaks even with a CPU (~64 cores).

https://github.com/glotzerlab/hoomd-benchmarks has scripts to evaluate this. The output is in steps per second. For comparison, your simulation runs at 5e6/(12*3600) = 115 steps per second.

The benchmark command is: python3 -m hoomd_benchmarks.md_pair_lj -N {N} --device {DEVICE} --repeat 20 --n_types {N_types}

Device	N	N_types	TPS
CPU	526	1	1482
CPU	526	526	1037
GPU	526	1	14226
GPU	526	526	14070
GPU	10000	1	11069
GPU	1000	526	8878

I ran this on an A100 GPU. Your RTX 4090…

View full answer

joaander · 2025-04-03T14:05:26Z

joaander
Apr 3, 2025
Maintainer

Your simulation will run more efficiently (and possibly even faster) on a single CPU core. Simulations generally need at least 10,000 particles before GPU throughput breaks even with a CPU (~64 cores).

https://github.com/glotzerlab/hoomd-benchmarks has scripts to evaluate this. The output is in steps per second. For comparison, your simulation runs at 5e6/(12*3600) = 115 steps per second.

The benchmark command is: python3 -m hoomd_benchmarks.md_pair_lj -N {N} --device {DEVICE} --repeat 20 --n_types {N_types}

Device	N	N_types	TPS
CPU	526	1	1482
CPU	526	526	1037
GPU	526	1	14226
GPU	526	526	14070
GPU	10000	1	11069
GPU	1000	526	8878

I ran this on an A100 GPU. Your RTX 4090 is a single precision optimized card and therefore will run double precision calculations very slowly. You should run production HOOMD-blue simulations on an institutional or national cluster with double precision GPUs (currently V100, A100, or H100).

The last two lines answer your original question of how much increasing the number of types costs on the GPU. The pair force calculations in HOOMD-blue are optimized for systems with a large number of particles and a small number of types. If you want to increase performance further, you will need to write custom kernels (CPU and/or GPU) that implement an optimized data structure for the N^2 type parameters.

1 reply

Boran-Zhao Apr 4, 2025
Author

Thank you for the detailed response! I really appreciate your patience!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to better apply different LJ potential energies to each pair of particles？ #2037

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

How to better apply different LJ potential energies to each pair of particles？ #2037

Uh oh!

Boran-Zhao Apr 3, 2025

Replies: 1 comment · 1 reply

Uh oh!

joaander Apr 3, 2025 Maintainer

Uh oh!

Boran-Zhao Apr 4, 2025 Author

Boran-Zhao
Apr 3, 2025

Replies: 1 comment 1 reply

joaander
Apr 3, 2025
Maintainer

Boran-Zhao Apr 4, 2025
Author