
Conversation

@SoilRos (Member) commented Jun 13, 2025

This is to conform with the Dune interface.

Note that this is only possible since the introduction of shared pointers into the geometry. On the other hand, because copying a shared pointer can cause performance problems, I also propose an alternative, #884, and compare the two with benchmarks.
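For illustration, a minimal sketch of the pattern this PR moves to (hypothetical, simplified types, not the actual opm-grid classes): the Dune grid interface expects `geometry()` to return the Geometry object by value, and because the geometry data sits behind a shared pointer, the returned copy is shallow.

```cpp
#include <memory>

// Hypothetical, simplified sketch of the interface shape; the real
// opm-grid types carry far more state.
struct GeometryImpl { double volume = 1.0; };

class Geometry {
public:
    explicit Geometry(std::shared_ptr<const GeometryImpl> impl)
        : impl_(std::move(impl)) {}
    double volume() const { return impl_->volume; }
private:
    // Copying the Geometry copies this shared_ptr, which is what makes
    // return-by-value cheap to implement but not entirely free.
    std::shared_ptr<const GeometryImpl> impl_;
};

class Entity {
public:
    Entity() : impl_(std::make_shared<GeometryImpl>()) {}
    // Return by value, as the Dune interface prescribes; the copy is
    // shallow because the data lives behind the shared_ptr.
    Geometry geometry() const { return Geometry(impl_); }
private:
    std::shared_ptr<GeometryImpl> impl_;
};
```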

@aritorto added the `manual:irrelevant` label (This PR is a minor fix and should not appear in the manual) on Jun 13, 2025
@SoilRos (Member, Author) commented Jun 13, 2025

jenkins build this serial please

@akva2 (Member) commented Jun 13, 2025

jenkins build this serial please

@atgeirr (Member) commented Jun 13, 2025

No matter which alternative we end up preferring, we should benchmark to make sure performance does not suffer. I do not expect anything major though.

@SoilRos force-pushed the bugfix/return-geometry-by-value-1 branch 2 times, most recently from a80cfd9 to 66c86e5 on June 26, 2025 at 11:48
@SoilRos (Member, Author) commented Jun 26, 2025

jenkins build this serial please

@SoilRos (Member, Author) commented Jun 26, 2025

jenkins build this serial please

@SoilRos force-pushed the bugfix/return-geometry-by-value-1 branch from 66c86e5 to 1e82b5e on June 26, 2025 at 17:31
@SoilRos (Member, Author) commented Jun 26, 2025

jenkins build this serial please

@SoilRos force-pushed the bugfix/return-geometry-by-value-1 branch from 1e82b5e to 2ca87d2 on June 27, 2025 at 08:35
@SoilRos (Member, Author) commented Jun 27, 2025

jenkins build this serial please

@SoilRos force-pushed the bugfix/return-geometry-by-value-1 branch 2 times, most recently from b8b09d2 to b3e4cf2 on July 4, 2025 at 10:25
@SoilRos (Member, Author) commented Jul 4, 2025

jenkins build this serial please

@blattms (Member) commented Jul 9, 2025

benchmark please

@blattms (Member) commented Jul 11, 2025

The benchmark results did not get posted back here; I am on it.
The gist of the benchmark is that there is often a small increase in solution time, usually less than 1%. There is one outlier though: Drogon with 8 processes, where the increase is 4%. I have no clue why; we'll investigate that.

@SoilRos (Member, Author) commented Jul 11, 2025

[...] There is one outlier though: Drogon with 8 processes, where the increase is 4%. I have no clue why; we'll investigate that.

This is not really surprising. Copying a shared pointer involves an atomic increment of the reference counter. Even when only one processor is involved, this can take quite a number of cycles. That is usually manageable while the code is sequential, but as soon as the code becomes parallel all of these increments have to compete to lock the bus that communicates with the other processors. So even if the pointer never actually has to be shared with other processors, the overhead of performing this operation very often becomes quite noticeable. This is one of the most common pitfalls of using shared pointers.
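The effect described above can be sketched as follows (hypothetical helper functions, not code from this PR): every copy of a `std::shared_ptr` performs an atomic increment on the control block, and every destruction an atomic decrement, whereas copying a raw pointer is a plain register move with no synchronization at all.

```cpp
#include <memory>

// Copy the shared_ptr on every iteration: each copy does an atomic
// increment, and each destruction at end of scope an atomic decrement.
// Under parallel execution these atomics contend on the cache line
// holding the reference count, even if the pointer is never actually
// shared between threads.
long copy_shared(const std::shared_ptr<int>& p, int n) {
    long sum = 0;
    for (int i = 0; i < n; ++i) {
        std::shared_ptr<int> copy = p;  // atomic increment (and later decrement)
        sum += *copy;
    }
    return sum;
}

// Same access pattern with a raw pointer: no atomics involved.
long copy_raw(const int* p, int n) {
    long sum = 0;
    for (int i = 0; i < n; ++i) {
        const int* copy = p;            // plain copy, no reference counting
        sum += *copy;
    }
    return sum;
}
```

Both functions compute the same result; the difference is purely in the reference-counting traffic the first one generates.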

@blattms (Member) commented Jul 14, 2025

benchmark please

@ytelses commented Jul 14, 2025

Benchmark result overview:

| Test | Configuration | Relative |
|------|---------------|----------|
| opm-git | OPM Benchmark: drogon - Threads: 1 | 0.829 |
| opm-git | OPM Benchmark: drogon - Threads: 8 | 0.838 |
| opm-git | OPM Benchmark: punqs3 - Threads: 1 | 1 |
| opm-git | OPM Benchmark: punqs3 - Threads: 8 | 0.976 |
| opm-git | OPM Benchmark: smeaheia - Threads: 1 | 0.857 |
| opm-git | OPM Benchmark: smeaheia - Threads: 8 | 0.965 |
| opm-git | OPM Benchmark: spe10_model_1 - Threads: 1 | 1.005 |
| opm-git | OPM Benchmark: spe10_model_1 - Threads: 8 | 0.995 |
| opm-git | OPM Benchmark: flow_mpi_extra - Threads: 1 | 0.884 |
| opm-git | OPM Benchmark: flow_mpi_extra - Threads: 8 | 0.968 |
| opm-git | OPM Benchmark: flow_mpi_norne - Threads: 1 | 1.001 |
| opm-git | OPM Benchmark: flow_mpi_norne - Threads: 8 | 1.01 |
| opm-git | OPM Benchmark: flow_mpi_norne_4c_msw - Threads: 1 - FOPT (Total Oil Production At End Of Run) | 1 |
| opm-git | OPM Benchmark: flow_mpi_norne_4c_msw - Threads: 8 - FOPT (Total Oil Production At End Of Run) | 0.999 |
| opm-git | OPM Benchmark: flow_mpi_norne_4c_msw - Threads: 8 | 0.958 |

• Speed-up = Total time master / Total time pull request. Above 1.0 is an improvement.

View result details @ https://www.ytelses.com/opm/?page=result&id=2822

@blattms (Member) commented Jul 15, 2025

Comparing alternatives 1, 2, and 3:

| Configuration | Relative (alt 1, #885) | Relative (alt 2, #884) | Relative (alt 3, #891) |
|---------------|------------------------|------------------------|------------------------|
| OPM Benchmark: drogon - Threads: 1 | 0.829 | 1.007 | 1.004 |
| OPM Benchmark: drogon - Threads: 8 | 0.838 | 0.899 | 0.905 |
| OPM Benchmark: punqs3 - Threads: 1 | 1 | 1.002 | 0.991 |
| OPM Benchmark: punqs3 - Threads: 8 | 0.976 | 0.991 | 0.995 |
| OPM Benchmark: smeaheia - Threads: 1 | 0.857 | 1.006 | 0.895 |
| OPM Benchmark: smeaheia - Threads: 8 | 0.965 | 0.922 | 0.997 |
| OPM Benchmark: spe10_model_1 - Threads: 1 | 1.005 | 1.004 | 0.992 |
| OPM Benchmark: spe10_model_1 - Threads: 8 | 0.995 | 0.991 | 0.995 |
| OPM Benchmark: flow_mpi_extra - Threads: 1 | 0.884 | 1 | 0.948 |
| OPM Benchmark: flow_mpi_extra - Threads: 8 | 0.968 | 0.898 | 1.026 |
| OPM Benchmark: flow_mpi_norne - Threads: 1 | 1.001 | 0.998 | 1.002 |
| OPM Benchmark: flow_mpi_norne - Threads: 8 | 1.01 | 0.923 | 1.007 |
| OPM Benchmark: flow_mpi_norne_4c_msw - Threads: 1 - FOPT (Total Oil Production At End Of Run) | 1 | 1 | 1 |
| OPM Benchmark: flow_mpi_norne_4c_msw - Threads: 8 - FOPT (Total Oil Production At End Of Run) | 0.999 | 1 | 1 |
| OPM Benchmark: flow_mpi_norne_4c_msw - Threads: 8 | 0.958 | | |

• Speed-up = Total time master / Total time pull request. Above 1.0 is an improvement.

View result details for alt 1 @ https://www.ytelses.com/opm/?page=result&id=2822

View result details for alt 2 @ https://www.ytelses.com/opm/?page=result&id=2823

View result details for alt 3 @ https://www.ytelses.com/opm/?page=result&id=2824

Note that #891 needs additional fixes from #893

@SoilRos (Member, Author) commented Jul 22, 2025

I am somewhat confused by those results: a 17% difference seems far too large for these changes. Out of curiosity, are those benchmarks compiled with Link Time Optimization (LTO)? Since the code in those methods is rather small, it could be that we are mostly measuring how long it takes to jump to the function when LTO is not enabled.
Also, are those pure simulation times, or do they include the setup time? I am seeing a very large and odd slowdown in the setup of alternative 2 and I don't understand why.

Assuming those reports are with LTO and compare simulation times, I get these values locally:

| Configuration | Relative (alt 1, #885) | Relative (alt 2, #884) | Relative (alt 3, #891) |
|---------------|------------------------|------------------------|------------------------|
| OPM Benchmark: drogon - Threads: 1 | 0.998 | 0.988 | 0.985 |
| OPM Benchmark: drogon - Threads: 8 | 1 | 0.985 | 0.972 |

If those are not done with LTO, I would suggest enabling it at least for the benchmark and production environments. With LTO, the speed-up in drogon is about 8% with any of the alternatives.
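For reference, a generic way to switch on LTO in a CMake-based build (standard CMake and GCC/Clang options, not specific to the OPM build scripts):

```shell
# Enable interprocedural optimization (LTO) via the portable CMake option:
cmake -DCMAKE_BUILD_TYPE=Release \
      -DCMAKE_INTERPROCEDURAL_OPTIMIZATION=ON ..

# Equivalently, pass the flags directly to GCC/Clang:
#   CXXFLAGS="-O3 -flto" LDFLAGS="-flto" cmake ..
```

With LTO the compiler can inline small accessor methods such as `geometry()` across translation units, which removes much of the call overhead that these benchmarks may otherwise be measuring.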

@SoilRos (Member, Author) commented Jul 31, 2025

jenkins build this serial please

This is to conform with the dune interface
@SoilRos force-pushed the bugfix/return-geometry-by-value-1 branch from a771513 to 5dedf94 on August 1, 2025 at 10:39
@SoilRos (Member, Author) commented Aug 1, 2025

jenkins build this serial please
