(feat) Utvd optimization #2427

Manangka · 2025-08-01T09:29:41Z

This PR adds several optimizations to improve performance of the UTVD algorithm.

After comparing the new UTVD scheme with the existing TVD scheme it turned out that the performance of the UTVD can be twice as bad as the TVD scheme. After profiling the code using gprof and perf it turned out that the exact same computations of several methods are happening multiple times Those methods are.

gradients
local extrema
node distances

They are computed each time when the flux through a connection is computed. However this isn't nesscary. For instance the node_distance only need to be computed once during the initialization phase and then cached.

For the gradient and local extrema the computation need to happen once every outer iteration. And it can be done in one go for all cells, instead for evaluating it for every connection.

For this an invalidate method has been added to the IInterpolationSchemeInterface which is called every time the adv_fc is called. On invalidation the gradients and local extrema are recomputed.

The performance has been tested using the (gwt-adv-schemes) using a fixed timestep such that all models have the same amount of total time steps

Structured

	Block	Step	Sin2
TVD	8170.00 ms	8527.58 ms	9261.64 ms
UTVD	21615.25 ms	21376.20 ms	20762.04 ms
UTVD-Optimized	10197.65 ms	10571.01 ms	10452.84 ms

Triangle

	Block	Step	Sin2
TVD	12550.05 ms	12802.96 ms	10638.20 ms
UTVD	21527.24 ms	19526.76 ms	13936.71 ms
UTVD-Optimized	13015.30 ms	13352.33 ms	11749.55 ms

Voronoi

	Block	Step	Sin2
TVD	28370.61 ms	30232.90 ms	27591.43 ms
UTVD	37455.10 ms	40452.87 ms	34365.99 ms
UTVD-Optimized	21431.70 ms	20462.69 ms	19549.89 ms

Checklist of items for pull request

For additional information see instructions for contributing and instructions for developing.

# Conflicts: # make/makefile # msvs/mf6core.vfproj # src/Model/TransportModel/InterpolationScheme/AdvSchemeEnum.f90 # src/Model/TransportModel/InterpolationScheme/CentralDifferenceScheme.f90 # src/Model/TransportModel/InterpolationScheme/TVDScheme.f90 # src/Model/TransportModel/tsp-adv.f90 # src/Utilities/LinearAlgebraUtils.f90 # src/meson.build

…changes

…d of the G matrix

# Conflicts: # make/makefile

# Conflicts: # make/makefile # src/Model/TransportModel/tsp-adv.f90

# Conflicts: # src/Model/TransportModel/InterpolationScheme/UTVDScheme.f90

aprovost-usgs

It looks like some simplification of the new code is possible

aprovost-usgs · 2025-08-13T15:09:29Z

src/Model/TransportModel/tsp-adv.f90

    type(CoefficientsType) :: coefficients

    ! Calculate internal domain fluxes and add to matrix_sln and rhs.
+    call this%face_interpolation%invalidate()


This was introduced as part of a strategy to calculate the gradients and extrema for all the cells in one go, rather than every time face_interpolation%compute is called for a cell-cell connection. This is currently accomplished by (1) invalidating the cache before entering the connections loop, (2) on the first call to face_interpolation%compute, calculating and caching all the gradients and extrema and revalidating the cache so that (3) on subsequent calls to face_interpolation%compute the cached gradients and extrema can be used (not recalculated).

While there's nothing wrong with this logic, it might be more straightforward to move the calculation of the gradients and extrema out of face_interpolation%compute, i.e., simply call face_interpolation%compute_gradients and face_interpolation%compute_local_extrema (here, in adv_fc) before entering the connections loop. That would eliminate the need for face_interpolation%invalidate and the cache_valid flag and would separate the cell-by-cell calculations from the connection-by-connection calculations. To underscore its more specific role, face_interpolation%compute could be renamed face_interpolation%compute_coefficients.

If you keep the current strategy based on the cache_valid flag, consider either introducing a face_interpolation%validate routine to complement the invalidate routine, or (my preference) eliminating the invalidate routine and setting this%cache_valid=.false. directly in adv_fc and adv_cq, similarly to how you set this%cache_valid=.true. directly in the compute routine.

aprovost-usgs · 2025-08-13T15:12:39Z

src/Model/TransportModel/tsp-adv.f90

    !    rate and has dimensions of L^/T.
    nodes = this%dis%nodes

+    call this%face_interpolation%invalidate()


Please see my comment in adv_fc above. The same approach would be used here.

aprovost-usgs · 2025-08-19T16:05:33Z

src/Model/TransportModel/InterpolationScheme/UTVDScheme.f90

+    real(DP), intent(in), dimension(:), pointer :: phi
+
+    this%phi => phi
+    call this%gradient%set_field(phi)


UTVD is currently the only one of the four interpolation schemes that requires a gradient calculation. As this calculation is expensive, performance has been improved by doing it only when necessary and using cached gradient values otherwise. Recalculation of the gradient must be done before entering the connections loop in adv_fc and adv_cq. Once inside the loop, cached values can be used.

When UTVD is used, recalculation of the gradient is performed during a call (from adv_fc or adv_cq) to the set_field procedure of the UTVD scheme, which in turn calls the set_field procedure of the gradient object. In the case of UTVD, the gradient object is wrapped in a decorator (CachedGradientType) that adds a caching capability. The set_field procedure of the "decorated" gradient object includes a recalculation of all the cell gradient values, which it caches in an array. The get procedure of the decorated gradient object, which is used in UTVD's compute procedure, then simply extracts the required values from the cache.

If the decorator is anticipated to be useful for future coding applications in MF6, this could be an elegant way to provide a caching capability. If not, might it be preferable in the current application to forgo the decorator and have the cached_gradients array (like the cached_node_distance array) reside in UTVD, which could do the recalculation and caching its set_field procedure? (And UTVD's compute procedure would directly draw values from the cache.) In the end, this should be equivalent to what's happening now, but without the decorator as an intermediary. I realize you might have considered this and decided to go with the decorator because of its potential applicability in other contexts.

If the decorator were to be eliminated, could the gradient object reside only in UTVD and be created in that type's constuctor, rather than in the TspAdv constuctor? UTVD would be the only module that needs it -- TspAdv currently needs it only to create it as a LeastSquaresGradient, move it to a CachedGradient, and pass it to UTVD.

@aprovost-usgs I considered both cases you mentioned. I first thought about moving the gradient creation and use entirely into the UTVD class. As you said it is only used there so it makes sense it would reside there.

However i think there is much in potential in having the gradient available in other places/packages. Having it in the tsp-adv file makes it easier for others to find and use.

There are 2 use cases in have in mind for which the gradient is needed:

Determining the gradient at the cell boundary. In the paper of Darwish there is an equation in which he determines the gradient at a cell face by use cell gradients of the connected cells

Implementing other limiters like the Barth-Jespersen limiter. They make use of gradients as well

Manangka added 19 commits June 23, 2025 12:46

Refactor tsp-adv. Move different interpolation scheme to separate files

a645b9b

Add the SVD-algorithm and the pseudoinverse method

e4f8898

Add the new unstructured TVD limiter

dfeb932

Fix merge

1306626

Add UTVD tests to existing unit tests. Fix correct stencil size on ex…

fc0aa31

…changes

Apply review comments

bf2dcc7

Fix lint error

bdbffa9

Apply review comments

49612ab

Apply review comment. Use pinv directly on the distance matrix omstea…

aeb5f72

…d of the G matrix

Merge branch 'develop' into unstructured_fluxlimiter

2b861a7

# Conflicts: # make/makefile

Merge branch 'develop' into unstructured_fluxlimiter

5f85da7

# Conflicts: # make/makefile # src/Model/TransportModel/tsp-adv.f90

Optimize the limiter by caching several values

92028a6

Clean up

fa0bba0

Merge branch 'develop' into utvd-optimization

18abd93

# Conflicts: # src/Model/TransportModel/InterpolationScheme/UTVDScheme.f90

Fix deconstructor call

85e752e

Move some function around

b3cd26c

Rename variable to better match documentation nomenclature

8f28300

Merge branch 'develop' into utvd-optimization

ec4cc2b

Manangka marked this pull request as ready for review August 11, 2025 11:53

Manangka requested review from aprovost-usgs and mjr-deltares August 11, 2025 11:53

aprovost-usgs reviewed Aug 13, 2025

View reviewed changes

Apply review comments. Make the flow more explicit

c2c0b1a

aprovost-usgs reviewed Aug 19, 2025

View reviewed changes

Manangka added 2 commits August 27, 2025 10:43

Merge branch 'develop' into utvd-optimization

6ccb97f

Merge branch 'develop' into utvd-optimization

6e0fc02

Manangka merged commit 9959bfd into MODFLOW-ORG:develop Sep 1, 2025
20 checks passed

wpbonelli added this to the 6.7.0 milestone Sep 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

(feat) Utvd optimization #2427

(feat) Utvd optimization #2427

Uh oh!

Manangka commented Aug 1, 2025 •

edited

Loading

Uh oh!

aprovost-usgs left a comment

Uh oh!

aprovost-usgs Aug 13, 2025 •

edited

Loading

Uh oh!

aprovost-usgs Aug 13, 2025

Uh oh!

aprovost-usgs Aug 19, 2025

Uh oh!

Manangka Aug 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

(feat) Utvd optimization #2427

(feat) Utvd optimization #2427

Uh oh!

Conversation

Manangka commented Aug 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aprovost-usgs left a comment

Choose a reason for hiding this comment

Uh oh!

aprovost-usgs Aug 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

aprovost-usgs Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

aprovost-usgs Aug 19, 2025

Choose a reason for hiding this comment

Uh oh!

Manangka Aug 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Manangka commented Aug 1, 2025 •

edited

Loading

aprovost-usgs Aug 13, 2025 •

edited

Loading