Block to cpu function changes of base_block #620

dkazanc · 2025-08-22T15:17:12Z

Fixes #619

An additional note: The data input to rescale_to_int (CPU) needs to be C-contiguous and one approach would be to do the following using Numpy's API instead of the CuPy's API as before in here:

   if not gpu_enabled or xp.get_array_module(self.data).__name__ == "numpy":
        self._data = np.asarray(self.data, order="C")
        return

However, I decided to add conversion to C-contiguous on the method side as this is the requirement of the method, mostly due to the C-wrapped code we use. Also when the data on the CPU is converted to C-contiguous by the framework it is counted by the montior as a GPU transfer, which is confusing when plotting the times.

However, it would be interesting to know why the data becomes non C-contiguous when it is written in the sink or read by the source. It is a chance to remove unnecessary data copy with np.asarray(data, order="C")

Checklist

I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
I have made corresponding changes to the documentation

yousefmoazzam · 2025-08-28T13:46:58Z

I'm curious, is the order="C" that's passed to np.empty() making any difference to whatever behaviour was observed?

We use numpy v1.26.x from what I see in the CI, and that version's docs for np.empty say that order has a default value of "C", so naively I would have thought that adding in order="C" doesn't do anything different?

yousefmoazzam · 2025-08-28T13:47:18Z

However, it would be interesting to know why the data becomes non C-contiguous when it is written in the sink or read by the source.

If you're talking about the numpy array inside a block that is either:

read from a source
written to a sink

I would imagine that, in general, those numpy arrays are not C-contiguous.

This is because numpy arrays inside blocks that are read from a source or written to a sink originate from slicing the original numpy array that represents the chunk associated with a process for a given section, and generating a numpy array via slicing another numpy array doesn't produce a C-contiguous numpy array in general.

Only under certain circumstances will slicing produce a C-contiguous numpy array, due to how slicing may or may not produce a view of the original data that is still C-contiguous (in particular, whether the elements within the view are stored contiguously in row-major order or not):

>>> import numpy as np
>>> arr = np.empty((5, 10, 10))
>>> arr.flags.c_contiguous
True
>>> first_dim_slice = arr[1:3, :, :]
>>> first_dim_slice.flags.c_contiguous
True
>>> second_dim_slice = arr[:, 2:4, :]
>>> second_dim_slice.flags.c_contiguous
False

So, my naive assumption would be that numpy arrays representing chunks are C-contiguous (due to being created by np.empty() and copying relevant data into them), but numpy arrays representing blocks in general are not C-contiguous.

As a reference, numpy docs here mention how slicing a numpy array often produces a "view" of the original numpy array.

dkazanc · 2025-09-01T08:24:13Z

Thanks for the clarification @yousefmoazzam . You're, indeed, right about default: ‘C’for the np.empty function and I see what you mean about the non C-contiguous blocks.
As I mentioned, we're converting non C-contiguous on the function side, when it is needed, so essentially no changes needed for the framework. I guess the main change here is to prevent CuPy API not to be applied to numpy arrays.
The last bit is a little change for monitor where do not need to track time for D2H/H2D for CPU methods.

yousefmoazzam

Apologies, I forgot about this PR!

Ok, it sounds like adding order="C" to the np.empty() calls indeed makes no difference, so could those be removed please.

dkazanc · 2025-09-12T08:44:34Z

OK to merge @yousefmoazzam ?

dkazanc added 7 commits August 22, 2025 15:09

additional check if numpy array before executing CuPy transfer

c0bf959

changes to monitor

e6080b0

making numpy array C-contiguous

0bab80e

making numpy array C-contiguous2

64cf3e2

making numpy array C-contiguous3

05de71f

making numpy array C-contiguous4

5e492bc

making numpy array C-contiguous5

e87581d

Merge branch 'main' into block_to_cpu

4762507

yousefmoazzam requested changes Sep 8, 2025

View reviewed changes

dkazanc added 2 commits September 12, 2025 09:42

removing C order and fixing tests

4828f44

Merge branch 'main' into block_to_cpu

a1f4f4e

yousefmoazzam approved these changes Sep 12, 2025

View reviewed changes

dkazanc merged commit c7ed01c into main Sep 12, 2025
6 checks passed

dkazanc deleted the block_to_cpu branch September 12, 2025 09:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Block to cpu function changes of base_block #620

Block to cpu function changes of base_block #620

Uh oh!

dkazanc commented Aug 22, 2025 •

edited

Loading

Uh oh!

yousefmoazzam commented Aug 28, 2025

Uh oh!

yousefmoazzam commented Aug 28, 2025

Uh oh!

dkazanc commented Sep 1, 2025

Uh oh!

yousefmoazzam left a comment

Uh oh!

dkazanc commented Sep 12, 2025

Uh oh!

Uh oh!

Uh oh!

Block to cpu function changes of base_block #620

Block to cpu function changes of base_block #620

Uh oh!

Conversation

dkazanc commented Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Uh oh!

yousefmoazzam commented Aug 28, 2025

Uh oh!

yousefmoazzam commented Aug 28, 2025

Uh oh!

dkazanc commented Sep 1, 2025

Uh oh!

yousefmoazzam left a comment

Choose a reason for hiding this comment

Uh oh!

dkazanc commented Sep 12, 2025

Uh oh!

Uh oh!

Uh oh!

dkazanc commented Aug 22, 2025 •

edited

Loading