
Conversation

@shntu (Contributor) commented on Sep 30, 2025

Description

I am currently testing this changeset and noticed a significant slowdown when loading large instance segmentation masks into memory: currently, it is not possible to access a prediction's class or confidence without also loading its entire mask. By using a generator, this change reduces memory usage by allowing the classes and confidences of predictions to be read without materializing every mask.
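A minimal sketch of the generator approach, assuming hypothetical names (`iter_predictions`, and the `pred[6]` class-index layout taken from the diff context below) rather than the repository's actual API:

```python
from typing import Iterator, Optional, Sequence, Tuple

# Sketch only: function name, argument layout, and the pred[6] class index
# are assumptions based on the diff context, not the repository's real API.
def iter_predictions(
    batch_predictions: Sequence,
    batch_masks: Sequence,
    class_names: Sequence[str],
    class_filter: Optional[set] = None,
) -> Iterator[Tuple[object, object]]:
    """Lazily yield (prediction, mask) pairs instead of building a list.

    A consumer that only reads each prediction's class or confidence can
    stop early, or skip masks entirely, without every mask having been
    processed up front.
    """
    for pred, mask in zip(batch_predictions, batch_masks):
        if class_filter and class_names[int(pred[6])] not in class_filter:
            continue
        yield pred, mask
```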

Type of change


  • Bug fix (non-breaking change which fixes an issue)

How has this change been tested? Please provide a testcase or example of how you tested the change.

Currently profiling the change to see whether it improves performance on edge devices (Jetson Orin 16GB).

Any specific deployment considerations

This changes the predictions container from a list to a generator, but it should not change any public APIs or dependencies; see the illustration below.
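As a rough illustration of what the list-to-generator swap does and does not affect (the consumer code below is hypothetical, not from the repository):

```python
def gen():
    # Stand-in for the new generator of (prediction, mask) pairs.
    yield from [("pred_a", "mask_a"), ("pred_b", "mask_b")]

# Plain iteration is unaffected by the swap:
for pred, mask in gen():
    print(pred)

# List-only operations, however, no longer work:
try:
    gen()[0]                # generators are not subscriptable
except TypeError as exc:
    print(exc)

# And unlike a list, a generator is exhausted after one pass:
predictions = gen()
print(len(list(predictions)))   # 2
print(len(list(predictions)))   # 0: already consumed
```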

Docs

N/A

```python
predictions = []
for pred, mask in zip(batch_predictions, batch_masks):
    if class_filter and self.class_names[int(pred[6])] not in class_filter:
        # TODO: logger.debug
```
@shntu (Contributor, Author) commented:

Would a debug log be necessary? This PR deletes the TODO since it didn't seem especially helpful, but I can add logging to the generator if needed.
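If logging does turn out to be useful, one possible shape for it inside the generator (illustrative only; the logger name, function name, and surrounding structure are assumptions):

```python
import logging

logger = logging.getLogger(__name__)

def iter_filtered(batch_predictions, batch_masks, class_names, class_filter=None):
    # Illustrative placement of the debug log the deleted TODO referred to.
    for pred, mask in zip(batch_predictions, batch_masks):
        label = class_names[int(pred[6])]
        if class_filter and label not in class_filter:
            logger.debug("Skipping prediction with filtered class %s", label)
            continue
        yield pred, mask
```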

```diff
 responses = [
     InstanceSegmentationInferenceResponse(
-        predictions=[
+        predictions=(
```
A reviewer (Contributor) commented:

Isn't it converted to a list before rendering? I'm not sure this helps with memory usage.

@shntu (Contributor, Author) replied on Oct 2, 2025:

Currently, there are a handful of enterprise users who import InferencePipeline and run it without actually using any of the HTTP responses.

I'm closing this PR because it does not actually solve the problem: memory usage during instance segmentation execution is much higher overall, and the memory consumed by these masks is only a small fraction of it. Actually reducing memory pressure on the Jetson would require a much larger change.

@shntu closed this on Oct 2, 2025
@shntu deleted the sb/generate-seg-masks branch on October 2, 2025 at 14:33