REP-6492 Switch to $sampleRate-style partitioning #128

FGasper · 2025-08-18T19:40:42Z

$sample-based partitioning has been problematic for some years now because it often creates highly imbalanced partitions.

This changeset switches partitioning to use $sampleRate instead. Because this entails a full index scan, it tends to be slower; we offset that by creating each partition task as soon as we receive a sampled partition boundary, rather than creating all tasks at once at the end of the aggregation.
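As a rough illustration of the approach described above, here is a minimal Python sketch, not the code in this changeset. The function names, the sizing rule for the sample rate, and the use of `None` as an open-ended bound are all assumptions made for the example. Only the `$sort`, `$match` with `$sampleRate`, and `$project` stages are real MongoDB aggregation syntax.

```python
from typing import Any, Iterable, Iterator, Optional, Tuple

# None stands in for an open-ended bound (an assumption of this sketch).
Bound = Optional[Any]


def sample_rate_pipeline(doc_count: int, desired_partitions: int) -> list:
    """Build an aggregation that samples _id values in index order."""
    # Hypothetical sizing rule: aim for about one sampled boundary per partition.
    rate = min(1.0, desired_partitions / max(doc_count, 1))
    return [
        {"$sort": {"_id": 1}},              # results come back in _id order
        {"$match": {"$sampleRate": rate}},  # keep each document with probability `rate`
        {"$project": {"_id": 1}},           # only the boundary value is needed
    ]


def partitions_from_boundaries(
    boundaries: Iterable[Any],
) -> Iterator[Tuple[Bound, Bound]]:
    """Yield a (lower, upper) partition as soon as each boundary arrives."""
    lower: Bound = None
    for upper in boundaries:
        yield (lower, upper)
        lower = upper
    # The final partition covers everything after the last sampled boundary.
    yield (lower, None)
```

With a real driver, the `boundaries` iterable would be the aggregation cursor itself, so a task can be enqueued for each yielded partition while the index scan is still running.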

Because MongoDB 4.2 lacked $sampleRate (and $rand as well), the legacy partitioning logic remains for use with that server version.
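The version fallback could be sketched as below. This is an illustrative assumption rather than the logic in this changeset: the function name, the returned labels, and the exact cutoff are hypothetical. The description only states that 4.2 lacks the operators; the 4.4.2 threshold is taken from the MongoDB documentation for `$sampleRate` and `$rand`.

```python
import re
from typing import Tuple

# Assumed cutoff: the first server version treated as supporting $sampleRate.
MIN_SAMPLE_RATE_VERSION: Tuple[int, int, int] = (4, 4, 2)


def parse_server_version(version: str) -> Tuple[int, int, int]:
    """Parse strings such as "4.2.24" or "8.0.0-rc1" into a comparable tuple."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", version)
    if match is None:
        raise ValueError(f"unrecognized server version: {version!r}")
    major, minor, patch = match.groups()
    return (int(major), int(minor), int(patch or 0))


def choose_partitioner(version: str) -> str:
    """Return a hypothetical label for the partitioner to use."""
    if parse_server_version(version) >= MIN_SAMPLE_RATE_VERSION:
        return "sample_rate"
    return "legacy"
```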

Both legacy and $sampleRate partitioning now use the "available" read concern and the secondaryPreferred read preference. These aggregations don’t need consistency, but they benefit substantially from the added speed and from minimizing load on the primary.
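One way to picture these settings is via the standard connection string options `readConcernLevel` and `readPreference`. The helper below is a hypothetical sketch using only the Python standard library; the changeset itself presumably sets these options on the aggregation through the driver rather than by rewriting a URI.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_partitioning_read_options(uri: str) -> str:
    """Return `uri` with the read options described above applied."""
    parts = urlsplit(uri)
    options = dict(parse_qsl(parts.query))
    # No consistency guarantee is needed for sampling partition boundaries.
    options["readConcernLevel"] = "available"
    # Prefer secondaries to keep the scan off the primary.
    options["readPreference"] = "secondaryPreferred"
    # Ensure a "/" separates the hosts from the options.
    path = parts.path or "/"
    return urlunsplit(parts._replace(path=path, query=urlencode(options)))
```

For example, `with_partitioning_read_options("mongodb://localhost:27017")` keeps the host and adds both options after a `/?` separator.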

A few simplifications are made here as well. For example, MongosyncID is removed from the PartitionKey struct since it’s never actually relevant, and certain parameters to the legacy partitioner are made constant (since every caller passed the same values).

FGasper added 6 commits August 20, 2025 13:13

axe “Replicator”

edb1f7e

remove unneeded params

e7d1881

depends on $type fix

9a5d480

log

a6f58d8

use avail/near

fdd3ce3

dedupe

ab5b493

FGasper force-pushed the REP-6492-samplerate-partition branch from 180b79a to ab5b493 Compare August 20, 2025 17:13

FGasper added 5 commits August 20, 2025 15:20

add retry for partitioning

2eb47a9

add UUID to partitions

84d3e00

no MongosyncID

8f456e1

2ndary preferred

8bc0c1d

fix

efdc93c

FGasper requested a review from tdq45gj August 20, 2025 19:40

FGasper marked this pull request as ready for review August 20, 2025 19:40
