Traces & Metrics: Scan Plan #1502
I'm new to reviewing, but the code itself mostly looks fine. My main concern is around potential span bloat: the "eval"-level trace and the per-manifest-level traces seem like they could easily blow out what most systems have configured for trace depth (e.g. maybe these should be debug only)?
As mentioned in the description, the "eval" spans are actually at TRACE level, one level below DEBUG.
Sorry I missed this. I still struggle to see how this level of detail would actually be useful (vs. something like an aggregate counter); do you have a use case in mind? But as long as TRACE level adds no overhead when not enabled, it seems fine. If this level was already agreed upon in the sync, then please ignore the comment.
Which issue does this PR close?
What changes are included in this PR?
Only commit 2a02e55 is relevant; I branched off #482, which is unmerged, so the commit for that PR shows up in here too for now.
Implements an initial set of traces and metrics on the table scan plan phase.
The approach is guided by prior discussions on #482 and in the monthly Iceberg-rust community sync a few months ago.
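As a rough illustration of how metrics-style counts can be surfaced through the `tracing` ecosystem, an event with numeric fields can be emitted for a metrics/aggregation layer to pick up. This is a sketch only; the target and field names below are illustrative, not necessarily what this PR uses.

```rust
use tracing::debug;

// Sketch: emit scan-plan counts as fields on a tracing event so that a
// metrics layer (or log aggregation) can consume them. Names are illustrative.
fn record_scan_plan_metrics(skipped_manifests: u64, result_data_files: u64) {
    debug!(
        target: "iceberg::scan::metrics",
        skipped_manifests,
        result_data_files,
        "scan plan finished"
    );
}
```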
Are these changes tested?
An integration test has been added that also functions as a mini example. Jaeger has been added to the integration test docker compose stack, and the test exports traces to it, which can be viewed by browsing to Jaeger. If you're running the integration tests, setting the env var `ICEBERG_INTEG_TEST_PERSISTENT_DOCKER_STACK=1` will ensure that the docker stack is kept up between test runs so that you can view traces on the Jaeger container by browsing to http://localhost:16686.

Configuration and Example Output
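Span verbosity is controlled via the standard `RUST_LOG`-style filtering. A minimal subscriber setup that honours `RUST_LOG` looks roughly like the sketch below; the integration test additionally exports spans to Jaeger via OpenTelemetry, which is omitted here because the exporter builder API varies between opentelemetry crate versions.

```rust
use tracing_subscriber::EnvFilter;

// Minimal sketch: a fmt subscriber filtered by RUST_LOG
// (e.g. RUST_LOG=iceberg=debug). The OTLP/Jaeger export layer used by the
// integration test is intentionally omitted.
fn init_tracing() {
    tracing_subscriber::fmt()
        .with_env_filter(EnvFilter::from_default_env())
        .init();
}
```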
info level traces

Here's an example of how the traces look with `RUST_LOG` specifying `iceberg=info`. The aim at this level is for the trace to only contain spans that represent the calls made to the public API. We see just a single span for `plan_files`. There are attributes on the span for the scan predicate, the field ids, case sensitivity, and the scan's snapshot id.
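A span like this can be produced with `tracing`'s `#[instrument]` attribute, roughly as follows. The signature and field set here are simplified and hypothetical, not the actual `plan_files` API.

```rust
use tracing::instrument;

// Sketch only: #[instrument] opens an info-level span named after the
// function and records its arguments as span attributes, in the spirit of
// the `plan_files` span described above.
#[instrument(level = "info")]
fn plan_files(predicate: &str, field_ids: &[i32], case_sensitive: bool, snapshot_id: i64) {
    // manifest listing, pruning and file-plan construction would happen here
}
```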
debug level traces

With `RUST_LOG` setting `iceberg=debug`, the trace is more verbose, but not overly so. This is intended as a good default for when you want to see how long the major sub-tasks of a file plan take, as well as some info on file pruning. Here's what the trace at this level looks like when expanded just one level:

Expanding one more level, we can see the individual data file manifest spans - useful for seeing how many manifests there are, how many are processed in parallel, and what kind of variance there is between the retrieval and processing times of each.
Expanding the detail for one of those manifests, we see attributes detailing the file path of the manifest in question and the number of entries within it.
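Something along these lines is roughly how a per-manifest span can carry those attributes with `tracing`: the manifest path is known up front, while the entry count is declared empty and recorded once the manifest has been read. The names here are illustrative rather than the exact ones used in the PR.

```rust
use tracing::{debug_span, field};

// Sketch of a per-manifest debug span with a late-recorded entry count.
fn process_manifest(manifest_path: &str) {
    let span = debug_span!("process_manifest", manifest_path, entry_count = field::Empty);
    let _guard = span.enter();

    // stand-in for reading and decoding the manifest
    let entries = vec!["entry-1", "entry-2"];
    span.record("entry_count", entries.len() as u64);
}
```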
Inspecting the `object_cache.get_manifest` span shows us that this was a cache miss (kind of obvious given the span duration, but useful when analysing traces in aggregate).

Finally, looking at the deepest levels in the `debug` level trace shows us more detail for a given manifest. We see the time taken to retrieve that manifest, plus individual spans representing each manifest entry / data file. The expanded details for each of these contain the data file path, plus info on the filtering applied. We see here for this file that it was filtered out because its partition did not match the predicate.
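On the cache-hit/miss point, an attribute like that can be recorded on the span roughly as in the sketch below. The `HashMap` stands in for the real object cache, and the `cache_hit` field name is illustrative.

```rust
use std::collections::HashMap;
use tracing::{debug_span, field};

// Sketch: surface a cache hit/miss as a span attribute, in the spirit of the
// `object_cache.get_manifest` span above.
fn get_manifest(cache: &HashMap<String, Vec<u8>>, key: &str) -> Option<Vec<u8>> {
    let span = debug_span!("object_cache.get_manifest", cache_hit = field::Empty);
    let _guard = span.enter();

    match cache.get(key) {
        Some(bytes) => {
            span.record("cache_hit", true);
            Some(bytes.clone())
        }
        None => {
            span.record("cache_hit", false);
            // a real implementation would fetch the manifest from storage here
            None
        }
    }
}
```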
trace level traces

The most detailed level of trace is enabled by specifying `iceberg=trace` in `RUST_LOG`. This may generate traces that are too large for general use, but could be useful when activated selectively for troubleshooting specific issues.

At this level we get spans for the `eval` calls on the expression evaluator and metrics evaluator, showing the time taken to evaluate the predicate. This is not usually needed, but I've found it occasionally useful when I've received pathologically large queries on very fragmented tables.
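For completeness, an `eval` span at this level can be as simple as the sketch below (the `evaluate` closure stands in for the expression/metrics evaluators). When TRACE is not enabled for the `iceberg` target, the span macro resolves to a disabled span, so the cost should be little more than the enabled-check.

```rust
use tracing::trace_span;

// Sketch: wrap a predicate evaluation in a TRACE-level span.
fn eval_with_span<F: FnOnce() -> bool>(evaluate: F) -> bool {
    // When TRACE is disabled, this creates a disabled span and the closure
    // runs with essentially no extra work.
    trace_span!("eval").in_scope(evaluate)
}
```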