feat(trace-utils)!: search all spans to populate tracer payload fields by duncanpharvey · Pull Request #1954 · DataDog/libdatadog

duncanpharvey · 2026-05-06T19:47:17Z

What does this PR do?

When extracting span tags to set in the payload, iterate through each trace to find a non-empty value for the tag. Search root spans on each trace first then additional spans if the root span has an empty value for the specified tag.

Motivation

If a trace payload contains multiple trace chunks, and one of those chunks does not contain a trace that has a value for a given tag (like version), then the top level field set in the tracer payload could be empty even when other traces could have non-empty values for the same tag.

This situation arose with the .NET tracer sending a command_execution span with no version in the same payload as an azure_functions.invoke span with a version. As a result, the trace metric generated from this payload, trace.azure_functions.invoke.hits, did not have a version tag set.

https://datadoghq.atlassian.net/browse/SVLS-9006

Additional Notes

Mimics searchTraceForField from Go Agent.
Renamed public struct RootSpanTags to TracerPayloadTags to better reflect the source of the populated fields.

How to test the change?

Unit tests:

First trace root span has no fields. Second trace root span has all fields. The second root span should populate all fields.
Root span has no fields. Child span has all fields. The child span should populate all fields.
Root span has all fields. Child has different values for all fields. The root span should populate all fields.
Root span has empty values for all fields. Child span has a non-empty values. The child span should populate all fields.
env value is normalized by collect_pb_trace_chunks
Empty env values after normalization are skipped. in collect_pb_trace_chunks
A span with the same span_id as root is treated as the root and skipped in the child span search. Only the root span's own meta is checked for it.

github-actions · 2026-05-06T19:49:57Z

Clippy Allow Annotation Report

Comparing clippy allow annotations between branches:

Base Branch: origin/main
PR Branch: origin/duncan-harvey/tracer-payload-check-all-traces

Summary by Rule

Rule	Base Branch	PR Branch	Change
todo	2	2	No change (0%)
unwrap_used	5	5	No change (0%)
Total	7	7	No change (0%)

Annotation Counts by File

File	Base Branch	PR Branch	Change
`libdd-trace-utils/src/send_data/mod.rs`	5	5	No change (0%)
`libdd-trace-utils/src/trace_utils.rs`	2	2	No change (0%)

Annotation Stats by Crate

Crate	Base Branch	PR Branch	Change
`clippy-annotation-reporter`	5	5	No change (0%)
`datadog-ffe-ffi`	1	1	No change (0%)
`datadog-ipc`	21	21	No change (0%)
`datadog-live-debugger`	6	6	No change (0%)
`datadog-live-debugger-ffi`	10	10	No change (0%)
`datadog-profiling-replayer`	4	4	No change (0%)
`datadog-remote-config`	3	3	No change (0%)
`datadog-sidecar`	57	57	No change (0%)
`libdd-common`	13	13	No change (0%)
`libdd-common-ffi`	12	12	No change (0%)
`libdd-data-pipeline`	5	5	No change (0%)
`libdd-ddsketch`	2	2	No change (0%)
`libdd-dogstatsd-client`	1	1	No change (0%)
`libdd-profiling`	13	13	No change (0%)
`libdd-telemetry`	20	20	No change (0%)
`libdd-tinybytes`	4	4	No change (0%)
`libdd-trace-normalization`	2	2	No change (0%)
`libdd-trace-obfuscation`	8	8	No change (0%)
`libdd-trace-stats`	1	1	No change (0%)
`libdd-trace-utils`	15	15	No change (0%)
Total	203	203	No change (0%)

About This Report

This report tracks Clippy allow annotations for specific rules, showing how they've changed in this PR. Decreasing the number of these annotations generally improves code quality.

codecov-commenter · 2026-05-06T20:00:00Z

Codecov Report

❌ Patch coverage is 95.13274% with 11 lines in your changes missing coverage. Please review.
✅ Project coverage is 72.63%. Comparing base (2ce59aa) to head (fd0ce76).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1954      +/-   ##
==========================================
+ Coverage   72.54%   72.63%   +0.09%     
==========================================
  Files         451      451              
  Lines       74481    74687     +206     
==========================================
+ Hits        54030    54249     +219     
+ Misses      20451    20438      -13

Components	Coverage Δ
libdd-crashtracker	`65.31% <ø> (-0.02%)`	⬇️
libdd-crashtracker-ffi	`37.68% <ø> (ø)`
libdd-alloc	`98.77% <ø> (ø)`
libdd-data-pipeline	`85.97% <ø> (ø)`
libdd-data-pipeline-ffi	`71.04% <ø> (ø)`
libdd-common	`79.81% <ø> (ø)`
libdd-common-ffi	`74.41% <ø> (ø)`
libdd-telemetry	`73.34% <ø> (-0.03%)`	⬇️
libdd-telemetry-ffi	`31.36% <ø> (ø)`
libdd-dogstatsd-client	`82.64% <ø> (ø)`
datadog-ipc	`76.22% <ø> (ø)`
libdd-profiling	`81.57% <ø> (ø)`
libdd-profiling-ffi	`64.51% <ø> (ø)`
libdd-sampling	`97.25% <ø> (ø)`
datadog-sidecar	`29.09% <ø> (ø)`
datdog-sidecar-ffi	`9.67% <ø> (ø)`
spawn-worker	`48.86% <ø> (ø)`
libdd-tinybytes	`93.16% <ø> (ø)`
libdd-trace-normalization	`81.71% <ø> (ø)`
libdd-trace-obfuscation	`87.39% <ø> (ø)`
libdd-trace-protobuf	`68.25% <ø> (ø)`
libdd-trace-utils	`89.59% <95.13%> (+0.73%)`	⬆️
libdd-tracer-flare	`86.88% <ø> (ø)`
libdd-log	`74.83% <ø> (ø)`

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

datadog-datadog-prod-us1-2 · 2026-05-06T20:00:37Z

🎉 All green!

❄️ No new flaky tests detected
🧪 All tests passed

🎯 Code Coverage (details)
• Patch Coverage: 95.13%
• Overall Coverage: 72.64% (+0.09%)

_{This comment will be updated automatically if new data arrives.

🔗 Commit SHA: fd0ce76 | Docs | Datadog PR Page | Give us feedback!}

dd-octo-sts · 2026-05-06T20:22:23Z

Artifact Size Benchmark Report

aarch64-alpine-linux-musl

Artifact	Baseline	Commit	Change
/aarch64-alpine-linux-musl/lib/libdatadog_profiling.so	7.57 MB	7.57 MB	0% (0 B) 👌
/aarch64-alpine-linux-musl/lib/libdatadog_profiling.a	81.84 MB	81.84 MB	0% (0 B) 👌

aarch64-unknown-linux-gnu

Artifact	Baseline	Commit	Change
/aarch64-unknown-linux-gnu/lib/libdatadog_profiling.a	98.03 MB	98.03 MB	0% (0 B) 👌
/aarch64-unknown-linux-gnu/lib/libdatadog_profiling.so	10.01 MB	10.01 MB	0% (0 B) 👌

libdatadog-x64-windows

Artifact	Baseline	Commit	Change
/libdatadog-x64-windows/debug/dynamic/datadog_profiling_ffi.dll	24.48 MB	24.48 MB	0% (0 B) 👌
/libdatadog-x64-windows/debug/dynamic/datadog_profiling_ffi.lib	79.87 KB	79.87 KB	0% (0 B) 👌
/libdatadog-x64-windows/debug/dynamic/datadog_profiling_ffi.pdb	180.23 MB	180.21 MB	-0% (-16.00 KB) 👌
/libdatadog-x64-windows/debug/static/datadog_profiling_ffi.lib	913.96 MB	913.96 MB	+0% (+4.69 KB) 👌
/libdatadog-x64-windows/release/dynamic/datadog_profiling_ffi.dll	7.73 MB	7.73 MB	0% (0 B) 👌
/libdatadog-x64-windows/release/dynamic/datadog_profiling_ffi.lib	79.87 KB	79.87 KB	0% (0 B) 👌
/libdatadog-x64-windows/release/dynamic/datadog_profiling_ffi.pdb	23.17 MB	23.17 MB	0% (0 B) 👌
/libdatadog-x64-windows/release/static/datadog_profiling_ffi.lib	45.36 MB	45.36 MB	0% (0 B) 👌

libdatadog-x86-windows

Artifact	Baseline	Commit	Change
/libdatadog-x86-windows/debug/dynamic/datadog_profiling_ffi.dll	21.09 MB	21.09 MB	0% (0 B) 👌
/libdatadog-x86-windows/debug/dynamic/datadog_profiling_ffi.lib	81.11 KB	81.11 KB	0% (0 B) 👌
/libdatadog-x86-windows/debug/dynamic/datadog_profiling_ffi.pdb	184.40 MB	184.40 MB	0% (0 B) 👌
/libdatadog-x86-windows/debug/static/datadog_profiling_ffi.lib	900.43 MB	900.44 MB	+0% (+4.82 KB) 👌
/libdatadog-x86-windows/release/dynamic/datadog_profiling_ffi.dll	5.99 MB	5.99 MB	0% (0 B) 👌
/libdatadog-x86-windows/release/dynamic/datadog_profiling_ffi.lib	81.11 KB	81.11 KB	0% (0 B) 👌
/libdatadog-x86-windows/release/dynamic/datadog_profiling_ffi.pdb	24.81 MB	24.81 MB	0% (0 B) 👌
/libdatadog-x86-windows/release/static/datadog_profiling_ffi.lib	42.87 MB	42.87 MB	0% (0 B) 👌

x86_64-alpine-linux-musl

Artifact	Baseline	Commit	Change
/x86_64-alpine-linux-musl/lib/libdatadog_profiling.a	72.93 MB	72.93 MB	0% (0 B) 👌
/x86_64-alpine-linux-musl/lib/libdatadog_profiling.so	8.42 MB	8.42 MB	0% (0 B) 👌

x86_64-unknown-linux-gnu

Artifact	Baseline	Commit	Change
/x86_64-unknown-linux-gnu/lib/libdatadog_profiling.a	90.70 MB	90.70 MB	0% (0 B) 👌
/x86_64-unknown-linux-gnu/lib/libdatadog_profiling.so	10.06 MB	10.06 MB	0% (0 B) 👌

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: fd5c1b9464

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-06T20:54:56Z

+    if let Some(v) = root.meta.get(field) {
+        return Some(v.clone());


Skip empty trace tag values when searching spans

When a root span carries the requested key with an empty string (for example version: "") and a child span in the same trace has the non-empty value, this returns immediately with the empty value and never checks the remaining spans. The caller then leaves app_version/env/etc. empty for single-trace payloads, which misses the stated goal of finding a non-empty value when the root span has an empty tag.

Useful? React with 👍 / 👎.

Addressed in 3c10b32

duncanpharvey · 2026-05-06T21:05:18Z

@codex review

chatgpt-codex-connector · 2026-05-06T21:08:08Z

Codex Review: Didn't find any major issues. What shall we delve into next?

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Lewis-E · 2026-05-07T17:48:37Z

        assert!(!span.meta.contains_key("aas.site.type"));
    }
+
+    #[test]


So I was look at pb:Span, and it looks like there's a bunch of fuzzing used to test that code. Do you think that a bolero check!() type test might be useful here? How well formed are trace payloads, generally?

collect_pb_trace_chunks, where search_trace_for_field is called, is certainly complex enough to warrant a bolero test in general for everything that it's doing. But for what we want to test as part of this PR in search_trace_for_field - "best effort, first non-empty value wins from any span regardless of structural validity". It just means the bolero test can only assert "if a value is returned, it came from some span in the trace", which is already guaranteed from the other unit tests.

Lewis-E · 2026-05-07T17:56:15Z

+            // from an earlier trace.
+            let root = &trace[root_span_index];
+            if tracer_payload_tags.env.is_empty() {
+                if let Some(v) = search_trace_for_field(root, trace, "env") {


So this trace gets normalized up here right: https://github.com/DataDog/libdatadog/pull/1954/changes#diff-e35e80d6b64d2cc411fc38dad84cf1cf35bd47501f9ef20761c1d20c1b5abd26R631

But it looks like that normalization can have errors, which are logged, but don't break the loop. This is sort of why I'm wondering about fuzzing/trace payloads. Can we get incorrect children passing on incorrect tags?

Between these normalize functions, only the env field is normalized for the fields relevant to the tracer payload change (env, version, _dd.hostname, runtime-id).

normalize_trace

normalize_span

normalize_chunk

There is a scenario where normalization fails on a span and later spans are not normalized, resulting in potentially non-normalized env values being set on the tracer payload. I think the safest way to handle this scenario would be to apply normalization in collect_pb_trace_chunks rather than modifying how normalize_trace short circuits on an error when iterating over spans.

if let Some(mut v) = search_trace_for_field(root, trace, "env") { normalize_utils::normalize_tag(&mut v); tracer_payload_tags.env = v; }

Applied change here: 00b025c

lucaspimentel · 2026-05-11T20:51:18Z

-                    "version" => root_span_tags.app_version,
-                    "_dd.hostname" => root_span_tags.hostname,
-                    "runtime-id" => root_span_tags.runtime_id,
+        if is_agentless {


When is_agentless set and what does it mean in this context? If this code is running in the agent, how can it be "agentless"? 😅

I think this comes from the fact that these trace utils are used in the tracers in addition to the Serverless Compatibility Layer and Bottlecap. From the tracer perspective, if traces are sent directly to the backend, is_agentless: true means "do the normalization and other tasks the Go agent would normally do." Maybe a more appropriate name for this parameter would be something like perform_agent_processing?

collect_pb_trace_chunks call in Serverless Compatibility Layer

collect_pb_trace_chunks call in Bottlecap

Thanks for clarifying!

Maybe a more appropriate name for this parameter would be something like perform_agent_processing

Makes sense, but probably out scope of this PR.

duncanpharvey · 2026-05-13T20:05:43Z

/merge

gh-worker-devflow-routing-ef8351 · 2026-05-13T20:05:51Z

View all feedbacks in Devflow UI.

2026-05-13 20:05:50 UTC ℹ️ Start processing command /merge

2026-05-13 20:05:57 UTC ℹ️ MergeQueue: waiting for PR to be ready

This pull request is not mergeable according to GitHub. Common reasons include pending required checks, missing approvals, or merge conflicts — but it could also be blocked by other repository rules or settings.
It will be added to the queue as soon as checks pass and/or get approvals. View in MergeQueue UI.
Note: if you pushed new commits since the last approval, you may need additional approval.
You can remove it from the waiting list with /remove command.

2026-05-13 20:39:13 UTC ℹ️ MergeQueue: merge request added to the queue

The expected merge time in main is approximately 47m (p90).

2026-05-13 21:25:09 UTC ℹ️ MergeQueue: This merge request was merged

duncanpharvey added 2 commits May 6, 2026 15:46

use span tags from all root spans in tracer payload

e2f014a

search non root spans for fields

fd5c1b9

github-actions Bot added the mini-agent label May 6, 2026

duncanpharvey changed the title ~~feat(trace-utils): search all spans to populate tracer payload fields~~ feat(trace-utils)!: search all spans to populate tracer payload fields May 6, 2026

duncanpharvey marked this pull request as ready for review May 6, 2026 20:53

duncanpharvey requested review from a team as code owners May 6, 2026 20:53

duncanpharvey requested review from a team, Lewis-E and lucaspimentel and removed request for a team May 6, 2026 20:53

chatgpt-codex-connector Bot reviewed May 6, 2026

View reviewed changes

only return non-empty values in search_trace_for_field

3c10b32

Lewis-E reviewed May 7, 2026

View reviewed changes

duncanpharvey added 3 commits May 8, 2026 15:16

normalize env tag in case span was skipped during normalization

00b025c

apply formatting

963dadf

add more unit tests

d18934e

duncanpharvey requested a review from Lewis-E May 8, 2026 20:07

Lewis-E approved these changes May 8, 2026

View reviewed changes

VianneyRuhlmann approved these changes May 11, 2026

View reviewed changes

lucaspimentel reviewed May 11, 2026

View reviewed changes

Comment thread libdd-trace-utils/src/trace_utils.rs

lucaspimentel reviewed May 11, 2026

View reviewed changes

Comment thread libdd-trace-utils/src/trace_utils.rs

lucaspimentel reviewed May 11, 2026

View reviewed changes

add more unit tests

551934a

duncanpharvey requested a review from lucaspimentel May 12, 2026 15:46

duncanpharvey mentioned this pull request May 13, 2026

feat: add fallback to DD_VERSION for trace stats computation DataDog/serverless-components#125

Closed

lucaspimentel approved these changes May 13, 2026

View reviewed changes

Merge branch 'main' into duncan-harvey/tracer-payload-check-all-traces

fd0ce76

gh-worker-dd-devflow-36fce6 Bot added mergequeue-status: waiting mergequeue-status: queued mergequeue-status: in_progress and removed mergequeue-status: waiting mergequeue-status: queued labels May 13, 2026

gh-worker-dd-mergequeue-cf854d Bot merged commit 0a3304c into main May 13, 2026
114 checks passed

gh-worker-dd-mergequeue-cf854d Bot deleted the duncan-harvey/tracer-payload-check-all-traces branch May 13, 2026 21:25

gh-worker-dd-devflow-36fce6 Bot added mergequeue-status: done and removed mergequeue-status: in_progress labels May 13, 2026

		if let Some(v) = root.meta.get(field) {
		return Some(v.clone());

Conversation

duncanpharvey commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Motivation

Additional Notes

How to test the change?

Uh oh!

github-actions Bot commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Clippy Allow Annotation Report

Summary by Rule

Annotation Counts by File

Annotation Stats by Crate

About This Report

Uh oh!

codecov-commenter commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

datadog-datadog-prod-us1-2 Bot commented May 6, 2026 • edited by datadog-datadog-prod-us1 Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dd-octo-sts Bot commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Artifact Size Benchmark Report

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 6, 2026

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

duncanpharvey commented May 6, 2026

Uh oh!

chatgpt-codex-connector Bot commented May 6, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

lucaspimentel May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

duncanpharvey commented May 13, 2026

Uh oh!

gh-worker-devflow-routing-ef8351 Bot commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

duncanpharvey commented May 6, 2026 •

edited

Loading

github-actions Bot commented May 6, 2026 •

edited

Loading

codecov-commenter commented May 6, 2026 •

edited

Loading

datadog-datadog-prod-us1-2 Bot commented May 6, 2026 •

edited by datadog-datadog-prod-us1 Bot

Loading

dd-octo-sts Bot commented May 6, 2026 •

edited

Loading

lucaspimentel May 11, 2026 •

edited

Loading

gh-worker-devflow-routing-ef8351 Bot commented May 13, 2026 •

edited

Loading