How should legacy B3 and W3C propagators be handled during migration?

Deploy a multi-propagator configuration. The SDK attempts extraction in priority order (W3C first, then B3). Once all services emit W3C-compliant headers, remove legacy propagators. Never strip incoming B3 headers at ingress until every downstream dependency supports W3C extraction.

SDK Implementation & Context Propagation

When a request crosses a service boundary without a properly initialized OpenTelemetry SDK, the trace dies at the edge. The receiving service has no parent span to attach to, so it opens a new root span — and the end-to-end picture fragments into isolated islands of telemetry. Engineers hunting a latency regression see two disconnected traces instead of a single causal chain; the actual cross-service call that caused the spike is invisible.

The problem compounds in polyglot, multi-region microservices. A Python API calls a Go gRPC service, which publishes to Kafka, which a Java consumer processes asynchronously. Each hop is an opportunity for context loss: a misconfigured propagator, a load balancer that strips headers, an async boundary where the SDK’s execution context was never restored. The sections below give you the architectural grounding and production patterns to close every one of those gaps.

Core Concepts & Terminology

Term	Definition
Trace	The complete record of a single request across all services it touches, identified by a shared `trace_id`.
Span	A named, timed unit of work within a trace. Spans have a `span_id`, optional `parent_span_id`, timestamps, status, and key-value attributes.
Trace ID	A 128-bit globally unique identifier that remains constant across every span in a single request lifecycle.
Propagator	A component that serializes the active context into transport headers (inject) and reconstructs it from incoming headers (extract).
Carrier	The object that physically moves context between processes — an HTTP header map, a Kafka message header, or a gRPC metadata object.
Baggage	A mutable, cross-cutting key-value store that travels alongside the trace context. Intended for tenant IDs, routing hints, and correlation tokens — not high-cardinality data. See baggage vs span attributes for when each is appropriate.
W3C TraceContext	The IETF standard header format (`traceparent`, `tracestate`) for cross-service context propagation.
OTLP	OpenTelemetry Protocol — the canonical wire format for exporting spans, metrics, and logs to a collector or backend.
Resource	A set of key-value attributes that describe the entity producing telemetry (service name, version, deployment environment, host).
Sampler	The component that decides, at span creation time or after all data is collected, whether a trace is recorded and exported.

Architectural Overview

An OpenTelemetry SDK sits between your application code and your telemetry backend. Its job is to produce spans, attach context to outbound calls, extract context from inbound calls, and ship the resulting data via OTLP to a collector that routes it to Jaeger, Tempo, or another backend.

The SDK is composed of four cooperating subsystems:

Tracer Provider — the factory that creates Tracer instances and holds the global SDK configuration (sampler, resource, span processors, exporters).
Span Processors — hooks invoked on span start and end. The BatchSpanProcessor buffers completed spans and flushes them to the exporter on a configurable schedule. The SimpleSpanProcessor exports synchronously, suitable only for development.
Exporters — transport adapters that convert spans to a wire format (OTLP/gRPC, OTLP/HTTP, Zipkin JSON) and deliver them to the collector endpoint.
Propagators — registered globally and invoked automatically by instrumentation libraries to inject and extract context at network boundaries. W3C TraceContext propagation is the default; B3 and Jaeger propagators can be stacked for legacy compatibility.

The data flow for a single HTTP request is: inbound request arrives → propagator extracts traceparent from headers → SDK creates a child span with the extracted parent → application logic runs → instrumentation libraries create child spans for DB calls, outbound HTTP, queue publishes → span processor enqueues completed spans → batch exporter flushes spans to the collector → collector routes to the storage backend.

SDK Initialization & Resource Configuration

Proper SDK bootstrapping dictates telemetry accuracy, routing efficiency, and resource attribution. The initialization phase must establish deterministic resource attributes, configure the sampler before any spans are created, and secure the exporter endpoints. Initializing too late — after HTTP middleware or database drivers are registered — causes early spans to be emitted against an unconfigured provider and silently dropped.

Environment variables provide the baseline configuration layer, ensuring parity across development, staging, and production:

# Service identity
export OTEL_SERVICE_NAME="order-processing-api"
export OTEL_RESOURCE_ATTRIBUTES="deployment.environment=production,service.version=2.4.1,host.name=$(hostname)"

# Sampling
export OTEL_TRACES_SAMPLER="parentbased_traceidratio"
export OTEL_TRACES_SAMPLER_ARG="0.1"

# Export
export OTEL_EXPORTER_OTLP_PROTOCOL="grpc"
export OTEL_EXPORTER_OTLP_ENDPOINT="https://telemetry-collector.internal:4317"

Python SDK initialization with mTLS and resource detectors:

from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import BatchSpanProcessor
from opentelemetry.exporter.otlp.proto.grpc.trace_exporter import OTLPSpanExporter
from opentelemetry.sdk.resources import Resource, SERVICE_NAME
from opentelemetry.resourcedetector.kubernetes import KubernetesResourceDetector

# Resource detectors must run synchronously at startup to avoid
# race conditions during early span emission.
k8s_resource = KubernetesResourceDetector().detect()
base_resource = Resource.create({SERVICE_NAME: "order-processing-api"})
resource = base_resource.merge(k8s_resource)

exporter = OTLPSpanExporter(
    endpoint="https://telemetry-collector.internal:4317",
    credentials=grpc.ssl_channel_credentials(
        root_certificates=open("/etc/ssl/certs/collector-ca.pem", "rb").read(),
        private_key=open("/etc/ssl/private/client-key.pem", "rb").read(),
        certificate_chain=open("/etc/ssl/private/client-cert.pem", "rb").read(),
    ),
)

provider = TracerProvider(resource=resource)
provider.add_span_processor(BatchSpanProcessor(exporter))
trace.set_tracer_provider(provider)  # must happen before any import that calls trace.get_tracer()

For language-specific initialization sequences, lifecycle hooks, and credential rotation patterns, see OpenTelemetry SDK Setup for Backend Services.

Instrumentation Models

Instrumentation approaches trade off developer velocity against runtime overhead and telemetry granularity.

Auto-instrumentation attaches at the runtime level — a Java agent, a Node.js --require hook, a Python sitecustomize.py import — and intercepts framework calls automatically. It covers HTTP servers, database clients, and message producers with zero application code changes. The trade-off is that it cannot capture business-logic semantics: the span describes a DB query, not the business intent behind it.

Manual instrumentation requires explicit API calls to define span boundaries, attach semantic attributes, and record exceptions. It is the only way to instrument custom serialization layers, background workers, or business rules that span multiple I/O calls.

// Manual span creation — order processing boundary
const span = tracer.startSpan('process-order', {
    attributes: {
        'order.id': orderId,
        'order.total_usd': totalAmount,
        'order.item_count': items.length,
    },
});

try {
    await executeOrderLogic();
    span.setStatus({ code: SpanStatusCode.OK });
} catch (error) {
    span.recordException(error);
    span.setStatus({ code: SpanStatusCode.ERROR, message: error.message });
} finally {
    span.end(); // always end in finally — a missing span.end() causes a span that never closes
}

Auto-instrumentation for FastAPI and outbound HTTP via HTTPX:

from opentelemetry.instrumentation.fastapi import FastAPIInstrumentor
from opentelemetry.instrumentation.httpx import HTTPXInstrumentor

app = FastAPI()
FastAPIInstrumentor.instrument_app(app)   # wraps every route handler
HTTPXInstrumentor().instrument()          # injects traceparent into every outbound request

Most production services combine both: auto-instrumentation for infrastructure-level spans and manual spans for business boundaries. See Auto-Instrumentation vs Manual Span Creation for benchmark data and hybrid patterns.

Propagation Mechanics

Context propagation follows a strict two-phase contract at every network boundary.

Inject — before sending a request, the SDK serializes the active span context into the carrier (HTTP headers, Kafka message headers, gRPC metadata). Extract — on receiving a request, the SDK deserializes the carrier and reconstructs the parent span context, making it the active context for the duration of that request.

The propagator interface is language-agnostic:

interface TextMapPropagator {
    inject(context: Context, carrier: Carrier, setter: HeaderSetter): void
    extract(carrier: Carrier, getter: HeaderGetter): Context
    fields(): string[]  // header names this propagator reads/writes
}

The W3C TraceContext standard uses two headers:

traceparent: 00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01
tracestate:  vendor1=value1,vendor2=value2

traceparent encodes: version (00), 128-bit trace_id, 64-bit parent_span_id, and trace flags (LSB = sampled). tracestate carries vendor-specific extensions without breaking standard parsers. Keep tracestate values under 4 KB per vendor — some load balancers truncate headers above 8 KB.

Context propagation middleware for Express:

import { context, propagation } from '@opentelemetry/api';

app.use((req, res, next) => {
    // Extract builds a new context with the incoming span as parent
    const extractedContext = propagation.extract(context.active(), req.headers);
    context.with(extractedContext, () => {
        req.otelContext = context.active();
        next();
    });
});

For environments with sidecar proxies or ingress controllers, see Context Propagation Across Service Meshes — Envoy, Linkerd, and Istio all have specific header-forwarding configurations that must be enabled or the traceparent is silently dropped at the mesh layer.

Sampling Strategies Overview

Head-based sampling makes the keep/drop decision at the root span, before any downstream spans are created. The sampled flag in traceparent carries that decision to all services, so the entire trace is either recorded or not. It is simple, low-overhead, and predictable — but it cannot prioritize interesting traces because no outcome data is available yet.

Tail-based sampling defers the decision until the complete trace is assembled. A sampling proxy (the OpenTelemetry Collector’s tail_sampling processor, or a dedicated component like Grafana Tempo’s receiver) evaluates rules against full trace data: error status, latency percentile, specific attribute values. It can keep 100% of error traces and 1% of healthy ones, which is the right trade-off for most production services.

Dimension	Head-based	Tail-based
Decision point	Root span creation	After trace completion
Latency impact	Negligible	Collector memory for in-flight traces
Can target errors/slow traces	No	Yes
Config complexity	Low	Medium–high
Propagation requirement	Sampled flag in `traceparent`	All spans must reach the collector

The primary risk with tail-based sampling is that every span must be buffered until the trace is complete — a 30-second request holds all its spans in memory for 30 seconds. Size the collector accordingly and set a decision_wait timeout to flush incomplete traces after a reasonable window.

Storage & Backend Integration

Completed spans travel from the SDK’s BatchSpanProcessor to an OTLP endpoint — typically the OpenTelemetry Collector — which enriches, filters, and routes them to a storage backend.

The three most common backends are Jaeger (self-hosted, Cassandra or Badger storage, strong UI for dependency graphs), Grafana Tempo (object storage — S3/GCS, cost-efficient at scale, deep Grafana integration), and commercial platforms (Datadog, Honeycomb, Lightstep) that accept OTLP directly.

Collector pipeline routing by region, with PII redaction:

receivers:
  otlp:
    protocols:
      grpc:
        endpoint: "0.0.0.0:4317"

processors:
  attributes:
    actions:
      - key: "user.email"
        action: delete
      - key: "http.request.body"
        action: delete
  batch:
    timeout: 5s
    send_batch_size: 1000

exporters:
  otlp/eu:
    endpoint: "https://collector.eu-west-1.internal:4317"
  otlp/us:
    endpoint: "https://collector.us-east-1.internal:4317"

service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [attributes, batch]
      exporters: [otlp/eu, otlp/us]

Apply consistent OTEL_RESOURCE_ATTRIBUTES tagging with cloud.region and cloud.provider to route spans to the correct regional backend and satisfy data-residency requirements.

Failure Modes & Edge Cases

Context loss at async boundaries is the most common production failure. Event loop scheduling and task queuing can detach the active context from the logical execution flow, producing orphaned spans. The fix is explicit context restoration before any async work begins.

Python asyncio with explicit context attachment:

import asyncio
from opentelemetry import context, trace

async def consume_messages(queue):
    current_ctx = context.get_current()  # capture before entering the loop
    while True:
        msg = await queue.get()
        token = context.attach(current_ctx)  # restore before span creation
        span = trace.get_tracer(__name__).start_span("consume-message")
        try:
            await handle_message(msg)
        finally:
            span.end()
            context.detach(token)

See Handling Async Boundaries in Node.js and Python for AsyncLocalStorage patterns and unawaited-promise pitfalls.

Java thread pool context propagation:

ExecutorService executor = Executors.newFixedThreadPool(10);
Context currentContext = Context.current(); // capture on the calling thread

Runnable task = () -> {
    try (Scope scope = currentContext.makeCurrent()) {
        // Context is safely attached to this worker thread
        performBackgroundWork();
    } // scope.close() detaches context — prevents cross-request contamination
};

executor.submit(task); // Context.current().wrap(task) is the convenience form

See Trace Context in Multi-Threaded Environments for thread pool isolation patterns.

High-cardinality attribute explosion — attaching unbounded values (user IDs, order IDs, request body fields) as span attributes causes the storage backend’s index to grow unboundedly. Most backends impose a cardinality limit of 10,000–100,000 unique values per attribute key before query performance degrades. Apply an attributes processor in the collector to drop or hash high-cardinality fields before export.

Clock skew — distributed systems often have sub-second NTP drift between nodes. When the child span’s start timestamp precedes the parent’s start timestamp, Jaeger and Tempo display negative child durations, which breaks waterfall visualizations. Enforce NTP synchronization across all nodes and add a clock-skew threshold in your collector’s spanmetrics configuration.

Propagator mismatch — mixing B3, Jaeger, and W3C propagators without explicit multi-propagator configuration causes silent extraction failures. Services that emit W3C headers and services that emit B3 headers produce orphaned root spans at each handoff. During migration, configure every service with both propagators and a W3C-first extraction order.

Security Considerations

PII in attributes is the most common compliance gap in tracing deployments. Engineers adding diagnostic context often include user emails, session tokens, or request payloads as span attributes — data that then flows into the trace backend unredacted. Apply a collector attributes processor to delete or hash any attribute key in the user.*, http.request.body, or db.statement namespaces before export. See Security Boundaries in Distributed Tracing for a full treatment of trust boundaries and encrypting trace payloads at rest and in transit.

Baggage propagation is a particularly high-risk surface. Baggage is designed to carry application-level metadata across service boundaries, but because it is injected into every outbound header, a large or sensitive baggage value is broadcast to every downstream service in the call chain. Limit baggage to non-sensitive routing hints (tenant ID, feature flag token) and enforce a maximum size at ingress — strip and reject requests where baggage headers exceed 8 KB.

Trust boundaries at ingress — never forward externally-supplied traceparent headers to internal services without validating the format and stripping the sampled flag if your internal sampling policy should take precedence. An attacker who can craft a traceparent with the sampled flag set can force 100% sampling on your backend, exhausting storage. Validate traceparent format at ingress and reset the flag to your internal sampling decision.

Production Readiness Checklist

SDK setup:

TracerProvider initialized before any instrumentation library is imported or HTTP middleware registered
Resource attributes include service.name, service.version, and deployment.environment
Resource detectors (Kubernetes, cloud provider) run synchronously at startup
BatchSpanProcessor configured with appropriate queue size and export timeout for your throughput

Propagation:

W3C TraceContext propagator registered globally
Legacy B3 propagator stacked for any services that have not yet migrated
Ingress validates traceparent format and enforces internal sampling flag
Service mesh header forwarding enabled (Envoy --tracing, Istio meshConfig.enableTracing)

Sampling:

Sampler configured before span creation (not after)
Head-based rate selected for baseline traffic (1–10% for healthy paths)
Tail-based rules capture 100% of error and high-latency traces
decision_wait timeout set on tail-based collector to flush incomplete traces

Export & storage:

OTLP exporter uses mTLS with rotated client certificates
Collector attributes processor redacts PII before export
Collector pipeline routes by cloud.region resource attribute
Backend retention policy configured and tested

Alerting:

Alert on span export failure rate (exporter queue saturation)
Alert on trace completeness ratio (orphaned root span rate)
Alert on traceparent rejection rate at ingress
Alert on collector memory saturation (tail-based buffer overflow)

Troubleshooting FAQ

How does SDK backward compatibility affect production upgrades?

OpenTelemetry SDKs maintain strict semantic versioning. Minor releases introduce backward-compatible feature additions; major releases may deprecate legacy APIs. Always test propagator configurations and exporter endpoints in staging before rolling out SDK upgrades to production. Use feature flags to toggle new instrumentation layers incrementally and watch the orphaned-root-span rate as a leading indicator of propagator regressions.

What is the performance impact of context propagation?

Context propagation adds negligible CPU overhead (under 1% of request latency) when using W3C-compliant propagators. The primary cost is header serialization and context manager allocations. Avoid attaching large payloads to baggage and ensure context restoration does not block the event loop. BatchSpanProcessor with a queue size of 2048 and a 5-second flush interval handles around 5,000 spans/second per CPU core with sub-millisecond overhead.

How should legacy B3 and W3C propagators coexist during migration?

Deploy a multi-propagator configuration during migration. The SDK attempts extraction in priority order — W3C first, then B3. Once all services emit W3C-compliant headers, remove the B3 propagator. Never strip incoming B3 headers at ingress until every downstream dependency supports W3C extraction. Track the ratio of B3-extracted vs W3C-extracted spans in your collector metrics to determine when migration is complete.

OpenTelemetry SDK Setup for Backend Services — language-specific initialization, lifecycle hooks, and credential rotation
Auto-Instrumentation vs Manual Span Creation — benchmark data, memory footprint, and hybrid patterns
Context Propagation Across Service Meshes — Envoy, Linkerd, and Istio header-forwarding configuration
Handling Async Boundaries in Node.js and Python — AsyncLocalStorage, contextvars, and unawaited-promise traps
Trace Context in Multi-Threaded Environments — thread pool isolation and worker context handoff
Baggage Metadata & Routing Workflows — tenant-ID propagation, baggage size constraints, and security controls

↑ Back to Distributed Tracing & Request Correlation Workflows