Tag Index
Tag: Data Reliability
Correctness, recovery, and reliable production data behavior.
- When Final Output Diff Is Not Rewrite Diff
Freezing input boundaries to separate rewrite behavior from upstream identity drift.
- Shape Parity Is Not Semantic Parity
When base group counts stay stable but derived totals drift, downstream filters can collapse the output without a failed job.
- What the OSI Model Taught Me About System Boundaries
How a networking model became a way to reason about debugging, ownership, and failure boundaries in production systems.
- When Python UDF Becomes the Memory Boundary
Why grouped Python logic works until workload shape stops being bounded.
- When Support Data Becomes Runtime Infrastructure
How a correct output hid repeated work in a production data pipeline.
- Rerun Scope Is Part of the Data Contract
Why rerun scope, support-data snapshots, and processing windows must be explicit in batch workflows.