About
I’m Xiaoyu Yan, a data engineer focused on production data systems and data platform reliability.
Before data engineering, I spent years around biology, food science and engineering, epidemiology, and biostatistics. That background still shapes how I think about systems, evidence, and reliability.
This site collects generalized field notes from engineering work and graduate study: production debugging, migration work, runtime behavior, data contracts, Spark execution boundaries, access-control workflows, and information infrastructure.
Outside Engineering
Outside engineering, I spend time birding, photographing nature, practicing violin, and building small personal systems around observation and learning.
Birding
Photography
Violin
Nature