Notes
Research notes, engineering lessons, and ideas in progress.
Short technical writing on ML reliability, numerical debugging, GPU portability, software engineering, and the practical decisions behind research systems.
May 1, 2026
What I Want This Blog to Become
I want this blog to be a working notebook for the problems I keep returning to: machine learning reliability, numerical instability, debugging, and the engineering decisions tha...
Read note
July 8, 2025
Automatically Detecting Numerical Instability in ML via Soft Assertions
Machine learning (ML) models run on massive datasets and often perform billions of floating-point calculations. But here’s the problem: small numerical errors can snowball into ...
Read note
July 7, 2025
Testing GPU Numerics: Finding Numerical Differences Between NVIDIA and AMD GPUs
When you run the same GPU program on an NVIDIA GPU and an AMD GPU, you might expect identical results. Surprisingly, that’s not always the case — even small floating-point diffe...
Read note