From the notebook
Everything I've written so far. Newest first.
- Probing medical LLMs for overconfidence (May 2026)
- From 0.50 to 0.88 AUROC with activation normalization (May 2026)
- Monitoring interpretability in production (Apr 2026)
- What I learned building ExoSeeker (Jan 2025)