Research
Explorations, notes, and findings.
Research
Evaluating LLMs in Medicine: What We Got Wrong
[Placeholder] A look at why standard benchmarks overstate clinical readiness — and the evaluation protocols we're building instead.
Exploration
Notes on Calibration for Trustworthy Models
[Placeholder] Why a model's confidence should mean something — and a few practical recipes for getting there.
Announcement
Introducing the Computational Thinking Lab
[Placeholder] Who we are, what we work on, and how to collaborate with us.
No posts in this category yet.
Own your AI future