Research

Explorations, notes, and findings.

[Placeholder] A look at why standard benchmarks overstate clinical readiness — and the evaluation protocols we're building instead.

[Placeholder] Why a model's confidence should mean something — and a few practical recipes for getting there.

[Placeholder] Who we are, what we work on, and how to collaborate with us.

Own your AI future

Want to collaborate on research?