Now
What I'm Up To
I'm wrapping up my time at the Center for Human-Compatible AI, where I've been working on understanding how transformer language models represent entities in natural language generation.
Recent
Our paper CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring was accepted to NeurIPS 2025!
Based In
Bay Area, California