Distill to Detect: Exposing Stealth Biases in LLMs through Cartridge Distillation
ICML 2026 Workshops · TAIGR, Mech Interp, CoLorAI, AI4Good
I'm a master's student in Computer Science at Stanford University, advised by Amin Saberi. I also work closely with Azalia Mirhoseini and Monica Lam. My research focuses on reasoning, post-training, and interpretability in large language models.
I'm currently a research scientist intern at Foundation AI (Cisco). Before Stanford, I received my B.S. in Computer Science from the University of Illinois Urbana-Champaign, where I worked with Dilek Hakkani-Tür and Heng Ji.
Contact: achinta [at] stanford.edu
ICML 2026 Workshops · TAIGR, Mech Interp, CoLorAI, AI4Good
ICML 2025
EMNLP 2024
EMNLP 2023 · Findings
* denotes equal contribution