Final-year CS PhD at USC, working with Yue Wang.
I chase simple designs that scale and generalize — more intuitive, fewer inductive biases, fewer dependencies. These days that looks like vision-centric multimodal models: tokenization, generation and understanding in one system, learned end-to-end in raw pixels.
“Make things as simple as possible, but not simpler.”