Final-year CS PhD at USC, working with Yue Wang.
I train vision-centric multimodal models that unify generation and understanding. I pursue simple, scalable designs that generalize well: fewer inductive biases, more data, and learning directly in raw spaces like pixels. I love simplicity.
“Make things as simple as possible, but not simpler.”