Third-year CS PhD at USC, working with Yue Wang.
One model should be enough — to see, to imagine, to understand, to create. Perception and generation are two sides of the same coin. I work on unifying both into something simple, scalable, and real, with a focus on vision-centric unified multimodal systems.
I believe in the elegance of less. Simplicity is the ultimate conviction.
“Make things as simple as possible, but not simpler.”
My earlier work spans 3D/4D scene reconstruction, self-supervised learning, and visual representation. Long live representation learning!