Hi there, this is Xingrui! 👋

I am a PhD student in the Computer Science Department at Johns Hopkins University, advised by Prof. Alan Yuille.

Before JHU, I obtained my Master’s degree from the University of Southern California, where I worked closely with Prof. Laurent Itti. Prior to USC, I received my B.S. in Statistics from Renmin University of China, where I was supervised by Prof. Hanfang Yang. I have also conducted research internships with the GenAI group at AMD and at the Samsung R&D Institute. I am deeply grateful to my collaborators and mentors for their support and insightful discussions throughout my research journey.

My research focuses on AI systems with 3D spatial reasoning and multimodal understanding capabilities. Specifically, most of my previous work has focused on aligning 3D / 4D knowledge with language models, as well as modality fusion in audio-visual tasks for both generation and understanding purpose. Ultimately, I strive to integrate these fields to develop more advanced AI systems that can truly understand and engage with the 4D physical world.

I am actively seeking potential collaborations. If you are an undergraduate or Master’s student who shares similar interests, I would love to have a chat!

✉️: xwang378@jhu.edu

JHU
USC
PKU
RUC

📣 News

Show older news

Selected Publications

(Full list available on Google Scholar)

Service

  • Invited reviewer for conferences CVPR, ICCV, NeurIPS, ICLR, WACV.
  • Co-host Advml Workshop at CVPR 2025.