Hi there, this is Xingrui! 👋
I am a PhD student in the Computer Science Department at Johns Hopkins University, advised by Prof. Alan Yuille.
Before JHU, I obtained my Master’s degree from the University of Southern California, where I worked closely with Prof. Laurent Itti. Prior to USC, I received my B.S. in Statistics from Renmin University of China, where I was supervised by Prof. Hanfang Yang. I have also conducted research internships with the GenAI group at AMD and at the Samsung R&D Institute. I am deeply grateful to my collaborators and mentors for their support and insightful discussions throughout my research journey.
My research focuses on AI systems with 3D spatial reasoning and multimodal understanding capabilities. Specifically, most of my previous work has focused on aligning 3D / 4D knowledge with language models, as well as modality fusion in audio-visual tasks for both generation and understanding purpose. Ultimately, I strive to integrate these fields to develop more advanced AI systems that can truly understand and engage with the 4D physical world.
I am actively seeking potential collaborations. If you are an undergraduate or Master’s student who shares similar interests, I would love to have a chat!
✉️: xwang378@jhu.edu
📣 News
- XModBench has been accepted by ICLR 2026! Please stay tuned for the code release.
- Heading to ICCV 2025 🏖️ to present at the GEN4AVM workshop.
- SpatialReasoner accepted by NeurIPS 2025.
- Invited talk at BEAM Workshop at CVPR 2025 (slides). (Jun 2025)
- Spatial457 accepted at CVPR 2025 as a highlight paper! (Feb 2025)
- One paper accepted at ICLR 2025. (Jan 2025)
- Introduced NS-4DPhysics and DynSuperCLEVR: a 4D neural symbolic model and benchmark for dynamic spatial reasoning. (May 2024)
Show older news
- One paper accepted by NeurIPS 2023. (Sep 2023)
- Joined CCVL at JHU as a PhD student. (Aug 2023)
- SuperCLEVR selected as a highlight in CVPR 2023. (Mar 2023)
- One paper accepted at CVPR 2023. (Feb 2023)
- One paper accepted at ECCV 2022
Selected Publications
(Full list available on Google Scholar)







Service
- Invited reviewer for conferences CVPR, ICCV, NeurIPS, ICLR, WACV.
- Co-host Advml Workshop at CVPR 2025.






