Hi there, this is Xingrui! 👋
I am a PhD student in the Computer Science Department at Johns Hopkins University, advised by Prof. Alan Yuille. I am currently also interning with the GenAI group at AMD.
Before JHU, I obtained my Master’s degree from the University of Southern California, where I worked closely with Prof. Laurent Itti. Prior to USC, I received my B.S. in Statistics from Renmin University of China, where I was supervised by Prof. Hanfang Yang. I also spent time in the AI lab at the Samsung R&D Institute, supervised by Dr. Yang Liu.
✉️: xwang378@jhu.edu
📣 News
- Invited talk at BEAM Workshop at CVPR 2025 (slides). (Jun 2025)
- Spatial457 accepted at CVPR 2025 as a highlight paper! (Feb 2025)
- One paper accepted at ICLR 2025. (Jan 2025)
- Introduced NS-4DPhysics and DynSuperCLEVR: a 4D neural symbolic model and benchmark for dynamic spatial reasoning. (May 2024)
- One paper accepted by NeurIPS 2023. (Sep 2023)
- Joined CCVL at JHU as a PhD student. (Aug 2023)
- SuperCLEVR selected as a highlight in CVPR 2023. (Mar 2023)
- One paper accepted at CVPR 2023. (Feb 2023)
Research Interest
My major research interests include 3D scene understanding, particularly in estimating objects’ 3D location and orientation using 3D generative models. Another key direction is 3D spatial reasoning for vision-language models (VLMs), where I am making progress in developing VLMs or benchmarks for 3D and 4D reasoning tasks.
I am also exploring generative models, including video generation and 3D objects generation.
Selected Publications
(Full list available on Google Scholar)





Service
- Invited reviewer for CVPR, ICCV, NeurIPS, ICLR.
- Co-host Advml Workshop at CVPR 2025.