🧑 About me

I am currently a first-year Ph.D. student at Nanyang Technological University, working under the supervision of Prof. Ziwei Wang in PINE Lab. I received the Master degree at Tsinghua University, where I was advised by Prof. Haoqian Wang. Prior to that, I received my B.Eng. degree in Electronic and Information Engineering from Tongji University.

My current research focuses on Embodied AI, with particular emphasis on world models and vision-language models (VLMs) for robotics.

github / google scholar / cv

🔥 News

2026.06: 🎉🎉 1 paper accepted to IROS 2026 !!!
2026.04: 🎉🎉 1 paper accepted to SIGGRAPH 2026 !!!
2026.03: 🎉🎉 1 paper accepted to CVPR 2026 !!!
2024.12: 🎉🎉 1 paper accepted to AAAI 2025 !!!

💻 Experience

09/2025~03/2026, Work at the ByteDance Seed Robotics.
10/2024~03/2025, Work at the MegviiTech.
05/2023~10/2024, Work at the Tencent Robotics X.

📝 Publications

IROS 2026

RoDyn: Taming Interactive Robot-Dynamic 2.5D World Model for Robotic Manipulation
Chuanrui Zhang, Zhengxian Wu, Guanxing Lu, Yansong Tang, Ziwei Wang
Website Paper Code Coming

- We present RoDyn, a Robot-Dynamic 2.5D world model that bridges the inference efficiency of 2D video models with the spatial awareness of 3D models, establishing SOTA generation fidelity and improving real-world imitation learning success rates by over 42%.

SIGGRAPH 2026

SimArt: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM
Chuanrui Zhang, Minghan Qin†, Yuang Wang, Baifeng Xie, Hang Li, Ziwei Wang†
Website Paper

- SIMART is a unified multimodal large language model framework that converts static 3D meshes into simulation-ready articulated assets.

CVPR 2026

UniPR: Unified Object-level Real-to-Sim Perception and Reconstruction from a Single Stereo Pair
Chuanrui Zhang*, Yingshuang Zou*, Zhengxian Wu, Yonggen Ling†✉, YuxiaoYang, Ziwei Wang✉
Website Paper Code

- We present UniPR, an end-to-end stereo framework that unifies 6D pose estimation and metric-scale 3D shape reconstruction, achieving up to 100× faster generation and 3× better shape-proportion accuracy for real-to-sim robotic manipulation.

AAAI 2026

LMGait: Language-Guided and Motion-Aware Gait Representation for Generalizable Recognition
Zhengxian Wu*, Chuanrui Zhang*, Shenao Jiang*, Hangrui Xu, Zirui Liao, Luyuan Zhang, Huaqiu Li, Peng Jiao, Haoqian Wang
Website Paper Code

- We present LMGait, a Language-Guided and Motion-Aware Gait Representation.

BMVC 2025

MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction
Yingshuang Zou*, Yikang Ding*, Chuanrui Zhang, Xiaoyang Lyu, Feiyang Tan, Xiaojuan Qi, Haoqian Wang†
Website / Paper / Code

- We present MuDG, a controllable Multi-modal Diffusion model with Gaussian Splatting (GS) for Urban Scene Reconstruction.

AAAI 2025

TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers
Chuanrui Zhang*, Yingshuang Zou*, Zhuoling Li, Minmin Yi, Haoqian Wang†
Website Paper Code

- We present TranSplat, a transformer-based approach for generalizable 3D gaussian splatting from sparse multi-view images.

ICME 2025 Oral

DAGait: Generalized Skeleton-Guided Data Alignment for Gait Recognition
Zhengxian Wu*, Chuanrui Zhang*, Hangrui Xu, Peng Jiao, Haoqian Wang
Website Paper Code

- We present DAGait, a universal data alignment strategy for gait recognition, to alleviate spatiotemporal distribution inconsistencies.

ECCV 2024

Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images
Chuanrui Zhang* , Yonggen Ling*†, Minglei Lu, Minghan Qin, Haoqian Wang†
Website Paper Code

- We present CODERS, a one-stage approach for Category-level Object Detection, pose Estimation and Reconstruction from Stereo images.

📄 Preprint Papers

Arxiv

VoxelFormer: Bird’s-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection
Zhuoling Li*, Chuanrui Zhang*, Wei-Chiu Ma, Yipin Zhou, Linyan Huang, Haoqian Wang†, SerNam Lim, Hengshuang Zhao†
Paper Code

- We introduce Dual-View Attention, a more effecient cross attention for Multi-view 3D Object Detection.

🏆 Honors and Awards

Scholarship, Tsinghua University, 2023.
Excellent Graduates, Shanghai, 2022.
Scholarship, Tongji University, 2019-2022.
Nation Scholarship, 2018.

Chuanrui Zhang