Disheng Liu
PhD Student
Computer Vision, Vision-Language Models, and Spatial Intelligence.
Disheng Liu is a second-year Ph.D. student in Computer Science at Case Western Reserve University (CWRU), advised by Prof. Yu Yin. His research focuses on Computer Vision and Vision-Language Models, with an emphasis on advancing spatial intelligence in the next generation of AI systems.
Education
- B.S. in Computing and Information Science, Guangdong University of Technology
- M.S. in Information Science, University of Pittsburgh (2022)
- Visiting Student, ShanghaiTech University
Research Interests
- Computer Vision
- Vision-Language Models
- Spatial Intelligence
Selected Publications
- Spatial Intelligence in Vision-Language Models: A Comprehensive Survey (2025)
- Balancing Fidelity and Diversity: Synthetic data could stand on the shoulder of the real in visual recognition (2025)
- CAUSAL3D: A Comprehensive Benchmark for Causal Learning from Visual Data (2025)
Experience
- Research Intern, ShanghaiTech IDEA Lab (2023–2024)
- Algorithm Engineer, Yinwang Intelligent Technology (2022–2023)
Academic Service
- Reviewer, ICLR 2026
- Reviewer, CVPR 2026
Publications
- GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning.In arXiv preprint arXiv:2603.19137, 2026.
@article{lu2026gsmem, title = {GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning}, author = {Lu, Yiren and Du, Yi and Liu, Disheng and Zhou, Yunlai and Wang, Chen and Yin, Yu}, journal = {arXiv preprint arXiv:2603.19137}, year = {2026}, status = {preprint}, pdf = {https://arxiv.org/pdf/2603.19137.pdf}, website = {https://yiren-lu.com/project_pages/GSMem/} } -
@article{liu2026spatial, title = {Spatial Intelligence in Vision-Language Models: A Comprehensive Survey}, author = {Liu, Disheng and Liang, Tuo and Hu, Zhe and Peng, Jierui and Lu, Yiren and Xu, Yi and Fu, Yun and Yin, Yu}, journal = {TechRxiv}, year = {2026}, status = {preprint}, pdf = {https://www.techrxiv.org/doi/full/10.36227/techrxiv.176231405.57942913/v2}, website = {https://dishengll.github.io/Awesome-Spatial-VLMs/} } - Counterfactual Visual Explanation via Causally-Guided Adversarial Steering.In arXiv preprint arXiv:2507.09881, 2025.
@article{qiao2025counterfactual, title = {Counterfactual Visual Explanation via Causally-Guided Adversarial Steering}, author = {Qiao, Yiran and Liu, Disheng and Lu, Yiren and Yin, Yu and Du, Mengnan and Ma, Jing}, journal = {arXiv preprint arXiv:2507.09881}, year = {2025}, status = {preprint}, pdf = {https://arxiv.org/pdf/2507.09881.pdf} } - When ’YES’ Meets ’BUT’: Can Large Models Comprehend Contradictory Humor Through Comparative Reasoning?In arXiv preprint arXiv:2503.23137, 2025.
@article{liang2025yesbut, title = {When 'YES' Meets 'BUT': Can Large Models Comprehend Contradictory Humor Through Comparative Reasoning?}, author = {Liang, Tuo and Hu, Zhe and Li, Jing and Zhang, Hao and Lu, Yiren and Zhou, Yunlai and Qiao, Yiran and Liu, Disheng and Peng, Jierui and Ma, Jing and others}, journal = {arXiv preprint arXiv:2503.23137}, year = {2025}, status = {preprint}, pdf = {https://arxiv.org/pdf/2503.23137.pdf} } - Causal3D: A Comprehensive Benchmark for Causal Learning from Visual Data.In arXiv preprint arXiv:2503.04852, 2025.
@article{liu2025causal3d, title = {Causal3D: A Comprehensive Benchmark for Causal Learning from Visual Data}, author = {Liu, Disheng and Qiao, Yiran and Liu, Wuche and Lu, Yiren and Zhou, Yunlai and Liang, Tuo and Yin, Yu and Ma, Jing}, journal = {arXiv preprint arXiv:2503.04852}, year = {2025}, status = {preprint}, pdf = {https://arxiv.org/pdf/2503.04852.pdf}, data = {https://huggingface.co/datasets/LLDDSS/Causal3D_Dataset} } - BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting.In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), pp. 16532–16542, 2025.
@inproceedings{lu2025bard, title = {BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting}, author = {Lu, Yiren and Zhou, Yunlai and Liu, Disheng and Liang, Tuo and Yin, Yu}, booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)}, pages = {16532--16542}, year = {2025}, status = {accepted}, pdf = {https://arxiv.org/pdf/2503.15835}, website = {https://yiren-lu.com/project_pages/BARD-GS/}, code = {https://github.com/luyr/BARD-GS}, data = {https://drive.google.com/drive/u/0/folders/1CRBQ_HR3yKhT3G9_ttTWA1PWXWL6DtsV} }