Disheng Liu
PhD Student
Computer Vision, Vision-Language Models, and Spatial Intelligence.
Disheng Liu is a second-year Ph.D. student in Computer Science at Case Western Reserve University (CWRU), advised by Prof. Yu Yin. His research focuses on Computer Vision and Vision-Language Models, with an emphasis on advancing spatial intelligence in the next generation of AI systems.
Education
- B.S. in Computing and Information Science, Guangdong University of Technology
- M.S. in Information Science, University of Pittsburgh (2022)
- Visiting Student, ShanghaiTech University
Research Interests
- Computer Vision
- Vision-Language Models
- Spatial Intelligence
Selected Publications
- Spatial Intelligence in Vision-Language Models: A Comprehensive Survey (2025)
- Balancing Fidelity and Diversity: Synthetic data could stand on the shoulder of the real in visual recognition (2025)
- CAUSAL3D: A Comprehensive Benchmark for Causal Learning from Visual Data (2025)
Experience
- Research Intern, ShanghaiTech IDEA Lab (2023–2024)
- Algorithm Engineer, Yinwang Intelligent Technology (2022–2023)
Academic Service
- Reviewer, ICLR 2026
- Reviewer, CVPR 2026
Publications
- GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning.In arXiv preprint arXiv:2603.19137, 2026.
@article{lu2026gsmem, title = {GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning}, author = {Lu, Yiren and Du, Yi and Liu, Disheng and Zhou, Yunlai and Wang, Chen and Yin, Yu}, journal = {arXiv preprint arXiv:2603.19137}, year = {2026}, status = {preprint}, pdf = {https://arxiv.org/pdf/2603.19137.pdf}, website = {https://yiren-lu.com/project_pages/GSMem/} } -
@article{liu2026spatial, title = {Spatial Intelligence in Vision-Language Models: A Comprehensive Survey}, author = {Liu, Disheng and Liang, Tuo and Hu, Zhe and Peng, Jierui and Lu, Yiren and Xu, Yi and Fu, Yun and Yin, Yu}, journal = {TechRxiv}, year = {2026}, status = {preprint}, pdf = {https://www.techrxiv.org/doi/full/10.36227/techrxiv.176231405.57942913/v2}, website = {https://github.com/vulab-AI/Awesome-Spatial-VLMs} } - When ’YES’ Meets ’BUT’: Can AI Comprehend Contradictory Humor in Comics?In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2026.(Impact Factor: 20.4)
@article{liang2026yesbut, title = {When 'YES' Meets 'BUT': Can AI Comprehend Contradictory Humor in Comics?}, author = {Liang, Tuo and Hu, Zhe and Li, Jing and Zhang, Hao and Lu, Yiren and Zhou, Yunlai and Qiao, Yiran and Liu, Disheng and Peng, Jierui and Ma, Jing and Yin, Yu}, journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)}, year = {2026}, doi = {10.1109/TPAMI.2026.3688191}, note = {Impact Factor: 20.4}, status = {accepted}, pdf = {https://arxiv.org/pdf/2503.23137.pdf}, website = {/projects/yesbut-v2/}, data = {https://huggingface.co/datasets/zhehuderek/YESBUT_Benchmark} } - BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting.In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), pp. 16532–16542, 2025.
@inproceedings{lu2025bard, title = {BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting}, author = {Lu, Yiren and Zhou, Yunlai and Liu, Disheng and Liang, Tuo and Yin, Yu}, booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)}, pages = {16532--16542}, year = {2025}, status = {accepted}, pdf = {https://arxiv.org/pdf/2503.15835}, website = {https://yiren-lu.com/project_pages/BARD-GS/}, code = {https://github.com/luyr/BARD-GS}, data = {https://drive.google.com/drive/u/0/folders/1CRBQ_HR3yKhT3G9_ttTWA1PWXWL6DtsV} } - Counterfactual Visual Explanation via Causally-Guided Adversarial Steering.In arXiv preprint arXiv:2507.09881, 2025.
@article{qiao2025counterfactual, title = {Counterfactual Visual Explanation via Causally-Guided Adversarial Steering}, author = {Qiao, Yiran and Liu, Disheng and Lu, Yiren and Yin, Yu and Du, Mengnan and Ma, Jing}, journal = {arXiv preprint arXiv:2507.09881}, year = {2025}, status = {preprint}, pdf = {https://arxiv.org/pdf/2507.09881.pdf} } - Causal3D: A Comprehensive Benchmark for Causal Learning from Visual Data.In arXiv preprint arXiv:2503.04852, 2025.
@article{liu2025causal3d, title = {Causal3D: A Comprehensive Benchmark for Causal Learning from Visual Data}, author = {Liu, Disheng and Qiao, Yiran and Liu, Wuche and Lu, Yiren and Zhou, Yunlai and Liang, Tuo and Yin, Yu and Ma, Jing}, journal = {arXiv preprint arXiv:2503.04852}, year = {2025}, status = {preprint}, pdf = {https://arxiv.org/pdf/2503.04852.pdf}, data = {https://huggingface.co/datasets/LLDDSS/Causal3D_Dataset} }