Disheng Liu

Personal page: https://dishengll.github.io/

Google Scholar: Profile

Location: VU Lab, Case Western Reserve University, Cleveland, USA

Disheng Liu is a second-year Ph.D. student in Computer Science at Case Western Reserve University (CWRU), advised by Prof. Yu Yin. His research focuses on Computer Vision and Vision-Language Models, with an emphasis on advancing spatial intelligence in the next generation of AI systems.

Education

B.S. in Computing and Information Science, Guangdong University of Technology
M.S. in Information Science, University of Pittsburgh (2022)
Visiting Student, ShanghaiTech University

Research Interests

Computer Vision
Vision-Language Models
Spatial Intelligence

Selected Publications

Spatial Intelligence in Vision-Language Models: A Comprehensive Survey (2025)
Balancing Fidelity and Diversity: Synthetic data could stand on the shoulder of the real in visual recognition (2025)
CAUSAL3D: A Comprehensive Benchmark for Causal Learning from Visual Data (2025)

Experience

Research Intern, ShanghaiTech IDEA Lab (2023–2024)
Algorithm Engineer, Yinwang Intelligent Technology (2022–2023)

Academic Service

Reviewer, ICLR 2026
Reviewer, CVPR 2026

Publications

GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning.
Yiren Lu, Yi Du, Disheng Liu, Yunlai Zhou, Chen Wang and Yu Yin.
In arXiv preprint arXiv:2603.19137, 2026.
```
@article{lu2026gsmem,
  title = {GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning},
  author = {Lu, Yiren and Du, Yi and Liu, Disheng and Zhou, Yunlai and Wang, Chen and Yin, Yu},
  journal = {arXiv preprint arXiv:2603.19137},
  year = {2026},
  status = {preprint},
  pdf = {https://arxiv.org/pdf/2603.19137.pdf},
  website = {https://yiren-lu.com/project_pages/GSMem/}
}
```
Spatial Intelligence in Vision-Language Models: A Comprehensive Survey.
Disheng Liu, Tuo Liang, Zhe Hu, Jierui Peng, Yiren Lu, Yi Xu, Yun Fu and Yu Yin.
In TechRxiv, 2026.
```
@article{liu2026spatial,
  title = {Spatial Intelligence in Vision-Language Models: A Comprehensive Survey},
  author = {Liu, Disheng and Liang, Tuo and Hu, Zhe and Peng, Jierui and Lu, Yiren and Xu, Yi and Fu, Yun and Yin, Yu},
  journal = {TechRxiv},
  year = {2026},
  status = {preprint},
  pdf = {https://www.techrxiv.org/doi/full/10.36227/techrxiv.176231405.57942913/v2},
  website = {https://github.com/vulab-AI/Awesome-Spatial-VLMs}
}
```
When ’YES’ Meets ’BUT’: Can AI Comprehend Contradictory Humor in Comics?
Tuo Liang, Zhe Hu, Jing Li, Hao Zhang, Yiren Lu, Yunlai Zhou, Yiran Qiao, Disheng Liu, Jierui Peng, Jing Ma and Yu Yin.
In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2026.
(Impact Factor: 20.4)
```
@article{liang2026yesbut,
  title = {When 'YES' Meets 'BUT': Can AI Comprehend Contradictory Humor in Comics?},
  author = {Liang, Tuo and Hu, Zhe and Li, Jing and Zhang, Hao and Lu, Yiren and Zhou, Yunlai and Qiao, Yiran and Liu, Disheng and Peng, Jierui and Ma, Jing and Yin, Yu},
  journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)},
  year = {2026},
  doi = {10.1109/TPAMI.2026.3688191},
  note = {Impact Factor: 20.4},
  status = {accepted},
  pdf = {https://arxiv.org/pdf/2503.23137.pdf},
  website = {/projects/yesbut-v2/},
  data = {https://huggingface.co/datasets/zhehuderek/YESBUT_Benchmark}
}
```
BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting.
Yiren Lu, Yunlai Zhou, Disheng Liu, Tuo Liang and Yu Yin.
In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), pp. 16532–16542, 2025.
```
@inproceedings{lu2025bard,
  title = {BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting},
  author = {Lu, Yiren and Zhou, Yunlai and Liu, Disheng and Liang, Tuo and Yin, Yu},
  booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)},
  pages = {16532--16542},
  year = {2025},
  status = {accepted},
  pdf = {https://arxiv.org/pdf/2503.15835},
  website = {https://yiren-lu.com/project_pages/BARD-GS/},
  code = {https://github.com/luyr/BARD-GS},
  data = {https://drive.google.com/drive/u/0/folders/1CRBQ_HR3yKhT3G9_ttTWA1PWXWL6DtsV}
}
```
Counterfactual Visual Explanation via Causally-Guided Adversarial Steering.
Yiran Qiao, Disheng Liu, Yiren Lu, Yu Yin, Mengnan Du and Jing Ma.
In arXiv preprint arXiv:2507.09881, 2025.
```
@article{qiao2025counterfactual,
  title = {Counterfactual Visual Explanation via Causally-Guided Adversarial Steering},
  author = {Qiao, Yiran and Liu, Disheng and Lu, Yiren and Yin, Yu and Du, Mengnan and Ma, Jing},
  journal = {arXiv preprint arXiv:2507.09881},
  year = {2025},
  status = {preprint},
  pdf = {https://arxiv.org/pdf/2507.09881.pdf}
}
```
Causal3D: A Comprehensive Benchmark for Causal Learning from Visual Data.
Disheng Liu, Yiran Qiao, Wuche Liu, Yiren Lu, Yunlai Zhou, Tuo Liang, Yu Yin and Jing Ma.
In arXiv preprint arXiv:2503.04852, 2025.
```
@article{liu2025causal3d,
  title = {Causal3D: A Comprehensive Benchmark for Causal Learning from Visual Data},
  author = {Liu, Disheng and Qiao, Yiran and Liu, Wuche and Lu, Yiren and Zhou, Yunlai and Liang, Tuo and Yin, Yu and Ma, Jing},
  journal = {arXiv preprint arXiv:2503.04852},
  year = {2025},
  status = {preprint},
  pdf = {https://arxiv.org/pdf/2503.04852.pdf},
  data = {https://huggingface.co/datasets/LLDDSS/Causal3D_Dataset}
}
```