Disheng Liu

PhD Student

Disheng Liu

Disheng Liu

PhD Student

Computer Vision, Vision-Language Models, and Spatial Intelligence.

Email: disheng.liu@case.edu

Personal page: https://dishengll.github.io/

Google Scholar: Profile

Location: VU Lab, Case Western Reserve University, Cleveland, USA

Team page: Back to Team

Disheng Liu is a second-year Ph.D. student in Computer Science at Case Western Reserve University (CWRU), advised by Prof. Yu Yin. His research focuses on Computer Vision and Vision-Language Models, with an emphasis on advancing spatial intelligence in the next generation of AI systems.

Education

  • B.S. in Computing and Information Science, Guangdong University of Technology
  • M.S. in Information Science, University of Pittsburgh (2022)
  • Visiting Student, ShanghaiTech University

Research Interests

  • Computer Vision
  • Vision-Language Models
  • Spatial Intelligence

Selected Publications

  • Spatial Intelligence in Vision-Language Models: A Comprehensive Survey (2025)
  • Balancing Fidelity and Diversity: Synthetic data could stand on the shoulder of the real in visual recognition (2025)
  • CAUSAL3D: A Comprehensive Benchmark for Causal Learning from Visual Data (2025)

Experience

  • Research Intern, ShanghaiTech IDEA Lab (2023–2024)
  • Algorithm Engineer, Yinwang Intelligent Technology (2022–2023)

Academic Service

  • Reviewer, ICLR 2026
  • Reviewer, CVPR 2026

Publications

  1. GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning.
    Yiren Lu, Yi Du, Disheng Liu, Yunlai Zhou, Chen Wang and Yu Yin.
    In arXiv preprint arXiv:2603.19137, 2026.

    @article{lu2026gsmem,
      title = {GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning},
      author = {Lu, Yiren and Du, Yi and Liu, Disheng and Zhou, Yunlai and Wang, Chen and Yin, Yu},
      journal = {arXiv preprint arXiv:2603.19137},
      year = {2026},
      status = {preprint},
      pdf = {https://arxiv.org/pdf/2603.19137.pdf},
      website = {https://yiren-lu.com/project_pages/GSMem/}
    }
    
  2. Spatial Intelligence in Vision-Language Models: A Comprehensive Survey.
    Disheng Liu, Tuo Liang, Zhe Hu, Jierui Peng, Yiren Lu, Yi Xu, Yun Fu and Yu Yin.
    In TechRxiv, 2026.

    @article{liu2026spatial,
      title = {Spatial Intelligence in Vision-Language Models: A Comprehensive Survey},
      author = {Liu, Disheng and Liang, Tuo and Hu, Zhe and Peng, Jierui and Lu, Yiren and Xu, Yi and Fu, Yun and Yin, Yu},
      journal = {TechRxiv},
      year = {2026},
      status = {preprint},
      pdf = {https://www.techrxiv.org/doi/full/10.36227/techrxiv.176231405.57942913/v2},
      website = {https://dishengll.github.io/Awesome-Spatial-VLMs/}
    }
    
  3. Counterfactual Visual Explanation via Causally-Guided Adversarial Steering.
    Yiran Qiao, Disheng Liu, Yiren Lu, Yu Yin, Mengnan Du and Jing Ma.
    In arXiv preprint arXiv:2507.09881, 2025.

    @article{qiao2025counterfactual,
      title = {Counterfactual Visual Explanation via Causally-Guided Adversarial Steering},
      author = {Qiao, Yiran and Liu, Disheng and Lu, Yiren and Yin, Yu and Du, Mengnan and Ma, Jing},
      journal = {arXiv preprint arXiv:2507.09881},
      year = {2025},
      status = {preprint},
      pdf = {https://arxiv.org/pdf/2507.09881.pdf}
    }
    
  4. When ’YES’ Meets ’BUT’: Can Large Models Comprehend Contradictory Humor Through Comparative Reasoning?
    Tuo Liang, Zhe Hu, Jing Li, Hao Zhang, Yiren Lu, Yunlai Zhou, Yiran Qiao, Disheng Liu, Jierui Peng, Jing Ma and others.
    In arXiv preprint arXiv:2503.23137, 2025.

    @article{liang2025yesbut,
      title = {When 'YES' Meets 'BUT': Can Large Models Comprehend Contradictory Humor Through Comparative Reasoning?},
      author = {Liang, Tuo and Hu, Zhe and Li, Jing and Zhang, Hao and Lu, Yiren and Zhou, Yunlai and Qiao, Yiran and Liu, Disheng and Peng, Jierui and Ma, Jing and others},
      journal = {arXiv preprint arXiv:2503.23137},
      year = {2025},
      status = {preprint},
      pdf = {https://arxiv.org/pdf/2503.23137.pdf}
    }
    
  5. Causal3D: A Comprehensive Benchmark for Causal Learning from Visual Data.
    Disheng Liu, Yiran Qiao, Wuche Liu, Yiren Lu, Yunlai Zhou, Tuo Liang, Yu Yin and Jing Ma.
    In arXiv preprint arXiv:2503.04852, 2025.

    @article{liu2025causal3d,
      title = {Causal3D: A Comprehensive Benchmark for Causal Learning from Visual Data},
      author = {Liu, Disheng and Qiao, Yiran and Liu, Wuche and Lu, Yiren and Zhou, Yunlai and Liang, Tuo and Yin, Yu and Ma, Jing},
      journal = {arXiv preprint arXiv:2503.04852},
      year = {2025},
      status = {preprint},
      pdf = {https://arxiv.org/pdf/2503.04852.pdf},
      data = {https://huggingface.co/datasets/LLDDSS/Causal3D_Dataset}
    }
    
  6. BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting.
    Yiren Lu, Yunlai Zhou, Disheng Liu, Tuo Liang and Yu Yin.
    In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), pp. 16532–16542, 2025.

    @inproceedings{lu2025bard,
      title = {BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting},
      author = {Lu, Yiren and Zhou, Yunlai and Liu, Disheng and Liang, Tuo and Yin, Yu},
      booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)},
      pages = {16532--16542},
      year = {2025},
      status = {accepted},
      pdf = {https://arxiv.org/pdf/2503.15835},
      website = {https://yiren-lu.com/project_pages/BARD-GS/},
      code = {https://github.com/luyr/BARD-GS},
      data = {https://drive.google.com/drive/u/0/folders/1CRBQ_HR3yKhT3G9_ttTWA1PWXWL6DtsV}
    }