Tuo Liang

PhD Student

Tuo Liang

Tuo Liang

PhD Student

Computer Vision, Vision-Language Models, and Visual Reasoning.

Email: tuo.liang@case.edu

Personal page: https://tuo-liang.github.io/

Google Scholar: Profile

Location: VU Lab, Case Western Reserve University, Cleveland, USA

Team page: Back to Team

Tuo Liang is a Ph.D. student in Computer Science at Case Western Reserve University (CWRU), advised by Prof. Yu Yin. He received his M.S. from Case Western Reserve University, Cleveland, OH and his B.A. from Sun Yat-Sen University (SYSU), GuangZhou, China. His research interests include vision-language models, VLM Reasoning, and multimodal learning.

Education

  • B.S. in Math (Information and Computing Science), Sun Yat-sen University, GuangZhou (2018-2022)
  • M.S. in Computer Science, Case Western Reserve University, Cleveland (2023-2025)
  • Ph.D. Student, Case Western Reserve University, Cleveland (2025-Now)

Research Interests

  • Computer Vision
  • Multi-modality learning
  • Spatial Intelligence
  • Visual Reasoning

Selected Publications

  • Zhe Hu*, Tuo Liang*, Jing Li, Yiren Lu, Yunlai Zhou, Yiran Qiao, Jing Ma, and Yu Yin, “Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions”, NeurIPS 2024 (Oral).

  • Tuo Liang*, Zhe Hu*, Jing Li, H Zhang, Y Lu, Y Zhou, Y Qiao, D Liu, J Peng, J Ma, “When ‘YES’ Meets ‘BUT’: Can Large Models Comprehend Contradictory Humor Through Comparative Reasoning?”, Submitted to TPAMI.

Experience

  • Machine Learning Researcher (Intern), Electric-Wisdom Technology Co. (Feb 2023 – Jul 2023)

Academic Service

  • Conference Reviewer (ICLR, ACL, NeurIPS)
  • Journal Reviewer (TPAMI)

Publications

  1. Assessing LLMs for Serendipity Discovery in Knowledge Graphs: A Case for Drug Repurposing.
    Meng Wang, Chang Ma, Aoran Jiao, Tuo Liang, Pengfei Lu, Saanvi Hegde, Yu Yin and others.
    In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 19, 2026.

    @inproceedings{wang2026serendipity,
      title = {Assessing LLMs for Serendipity Discovery in Knowledge Graphs: A Case for Drug Repurposing},
      author = {Wang, Meng and Ma, Chang and Jiao, Aoran and Liang, Tuo and Lu, Pengfei and Hegde, Saanvi and Yin, Yu and others},
      booktitle = {Proceedings of the AAAI Conference on Artificial Intelligence},
      volume = {40},
      number = {19},
      year = {2026},
      status = {accepted},
      pdf = {https://ojs.aaai.org/index.php/AAAI/article/view/38618}
    }
    
  2. Spatial Intelligence in Vision-Language Models: A Comprehensive Survey.
    Disheng Liu, Tuo Liang, Zhe Hu, Jierui Peng, Yiren Lu, Yi Xu, Yun Fu and Yu Yin.
    In TechRxiv, 2026.

    @article{liu2026spatial,
      title = {Spatial Intelligence in Vision-Language Models: A Comprehensive Survey},
      author = {Liu, Disheng and Liang, Tuo and Hu, Zhe and Peng, Jierui and Lu, Yiren and Xu, Yi and Fu, Yun and Yin, Yu},
      journal = {TechRxiv},
      year = {2026},
      status = {preprint},
      pdf = {https://www.techrxiv.org/doi/full/10.36227/techrxiv.176231405.57942913/v2},
      website = {https://dishengll.github.io/Awesome-Spatial-VLMs/}
    }
    
  3. HugRAG: Hierarchical Causal Knowledge Graph Design for RAG.
    Nian Wang, Tuo Liang, Varun Singh, Chaoda Song, Vivian Yang, Yu Yin, Jing Ma, Jaideep Singh and others.
    In arXiv preprint arXiv:2602.05143, 2026.

    @article{wang2026hugrag,
      title = {HugRAG: Hierarchical Causal Knowledge Graph Design for RAG},
      author = {Wang, Nian and Liang, Tuo and Singh, Varun and Song, Chaoda and Yang, Vivian and Yin, Yu and Ma, Jing and Singh, Jaideep and others},
      journal = {arXiv preprint arXiv:2602.05143},
      year = {2026},
      status = {preprint},
      pdf = {https://arxiv.org/pdf/2602.05143.pdf}
    }
    
  4. Nebula: Do we Evaluate Vision-Language-Action Agents Correctly?
    Jierui Peng, Yanyan Zhang, Yicheng Duan, Tuo Liang, Vipin Chaudhary and Yu Yin.
    In arXiv preprint arXiv:2510.16263, 2025.

    @article{peng2025nebula,
      title = {Nebula: Do we Evaluate Vision-Language-Action Agents Correctly?},
      author = {Peng, Jierui and Zhang, Yanyan and Duan, Yicheng and Liang, Tuo and Chaudhary, Vipin and Yin, Yu},
      journal = {arXiv preprint arXiv:2510.16263},
      year = {2025},
      status = {preprint},
      pdf = {https://arxiv.org/pdf/2510.16263.pdf}
    }
    
  5. ResSVD: Residual Compensated SVD for Large Language Model Compression.
    Hongyu Bai, Shuo Jian, Tuo Liang, Yu Yin and Huan Wang.
    In Conference on Parsimony and Learning (CPAL), 2026.

    @article{bai2025ressvd,
      title = {ResSVD: Residual Compensated SVD for Large Language Model Compression},
      author = {Bai, Hongyu and Jian, Shuo and Liang, Tuo and Yin, Yu and Wang, Huan},
      journal = {Conference on Parsimony and Learning (CPAL)},
      year = {2026},
      status = {accepted},
      pdf = {https://arxiv.org/pdf/2505.20112.pdf}
    }
    
  6. When ’YES’ Meets ’BUT’: Can Large Models Comprehend Contradictory Humor Through Comparative Reasoning?
    Tuo Liang, Zhe Hu, Jing Li, Hao Zhang, Yiren Lu, Yunlai Zhou, Yiran Qiao, Disheng Liu, Jierui Peng, Jing Ma and others.
    In arXiv preprint arXiv:2503.23137, 2025.

    @article{liang2025yesbut,
      title = {When 'YES' Meets 'BUT': Can Large Models Comprehend Contradictory Humor Through Comparative Reasoning?},
      author = {Liang, Tuo and Hu, Zhe and Li, Jing and Zhang, Hao and Lu, Yiren and Zhou, Yunlai and Qiao, Yiran and Liu, Disheng and Peng, Jierui and Ma, Jing and others},
      journal = {arXiv preprint arXiv:2503.23137},
      year = {2025},
      status = {preprint},
      pdf = {https://arxiv.org/pdf/2503.23137.pdf}
    }
    
  7. Causal3D: A Comprehensive Benchmark for Causal Learning from Visual Data.
    Disheng Liu, Yiran Qiao, Wuche Liu, Yiren Lu, Yunlai Zhou, Tuo Liang, Yu Yin and Jing Ma.
    In arXiv preprint arXiv:2503.04852, 2025.

    @article{liu2025causal3d,
      title = {Causal3D: A Comprehensive Benchmark for Causal Learning from Visual Data},
      author = {Liu, Disheng and Qiao, Yiran and Liu, Wuche and Lu, Yiren and Zhou, Yunlai and Liang, Tuo and Yin, Yu and Ma, Jing},
      journal = {arXiv preprint arXiv:2503.04852},
      year = {2025},
      status = {preprint},
      pdf = {https://arxiv.org/pdf/2503.04852.pdf},
      data = {https://huggingface.co/datasets/LLDDSS/Causal3D_Dataset}
    }
    
  8. Segment then Splat: Unified 3D Open-Vocabulary Segmentation via Gaussian Splatting.
    Yiren Lu, Yunlai Zhou, Yiran Qiao, Chaoda Song, Tuo Liang, Jing Ma, Huan Wang and Yu Yin.
    In Advances in Neural Information Processing Systems (NeurIPS), 2025.

    @article{lu2025segment,
      title = {Segment then Splat: Unified 3D Open-Vocabulary Segmentation via Gaussian Splatting},
      author = {Lu, Yiren and Zhou, Yunlai and Qiao, Yiran and Song, Chaoda and Liang, Tuo and Ma, Jing and Wang, Huan and Yin, Yu},
      journal = {Advances in Neural Information Processing Systems (NeurIPS)},
      year = {2025},
      status = {accepted},
      pdf = {https://arxiv.org/pdf/2503.22204v2},
      website = {https://yiren-lu.com/project_pages/Segment-then-Splat/},
      code = {https://github.com/luyr/Segment-then-Splat}
    }
    
  9. BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting.
    Yiren Lu, Yunlai Zhou, Disheng Liu, Tuo Liang and Yu Yin.
    In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), pp. 16532–16542, 2025.

    @inproceedings{lu2025bard,
      title = {BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting},
      author = {Lu, Yiren and Zhou, Yunlai and Liu, Disheng and Liang, Tuo and Yin, Yu},
      booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)},
      pages = {16532--16542},
      year = {2025},
      status = {accepted},
      pdf = {https://arxiv.org/pdf/2503.15835},
      website = {https://yiren-lu.com/project_pages/BARD-GS/},
      code = {https://github.com/luyr/BARD-GS},
      data = {https://drive.google.com/drive/u/0/folders/1CRBQ_HR3yKhT3G9_ttTWA1PWXWL6DtsV}
    }
    
  10. Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions.
    Zhe Hu, Tuo Liang, Jing Li, Yiren Lu, Yunlai Zhou, Yiran Qiao, Jing Ma and Yu Yin.
    In Advances in Neural Information Processing Systems (NeurIPS), vol. 37, pp. 47166–47188, 2024.

    @article{hu2024cracking,
      title = {Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions},
      author = {Hu, Zhe and Liang, Tuo and Li, Jing and Lu, Yiren and Zhou, Yunlai and Qiao, Yiran and Ma, Jing and Yin, Yu},
      journal = {Advances in Neural Information Processing Systems (NeurIPS)},
      volume = {37},
      pages = {47166--47188},
      year = {2024},
      status = {accepted},
      pdf = {https://openreview.net/pdf?id=bCMpdaQCNW},
      website = {https://vulab-ai.github.io/YESBUT_Homepage/},
      dataset = {https://huggingface.co/datasets/zhehuderek/YESBUT_Benchmark},
      code = {https://github.com/Derekkk/VIVA_EMNLP24}
    }