Yu Yin
Assistant Professor
Computer Vision, 3D Vision, Multimodal Learning, Embodied AI
I am a tenure-track Assistant Professor in the Department of Computer & Data Science at Case Western Reserve University, where I lead the VU Lab. My research focuses on computer vision and 3D vision, multimodal large language models (MLLMs), and embodied AI systems, with the goal of building spatially grounded AI systems that can perceive, reason, and act in complex real-world environments.
At VU Lab, we study 3D vision and spatial representation, including Gaussian splatting (3DGS) and Nerual radiance fields (NeRF), with representative projects such as Reconstruction Matters, Segment then Splat, BARD-GS, and NeRFInvertor. We also investigate spatial intelligence for vision-language and embodied systems, including GSMem and our Spatial Intelligence in VLM survey. In addition, we develop multimodal methods and benchmarks for human-centered reasoning and decision-making, including VIVA, VIVA+, YesBut, YesBut-V2, and When Words Outperform Vision. Overall, our goal is to make multimodal and embodied AI systems more robust, trustworthy, and effective in real-world settings.
Education
- Ph.D. in Computer Engineering, Northeastern University, Boston, USA (2019 - 2023)
- M.S. in Electrical and Computer Engineering, Northeastern University, Boston, USA (2016 - 2018)
- B.E. in Electrical and Information Engineering, Wuhan University of Technology, Wuhan, China (2012 - 2016)
Teaching
- CSDS 570 - Deep Generative Models, Case Western Reserve University, USA, 2025-2026 Spring
- CSDS 465 - Computer Vision, Case Western Reserve University, USA, 2024 Spring & Fall, 2025 Fall
- CSDS 600 - Special Topics on Generative Models, Case Western Reserve University, USA, 2023 Fall
- EECE 5642 - Data Visualization, Northeastern University, USA, 2021 Spring
Selected Awards
- UCITE Glennan Fellowship, Case Western Reserve University, 2026
- Breaking Boundaries Seed Grant, Case Western Reserve University, 2026
- OpenAI Researcher Access Program, 2025
- Teaching Award, Department of Computer and Data Sciences, Case Western Reserve University, USA, 2024
- PhD Spotlight, Northeastern University, USA, 2023
- Dissertation Fellowship, Northeastern University, USA, 2023
- NSF I-Corps Grant, 2022
- PhD Network Grant, Northeastern University, USA, 2019, 2023
Selected Academic Activities
- Area Chair (AC)
- ACL (2025-2026)
- ICLR (2026)
- NeurIPS (2026)
- Workshop Organizer / Program Chair
- ICCV - AMFG (2025)
- ICCV - AMFG (2023)
- CVPR - AMFG (2021)
- FG - RFIW (2020)
- Panel Reviewer
- NSF/NIH, Smart Health (SCH) Program, 2026
- Program Committee Member (2019-Now)
- Journals: TPAMI, TIP, TNNLS, TCyber, TCSVT, IoT, Elsevier
- Conferences: CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, ACL Rolling Review (ARR), AAAI, IJCAI, ACM MM
Selected Publications
- Segment then Splat: Unified 3D Open-Vocabulary Segmentation via Gaussian Splatting.In Advances in Neural Information Processing Systems (NeurIPS), 2025.
@article{lu2025segment, title = {Segment then Splat: Unified 3D Open-Vocabulary Segmentation via Gaussian Splatting}, author = {Lu, Yiren and Zhou, Yunlai and Qiao, Yiran and Song, Chaoda and Liang, Tuo and Ma, Jing and Wang, Huan and Yin, Yu}, journal = {Advances in Neural Information Processing Systems (NeurIPS)}, year = {2025}, status = {accepted}, pdf = {https://arxiv.org/pdf/2503.22204v2}, website = {https://yiren-lu.com/project_pages/Segment-then-Splat/}, code = {https://github.com/luyr/Segment-then-Splat} } - BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting.In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), pp. 16532–16542, 2025.
@inproceedings{lu2025bard, title = {BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting}, author = {Lu, Yiren and Zhou, Yunlai and Liu, Disheng and Liang, Tuo and Yin, Yu}, booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)}, pages = {16532--16542}, year = {2025}, status = {accepted}, pdf = {https://arxiv.org/pdf/2503.15835}, website = {https://yiren-lu.com/project_pages/BARD-GS/}, code = {https://github.com/luyr/BARD-GS}, data = {https://drive.google.com/drive/u/0/folders/1CRBQ_HR3yKhT3G9_ttTWA1PWXWL6DtsV} } - Praxis-vlm: Vision-grounded decision making via text-driven reinforcement learning.In Advances in Neural Information Processing Systems (NeurIPS), 2025.
@article{hu2025praxis, title = {Praxis-vlm: Vision-grounded decision making via text-driven reinforcement learning}, author = {Hu, Zhe and Li, Jing and Pu, Zhongzhu and Chan, Hou Pong and Yin, Yu}, journal = {Advances in Neural Information Processing Systems (NeurIPS)}, year = {2025}, status = {accepted}, pdf = {https://arxiv.org/pdf/2503.16965}, code = {https://github.com/Derekkk/Praxis-VLM} } - Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions.In Advances in Neural Information Processing Systems (NeurIPS), vol. 37, pp. 47166–47188, 2024.(Oral)
@article{hu2024cracking, title = {Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions}, author = {Hu, Zhe and Liang, Tuo and Li, Jing and Lu, Yiren and Zhou, Yunlai and Qiao, Yiran and Ma, Jing and Yin, Yu}, journal = {Advances in Neural Information Processing Systems (NeurIPS)}, volume = {37}, pages = {47166--47188}, year = {2024}, status = {accepted}, note = {Oral}, pdf = {https://openreview.net/pdf?id=bCMpdaQCNW}, website = {https://vulab-ai.github.io/YESBUT_Homepage/}, data = {https://huggingface.co/datasets/zhehuderek/YESBUT_Benchmark}, code = {https://github.com/Derekkk/VIVA_EMNLP24} } - When ’YES’ Meets ’BUT’: Can AI Comprehend Contradictory Humor in Comics?In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2026.(Impact Factor: 20.4)
@article{liang2026yesbut, title = {When 'YES' Meets 'BUT': Can AI Comprehend Contradictory Humor in Comics?}, author = {Liang, Tuo and Hu, Zhe and Li, Jing and Zhang, Hao and Lu, Yiren and Zhou, Yunlai and Qiao, Yiran and Liu, Disheng and Peng, Jierui and Ma, Jing and Yin, Yu}, journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)}, year = {2026}, doi = {10.1109/TPAMI.2026.3688191}, note = {Impact Factor: 20.4}, status = {accepted}, pdf = {https://arxiv.org/pdf/2503.23137.pdf}, website = {/projects/yesbut-v2/}, data = {https://huggingface.co/datasets/zhehuderek/YESBUT_Benchmark} }