GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning
We use 3D Gaussian Splatting (3DGS) as a persistent spatial memory for embodied navigation, enabling the agent to "hallucinate" optimal views for high-fidelity Vision-Language Model (VLM) reasoning.
Embodied AI
3D Vision
VLM Reasoning
3DGS
Spatial Memory
Zero-Shot Exploration
Embodied Reasoning