Visual Perception For Robotic Spatial Understanding