Multi-view geometry based visual perception and control of robotic systems