Detecting Target Objects by Natural Language Instructions Using an RGB-D Camera

Sensors - Switzerland
doi: 10.3390/s16122117