Three-dimensional visual servoing for positioning a robot in an environment is facilitated. Three-dimensional point cloud data of a scene of the environment is obtained, the scene including a feature. The three-dimensional point cloud data is converted into a two-dimensional image, and a three-dimensional position of the feature is identified based on the two-dimensional image. An indication of the identified three-dimensional position of the feature is then provided.
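
By way of illustration only, the conversion and lookup described above can be sketched as follows. The sketch assumes an organized point cloud stored as an (H, W, 3) array, so that each pixel of the two-dimensional image corresponds to exactly one three-dimensional point. The function names (`cloud_to_depth_image`, `find_feature_pixel`, `locate_feature`) are hypothetical, and the "nearest valid point" detector is merely a stand-in for whatever two-dimensional feature identification is actually used. None of these details are specified by the text above.

```python
import numpy as np


def cloud_to_depth_image(cloud):
    """Convert an organized (H, W, 3) point cloud to an (H, W) image.

    Assumption: the z coordinate of each point is used as the pixel value.
    """
    return cloud[..., 2].astype(float)


def find_feature_pixel(image):
    """Hypothetical stand-in detector: pick the pixel with the smallest depth.

    Non-finite pixels (missing sensor returns) are ignored.
    """
    masked = np.where(np.isfinite(image), image, np.inf)
    row, col = np.unravel_index(np.argmin(masked), masked.shape)
    return int(row), int(col)


def locate_feature(cloud):
    """Identify the feature in 2D, then look up its 3D position in the cloud."""
    image = cloud_to_depth_image(cloud)
    row, col = find_feature_pixel(image)
    return cloud[row, col]


# Synthetic 4x5 scene: a flat surface at z = 2.0 with one closer point.
rows, cols = np.mgrid[0:4, 0:5]
cloud = np.stack([cols, rows, np.full((4, 5), 2.0)], axis=-1).astype(float)
cloud[1, 2, 2] = 0.5      # the "feature": a point nearer to the sensor
cloud[0, 0] = np.nan      # a missing return, skipped by the detector

position = locate_feature(cloud)
print(position.tolist())  # [2.0, 1.0, 0.5]
```

Because the pixel grid and the point cloud share the same indexing in this sketch, the three-dimensional position is recovered by a direct lookup at the detected pixel. An unorganized cloud would instead require an explicit projection step and a stored pixel-to-point mapping.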