An image processing apparatus and method are provided. The image processing apparatus acquires a target image that includes a depth image of a scene, determines three-dimensional (3D) point cloud data from the depth image, and, based on the 3D point cloud data, extracts an object included in the scene to obtain an object extraction result.
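The depth-to-point-cloud step can be illustrated with a minimal sketch. It assumes a pinhole camera model with intrinsics `fx`, `fy`, `cx`, `cy`, which the abstract does not specify. The function names `depth_to_point_cloud` and `extract_near_object` are hypothetical, and the depth threshold used for extraction is only a stand-in, since the abstract does not say how the object is extracted from the point cloud.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def depth_to_point_cloud(
    depth: Sequence[Sequence[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point]:
    """Back-project a depth image into 3D points.

    Assumption: pinhole camera model; depth[v][u] is the distance along
    the optical axis, and non-positive values mark invalid pixels.
    """
    points: List[Point] = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:  # skip pixels with no valid depth reading
                continue
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points


def extract_near_object(points: Sequence[Point], max_depth: float) -> List[Point]:
    """Stand-in extraction step: keep points closer than max_depth.

    This threshold is an illustrative placeholder, not the extraction
    method of the described apparatus.
    """
    return [p for p in points if p[2] < max_depth]


# Tiny 2x2 depth image; intrinsics chosen only to keep the arithmetic simple.
depth_image = [
    [0.0, 2.0],
    [1.0, 4.0],
]
cloud = depth_to_point_cloud(depth_image, fx=1.0, fy=1.0, cx=0.0, cy=0.0)
extracted = extract_near_object(cloud, max_depth=3.0)
```

Here the pixel with depth 0.0 is dropped as invalid, the remaining three pixels become 3D points, and the placeholder extraction keeps the two points nearer than 3.0 units.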