The present disclosure relates to an image processing apparatus and a method that allow for easier and more appropriate rendering. Coded data is generated by encoding a two-dimensional plane image in which position information and attribute information for a point cloud that represents an object having a three-dimensional shape as a group of points are projected onto a two-dimensional plane, and a bitstream that includes the generated coded data and metadata to be used to render the point cloud is generated. The present disclosure can be applied to, for example, an image processing apparatus, an electronic device, an image processing method, a program, or the like.