A video object clipping method includes storing, in a storage unit, original images each including a video object to be clipped and reference alpha images representing objects prepared, determining a criteria original image and a criteria reference alpha image from the original images and the reference alpha images, determining a deformation parameter by deforming the criteria reference alpha image to correspond to the criteria original image, and deforming remaining ones of the reference alpha images according to the determined deformation parameter to generate output alpha images corresponding to the original images.