A method and system for incorporating voice commands into the interactive process of image segmentation. Interactive image segmentation involves a user pointing at an image; voice commands quicken this interaction by indicating the purpose and function of the pointing. Voice commands control the governing parameters of the segmentation algorithm. Voice commands guide the system to learn from the user's actions, and from the user's manual edits of the results from automatic segmentation.