A method of interaction using augmented reality includes capturing a first video image using a camera, generating first augmented reality (AR) coordinates corresponding to the first video image, transmitting AR coordinates and first video image to remote user, receiving first video image and annotations from remote user, capturing a second video image using a camera, generating second AR coordinates corresponding to the second video image, and viewing annotations registered to second video image.