The present disclosure provides a method and system for realizing interaction in augmented reality. The method includes: collecting a frame image and uploads the frame image; recognizing a template image that matches the frame image and returning the template image; detecting a marker area of the frame image according to the template image; and superposing media data corresponding to the template image on the marker area and displaying the superposed image.