An urban scenes reconstruction method includes: acquiring digital data of a three-dimensional subject, the digital data comprising a 2D photograph and a 3D scan; fusing the 3D scan and the 2D photograph to create a depth-augmented photograph; decomposing the depth-augmented photograph into a plurality of constant-depth layers; detecting repetition patterns of each constant-depth layer; and using the repetitions to enhance the 3D scan to generate a polygon-level 3D reconstruction.