Patent attributes
The present disclosure relates to language-agnostic, unsupervised removal of text from form images. According to one embodiment, a method comprises generating a spectral domain representation of an image by applying a two-dimensional frequency domain transformation, where the image depicts form layout elements and text elements; applying a first filter to the spectral domain representation to remove a portion of the frequency domain corresponding to the text elements; and applying an inverse two-dimensional frequency domain transformation to the filtered spectral domain representation of the image to generate a reconstructed image. The text elements are not depicted in the reconstructed image.
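
The three steps of the method (forward transformation, filtering, inverse transformation) can be illustrated with a minimal sketch. The abstract does not name the transformation or define the first filter, so the sketch makes two assumptions: the transformation is a 2-D discrete Fourier transform computed with NumPy, and the filter is a hypothetical binary mask that passes only the frequencies on the two frequency axes. That mask relies on the further assumption that layout elements are full-width horizontal or full-height vertical rules, whose energy lies on those axes, while compact text glyphs spread their energy across the spectrum. The function name `remove_text_fft` and the parameter `half_width` are illustrative and do not come from the disclosure.

```python
import numpy as np


def remove_text_fft(image: np.ndarray, half_width: int = 0) -> np.ndarray:
    """Suppress compact (text-like) content by masking the 2-D spectrum.

    Assumption, not from the source: layout rules span the whole image
    horizontally or vertically, so their energy sits on the frequency axes.
    """
    rows, cols = image.shape

    # Step 1: spectral domain representation (zero frequency moved to centre).
    spectrum = np.fft.fftshift(np.fft.fft2(image))

    # Step 2: hypothetical "first filter", a mask that keeps only a band
    # of width 2 * half_width + 1 around each frequency axis.
    cy, cx = rows // 2, cols // 2
    mask = np.zeros((rows, cols), dtype=bool)
    mask[max(0, cy - half_width) : cy + half_width + 1, :] = True
    mask[:, max(0, cx - half_width) : cx + half_width + 1] = True
    filtered = spectrum * mask

    # Step 3: inverse transformation. The mask is symmetric about the centre,
    # so for a real input the result is real up to rounding error.
    return np.real(np.fft.ifft2(np.fft.ifftshift(filtered)))


# Synthetic example: two rules (layout) plus one small 3x3 blob (text stand-in).
form = np.zeros((64, 64))
form[20, :] += 1.0   # full-width horizontal rule
form[:, 40] += 1.0   # full-height vertical rule

text = np.zeros((64, 64))
text[30:33, 10:13] = 1.0

image = form + text
recon = remove_text_fft(image, half_width=0)
```

In this synthetic case the rules survive the filter essentially unchanged, while the blob is reduced to a faint residue spread along its rows and columns rather than removed exactly. Scanned forms with skewed, broken, or partial-width lines would need a differently designed filter, which the abstract leaves unspecified.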