Patent attributes
The present application relates to a method performed at an electronic device for parsing tables in a PDF document. The method includes the following steps: receiving the PDF document containing a table area; extracting horizontal lines, vertical lines and text blocks in the table area; determining the types of tables in the table area according to the extracted horizontal lines and vertical lines; if the table is a quasi full-line table, determining the structure of the quasi full-line table in the table area according to the horizontal lines and the vertical lines in the table area with the assistance of the text blocks in the table area; and if the table is a quasi non-line table, determining the structure of the quasi non-line table in the table area according to the text blocks in the table area with the assistance of the horizontal lines and/or the vertical lines in the table area.