Patent attributes
Computer implemented methods and apparatus for extracting list information into database tables. A number of fields are independently determined for items in list. A number of database table columns are determined from most common number of list item fields. New fields are determined for items with more fields than database columns. Null fields are inserted into items with fewer fields than database columns. Information from items having the same number of fields as database columns is written to database table rows. Information from each field is written to a corresponding database table column. Streaks of poorly matching cells in a database table row are determined. Streak cells are merged and new cells are determined. Null cells are inserted if number of new cells is less than number of cells in the streak. Information from the new cells is written to the table row and columns that define the streak.