Improving data quality: data cleansing, e.g. de-duplication, removing invalid entries, or correcting typographical errors
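The three cleansing steps named above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the record fields (`name`, `age`, `city`), the validity rule, the `KNOWN_CITIES` vocabulary, and the similarity cutoff are hypothetical and not taken from any particular system. Typographical correction is approximated here with fuzzy matching from Python's standard `difflib` module, which is only one of several possible approaches.

```python
import difflib

# Hypothetical canonical vocabulary used to correct misspelled city names.
KNOWN_CITIES = ("London", "Paris", "Berlin")


def correct_city(value, known=KNOWN_CITIES, cutoff=0.8):
    """Return the closest known city, or the normalized input if none is close."""
    cleaned = value.strip().title()
    matches = difflib.get_close_matches(cleaned, known, n=1, cutoff=cutoff)
    return matches[0] if matches else cleaned


def is_valid(record):
    """Assumed rule: a non-empty name and an integer age between 0 and 130."""
    age = record.get("age")
    has_name = bool(record.get("name", "").strip())
    return has_name and isinstance(age, int) and 0 <= age <= 130


def cleanse(records):
    """Drop invalid records, correct city typos, then remove duplicates."""
    seen = set()
    result = []
    for record in records:
        if not is_valid(record):
            continue  # removing invalid entries
        fixed = {
            "name": record["name"].strip(),
            "age": record["age"],
            "city": correct_city(record.get("city", "")),  # correcting typos
        }
        # De-duplication key: case-insensitive name plus the corrected fields.
        key = (fixed["name"].lower(), fixed["age"], fixed["city"])
        if key not in seen:
            seen.add(key)
            result.append(fixed)
    return result


raw = [
    {"name": "Ada", "age": 36, "city": "Londn"},    # typo in city
    {"name": "ada ", "age": 36, "city": "London"},  # duplicate of the first
    {"name": "", "age": 40, "city": "Paris"},       # invalid: empty name
    {"name": "Bob", "age": -5, "city": "Berlin"},   # invalid: negative age
    {"name": "Cy", "age": 51, "city": "Pariss"},    # typo in city
]
clean = cleanse(raw)
print(clean)
```

Correcting typos before de-duplicating is a deliberate ordering in this sketch: records that differ only by a misspelling then collapse into a single entry. A real pipeline would need its own validity rules and a reviewed reference vocabulary, since fuzzy matching can also "correct" a legitimate value into the wrong one.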