In some example embodiments, a graphical user interface (GUI) is caused to be displayed on a computing device of a user. The GUI can be configured to enable the user to submit an identification of a dataset and at least one configuration parameter. The identification of the data source, the at least one configuration parameter, and the at least one wrangling parameter can be received via the GUI on the computing device. A sampling algorithm can be configured based on the at least one configuration parameter. A sample of data from the dataset can be generated using the configured sampling algorithm. At least one data wrangling operation can be performed on the sample of data based on the at least one wrangling parameter.