The tool can be used to benchmark application domains such as ecommerce, advertising and social networks. It can handle up to billions of rows, schema complexity, and temporal evolution. For each
dataset, Amazon can define relevant predictive tasks, such as estimating missing cell values.