From Amazon’s top data geek: data has got to be big — and reproducible

Big data and the horsepower needed to generate, store and manage it is all great. Now we need to make sure our data is reproducible, says AWS principle data scientist.