Filtering the digital exhaust

If 80 percent of new data created is going to be unstructured, where is all that data coming from? It’s coming from consumers’ activities online and it requires real-time processing, said Continuuity’s Todd Papaionnou at Structure:Data on Wednesday.

Todd Papaioannou of Continuuity at Structure:Data 2012

(c) 2012 Pinar Ozger. [email protected]

Papaionnou calls this unstructured data “digital exhaust,” everything consumers do on a daily basis — clicks, tweets, searches, Facebook posts. Companies can use it to offer more customized experiences for consumers online – content and deal targeting, advertising and sentiment analysis — but they have to process it first.
“Who cares if you’re able to get data signals if you can’t act on it?” Papaiaonnou said, noting Hadoop isn’t real time. “To get to real time you have to take the human out of the equation and [start] allowing machines to make these decisions.” He pointed to initiatives like Yahoo’s S4 and Twitter’s Storm as ones to watch.
Watch the livestream of Structure:Data here.