Why the real world is the next big opportunity in big data

Matt Ocko had been investing in data-based companies for years before he and Zack Bogue officially launched Data Collective Venture Capital in 2012. Since then, the firm has made seed investments in nearly every hot startup in the space, ranging from database companies such as MemSQL to satellite companies such as Planet Labs. This week, Ocko came on the Structure Show podcast to talk about what has him excited as an investor and what’s just overdone.

He has interesting opinions on a lot of topics in the tech space, all of which are worth hearing, but here are the highlights from our interview. If you want to hear more from Ocko, he’ll be speaking at Structure Data (March 18-19 in New York) alongside last week’s podcast guest, Hilary Mason. It should make for a really fun talk about the future of data.

As it turns out, a handful of Data Collective portfolio companies will also be presenting, including Enlitic, Blue River Technology and Interana.

[soundcloud url=”https://api.soundcloud.com/tracks/188532972″ params=”color=ff5500&auto_play=false&hide_related=false&show_comments=true&show_user=true&show_reposts=false” width=”100%” height=”166″ iframe=”true” /]

Download This Episode

Subscribe in iTunes

The Structure Show RSS Feed

Useful doesn’t always means sexy

As sexy as the Snapchats and Slacks of the world are, Ocko is excited to see elite developers — who he says can do wonders with today’s infrastructure technologies — turn their attention toward applications such as supply-chain management or agriculture that can have “a material impact on a pretty big swath of GDP.” These spaces might seem unglamorous, and the companies and technologies might have to sneak up on the world, “a little contrarian and a little unloved,” but the rewards can be huge, Ocko said.

Just look at [company]SAP[/company], he explained:

“[T]hey said, ‘Hey, let’s integrate your accounting, and your supply chain, and your manufacturing and planning, and we’ll tell you what’s coming into your factories, how much of it is being made, how much it costs you and how much you sold it for.’ And that was transformative for manufacturing. That was more transformative, I would argue, than early industrial robots. People’s brains melted out of their ears. This was a massive operational advantage for these companies.”

Or the personal computer, which Ocko said didn’t get a whole lot of love when the concept was first introduced in the 1970s: “And it was way more transformative than any 1,000 mainframes you could have built. It created the industry that we have today.”


Matt Ocko

‘Software eats glassware’

Specifically, Ocko said Data Collective is excited about the potential for new technologies to underpin companies that can fundamentally improve lives, often by focusing on difficult scientific challenges.

“We call it ‘software eats glassware,'” he said. “We see stuff happening in computational biology and related informatics fields where you can model living things without huge capex, or even opex, and get insight into making people and animals and, heck, the planet itself a lot healthier in a way that’s consistent with the capitalist system that we all have to live in, but has profound positive impact while you’re making money.”

Betting on applications, not technologies — even deep learning

Even in the hottest of hot technology areas, Ocko says the focus from an investment standpoint still needs to be whether the technology has real and necessary applications, not just some cool research and maybe a big name. He invested in deep-learning-for-medical-imaging startup Enlitic, for example, but isn’t keen on building a portfolio of deep learning startups just because they’re getting acquired like mad right now.

“Just kind of crossing our fingers and praying for a DeepMind $400 million exit … because people in the company are so brilliant feels kind of dot.commy to me,” Ocko said. “. . . That’s just a recipe for blowing your [limited partners’] money.”

We’re good on Hadoop and marketing software for now

Even with some truly innovative and truly transformative technologies, though, there comes a point when markets become saturated. That doesn’t mean startups in those spaces will crash and burn — they could build nice companies — but they do become less attractive as investment opportunities.

One of them is Hadoop, which Ocko says still could use a lot of finessing, but has probably peaked in terms of producing huge valuations. “If you are lighting up a Hadoop cluster of some sort, there’s still a lot ‘Hadoop for x’ that’s probably needed to make your life easier,” he said. “But to your point, I’m not sure those are giant companies anymore. They may well be very nice, but not homerun — from a VC perspective — acquisitions for a Hortonworks or MapR or Cloudera.”

Another is sales and marketing software souped up with machine learning. “The number of companies I’ve seen pursuing closely related pipeline-mining opportunities, either for marketing optimization or for sales optimization, is literally over 100 now,” Ocko said. “When there are that many closely related companies with similar ideas, I think you’re potentially headed for tragedy.”

Don’t like your cloud vendor? Wait a second.

[soundcloud url=”https://api.soundcloud.com/tracks/188532972″ params=”color=ff5500&auto_play=false&hide_related=false&show_comments=true&show_user=true&show_reposts=false” width=”100%” height=”166″ iframe=”true” /]

If your cloud provider’s lineup leaves you cold, you can jump ship. Or you could wait a minute because things could — probably will — change. There’s already been a ton of consolidation  — the acquisitions of MetaCloud, Eucalyptus, Cloudscaling et al by Cisco, HP, and EMC respectively.

And considerable partnering as well. This week’s news that [company]VMware[/company] and [company]Google[/company] are collaborating to bring key Google cloud services — including BigQuery — to VMware’s vCloud Air is just the latest –and most striking example.

Google potentially gets an entry point to VMware’s enterprise customers and vCloud Air gets more cool large-scale applications to bring in more customers. Depending on how this arrangement is executed it could give both players a better competitive story vis-a-vis [company]Amazon[/company] Web Services.

Speaking of which, the latter continued its push into business applications with AWS WorkMail. Seriously, yet another email product? Wasn’t this category supposed to be dead already?

We also hash out the tectonic shifts at [company]IBM[/company]. First, a key departure — that of Lance Crosby, former CEO of SoftLayer, the company IBM bought in mid 2012 for $2.2 billion. Crosby came over to drive IBM’s cloud strategy and then suddenly he wasn’t. And now he’s gone.

But there are other exits coming too as IBM launches another round of layoffs. They probably won’t amount to the 26 percent cuts reported (110,000 jobs!) but no one really knows how deep they’ll go so stay tuned.

On a cheerier note we hash out cool new business intelligence both from IBM and [company]Microsoft[/company]. And, in the run up to Structure Data, Derrick talks to Matt Ocko,  co-managing partner of DataCollective, who has a good handle on what’s overdone, and what’s truly new-and-important in data. You’ll get a taste of what you’ll hear about at Structure Data in March.  So give it a listen.






Hosts: Barb Darrow and Derrick Harris.

Download This Episode

Subscribe in iTunes

The Structure Show RSS Feed


Hilary Mason on taking big data from theory to reality

On the importance of building privacy into apps and Reddit AMAs

Cheap cloud + open source = a great time for startups 

It’s all Docker containers and the cloud on the Structure Show

Mo’ money, mo’ data, mo’ cloud on the Structure Show

Why CoreOS went its own way on containers



Data Collective adds Todd Papaioannou as EIR

Todd Papaioannou is joining big data-focused venture capital firm Data Collective as a entrepreneur in residence. Papaioannou was most recently co-founder and CEO of Continuuity, and as has held executives roles at companies including Yahoo and Teradata.

Why Trifacta is teaching humans and data to work together

A new startup called Trifacta, founded by UC-Berkeley professor Joe Hellerstein and Stanford professor Jeffrey Heer, wants to eliminate much of the hassle of making messy data usable. The company combines machine learning and human-computer interaction, and has raised $4.3 million from Accel Partners.

Big data VC firm Data Collective steps out of the shadows

Data Collective is a new venture capital firm focusing on seed-round investments in big data startups, from the infraastructure level up to analytics and applications. Among its collection of portfolio companies are Kaggle, MemSQL, Continuuity, Parse, Keen.io, Meteor, MongoHQ, Citus Data and Piston Cloud.