Data might be the new oil, but a lot of us just need gasoline

One of the biggest tropes in the era of big data is that data is the new oil: it’s very valuable to the companies that have it, but only after it has been mined and processed. The analogy makes some sense, but it ignores the fact that many people and companies don’t have the means to collect the data they need or the ability to process it once they have it. A lot of us just need gasoline.

Which is why I was excited to see the new Data for Everyone initiative that crowdsourcing startup CrowdFlower released on Wednesday. It’s a library of interesting and free datasets that have been gathered by CrowdFlower’s users over the years and verified by the company’s crowdsourced labor force. Topics range from Twitter sentiment on various subjects to a collection of labeled medical images.

Data for Everyone is far from comprehensive, and far from a one-stop shop for data democratization, but it is a good approach to a problem that lots of folks have been trying to solve for years: giving people interested in analyzing valuable data access to that data in a meaningful way. Unfortunately, early attempts at data marketplaces such as Infochimps and Quandl, and even earlier incarnations of the federal Data.gov service, often included poorly formatted data or suffered from a dearth of interesting datasets.

An example of what's available in Data for Everyone.

It’s often said that data analysts spend 85 percent of their time formatting data and only 15 percent of it actually analyzing data — a situation that is simply untenable for people whose jobs don’t revolve around data, even as tools for data analysis continue to improve. All the Tableau software or Watson Analytics or DataHero or PowerBI services in the world don’t do a whole lot to help mortals analyze data when it’s riddled with errors or formatted so sloppily it takes a day just to get it ready to upload.

Hopefully, we’ll start to see more high-quality data markets pop up, as well as better tools for collecting data from services such as Twitter. These markets and tools don’t necessarily need to be so easy a 10-year-old can use them, but they do need to be easy enough that someone with basic programming or analytic skills can get up and running without quitting their day job. Data for Everyone looks like one such effort, as does the new Wolfram Data Drop, also announced on Wednesday.

Because while it’s getting a lot easier for large companies and professional data scientists to collect their data and analyze it for purposes ranging from business intelligence to training robotic brains — topics we’ll be discussing at our Structure Data conference later this month — the little guy, strapped for time and resources, still needs more help.

CrowdFlower raises $12.5M to deliver better data for better models

Crowdsourcing startup CrowdFlower has raised another $12.5 million as it tries to make life better for the data science community. As people try to get better, faster data to power their predictive models, CrowdFlower’s API-focused approach is proving pretty popular.

Exclusive: CrowdControl launches, brings AI to crowdsourcing

A new startup called CrowdControl is launching today, and it aims to bring order to the world of crowdsourcing by using artificial intelligence to judge workers’ accuracy. Think Amazon Mechanical Turk, only with a quality control mechanism in place to help ensure jobs get done right.

Crowdsourcing Allows for On-Demand Work Capacity

There is a new workplace infrastructure, and it has moved out of the office and into the cloud. The human cloud lets SMBs and enterprises hire top talent, reduce overhead costs, and use online technology to assemble and manage teams to get work done.

The Future of Work Won’t Contain Resumes

Technology is making the resume obsolete. Now, some candidates send LinkedIn profiles in lieu of resumes. But sites like oDesk and eLance more closely reflect the future of resumes and how companies hire because they use reputation data to shed light on a candidate.

Give Work App Lets You Do Good Anywhere

Two San Francisco-based startups, Samasource and CrowdFlower, today released a free iPhone application in the iTunes App Store called Give Work that lets you spend a few seconds of your time helping Kenyan refugees earn money, and in turn, improve their quality of life. An fbFund startup, Samasource is a non-profit that provides tech work for women, youth and refugees in countries such as Kenya and Pakistan. CrowdFlower, meanwhile, pairs businesses with pools of workers from such regions who can complete simple tasks that a computer can’t, such as removing spam from a company blog.