IBM announced a new, promising collection of cloud data services on Monday, adding to an already-impressive collections services on its Bluemix platform. At this point, though, IBM’s biggest challenge isn’t selling enterprise users on the cloud, but convincing them it’s still the best choice.
eBay has open sourced a database technology, called Kylin, that takes advantage of distributed processing and the HBase data store in order to return faster results for SQL queries over Hadoop data.
A new open-source project called Postgres-XL is pushing scale-out and MPP capabilities for the popular database. Postgres-XL is the product of a database vendor called TransLattice and is based on technology it acquired from StormDB in October.
Citus Data, a startup focused on turning PostgreSQL into a scale-out analytic engine, has developed a developed a columnar data store for the popular open source database. The company is open sourcing its extension for single-node environments, although it’s offering a distributed version as part of its CitusDB software. Citus already supported interactive SQL queries over Postgres (on which its technology is based), Hadoop and MongoDB, but columnar stores are faster for certain types of queries. Also, the compression features of the ORC file format that CitusDB uses can cut disk space by more than half.
MapR is continuing along its path to Hadoop glory with new support for the YARN resource manager and a direct integration with the HP Vertica analytic database. In such a competitive space, every little edge matters.
MemSQL, the database startup from two former Facebook engineers, has already raised a lot of money and roped in some big customers. Now it’s looking to broaden its footprint with a flash-optimized columnar store to complement its in-memory row-based one.
Database startup MemSQL has been on fire since it launched in mid-2012, and now it has a lot more money to keep up that momentum. The company has closed an oversubscribed series B round worth $35 million.
Cloudera is touting the speed of its Impala query engine compared to Hive and a leading relational database system, but those aren’t really apples-to-apples comparisons. The real question is how all the SQL-on-Hadoop options stack up against one another.
HP released a new version of its Vertica database than easily connects with other systems to bring in unstructured data. It’s a big update for a database based on analytic SQL workloads but that needs to find a way to play with today’s data formats.
Facebook has open sourced Presto, a SQL engine it says is on average 10 times faster than Hive for running queries across large data sets stored in Hadoop and elsewhere.