The company has also tightened the core code base to run faster subqueries, database statistics and other routine database operations. The backup capabilities have been expanded. And this release features cluster-cloning capabilities, or the ability to break off a piece of the database and run it in its own sandboxed environment.
"So if you have a 200-node cluster and want to spin off a sandbox for a separate data mart, you can point Vertica at the new server cluster, hit a button, and it will automatically ship the data off" to this new cluster, he said.
The new release "plays to the strengths that Vertica already has in terms of scalability, parallelism and rapid provisioning," said James Kobielus, an IT industry analyst focusing on data warehousing for Forrester Research. Forrester has predicted that by the end of this year, "pretty much every data warehouse vendor will have native support for the MapReduce APIs (application programming interfaces) within their core products," he said, noting that Teradata's Aster Data and EMC's Greenplum units both offer MapReduce support.
"Everyone is going down these lines fairly quickly," he said. Because MapReduce is an open framework, an organization could, in theory, build a MapReduce model for one data warehouse and have it run on another data warehouse from a different vendor.
The software is available either as a download or packaged in an appliance. Pricing will be based on the amount of data analyzed.
Sign up for CIO Asia eNewsletters.