Hadoop is the software that stitches together these commodity servers into a big data platform. With Hadoop, a single node in the cluster can hold 16 3-terabyte (TB) disks, or 48TB of data per node. Instead of enterprise storage solutions costing between $10,000 and $125,000 per TB, Hadoop delivers an analytic and storage platform for a couple hundred dollars per TB. If more processing or data capacity is needed, simply add nodes to the cluster. The MapR distribution for Hadoop, for example, provides an enterprise-grade platform. Some examples of this power include the ability to:
- Create targeted marketing applications that combine transaction data and customer interactions with content recommendations to develop significant new revenue opportunities
- Improve the accuracy and timeliness of fraud detection, operational analytics and quality management
- Scale operations with at least 10-to-1 cost efficiencies over traditional servers, NAS or SAN alternatives
Any one of these examples could be the basis of a competitive advantage and increased profitability. That's why many organizations have started to deploy Hadoop or are actively investigating its use.
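The capacity and cost figures quoted above can be checked with a little arithmetic. The sketch below is only illustrative: the disk count, disk size and enterprise price range come from the text, while the $200 figure is an assumed reading of "a couple hundred dollars per TB", and the function names are hypothetical. It counts raw disk capacity only and ignores replication and file system overhead, so real usable capacity would be lower.

```python
import math

# Figures taken from the article.
DISKS_PER_NODE = 16
TB_PER_DISK = 3
ENTERPRISE_COST_PER_TB = (10_000, 125_000)  # low and high end, in dollars

# Assumption: "a couple hundred dollars per TB" is read as $200.
HADOOP_COST_PER_TB = 200


def raw_capacity_tb(nodes: int) -> int:
    """Raw disk capacity of a cluster, ignoring replication and overhead."""
    return nodes * DISKS_PER_NODE * TB_PER_DISK


def nodes_needed(data_tb: float) -> int:
    """Smallest number of nodes whose raw capacity covers data_tb."""
    return math.ceil(data_tb / raw_capacity_tb(1))


def cost_ratio_range() -> tuple:
    """Enterprise-to-Hadoop cost ratio at the low and high enterprise prices."""
    return tuple(cost / HADOOP_COST_PER_TB for cost in ENTERPRISE_COST_PER_TB)


print(raw_capacity_tb(1))   # 48 TB per node, matching the article
print(nodes_needed(100))    # nodes required for 100 TB of raw data
print(cost_ratio_range())   # (50.0, 625.0) under the $200 assumption
```

Under these assumptions the cost ratio comes out well above the "at least 10-to-1" figure in the list, which suggests the article's claim is a conservative one.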
The year 2012 brought important big data advancements and demonstrated success in large enterprises and small organizations, both on-premises and in the cloud. But there is more to come. Hadoop is about to become even more powerful and capable. No longer do enterprises need to limit their Hadoop usage to batch processing. Now they'll be able to perform analytics using batch or real-time processing, which will expand use cases and applications.
We're going to see these kinds of real-time capabilities emerge across the whole Datacosm in 2013, as well as the ability to integrate NoSQL processing directly into the Hadoop framework to provide new capabilities for integrated data analysis. All of these advances will help businesses as they work to refine and improve their own uses of their massive stores of customer, transactional and machine-generated data.
The Datacosm has the power to transform the competitive dynamics of entire industries. Gilder has documented two successive waves that brought tremendous change and opportunity. The Datacosm represents a third wave.
Are you ready?