Amazon Web Services has improved the performance of its Redshift data warehouse with new SSD-based nodes, which can also lower the cost of the service as long as storage capacity needs are also low.
Just like with many of its other hosted services, Amazon contends that Redshift lowers the bar for implementing and managing, in this case, a data warehouse. IT takes care of the work needed to set up and operate a data warehouse, including provisioning the infrastructure and automating tasks such as backups and patching.
Redshift data warehouses are made up of clusters of Dense Storage nodes or the new SSD-based Dense Compute nodes. The storage nodes allow enterprises to create very large data warehouses using hard disk drives for a low price per gigabyte, while the compute nodes allow enterprises to build high-performance data warehouses using faster CPUs, large amounts of RAM and SSD storage.
The compute nodes are ideal for enterprises which have less than 500GB of data in their warehouse or whose primary focus is performance. The storage nodes are a better fit when performance isn't as critical and storage demands are high but the budget isn't.
On-demand prices for a single Large Dense Compute node start at US$0.25 per hour. For that users get 160GB of SSD storage, two Intel Xeon E5-2670v2 virtual cores (based on Ivy Bridge) and 15GB of RAM. A single Extra Large Dense Storage node may cost from $0.85 per hour, but it has 2TB of storage. It also has two Intel Xeon E5-2650 virtual cores (based on Sandy Bridge) and 15GB of RAM.
For users that want better performance and more storage per node, Amazon also offers Eight Extra Large nodes in both Compute and Storage variants. Warehouses based on those two can contain up to 100 nodes, while ones based on the smaller nodes can contain up to 32 nodes.
Scaling a cluster up and down or switching between node types is done using API calls or the AWS Management Console.
Sign up for CIO Asia eNewsletters.