The Amazon Database-as-a-Service Landscape

There is a rich ecosystem of services available on Amazon. While it’s easy to scale application servers and increasingly simple to deploy applications, there are fewer options at the database layer. We created a landscape to help you find the right database solution on Amazon for your business.

For offline analytics, Amazon Redshift will provide you with a columnar SQL database. If you need analytics to be more flexible or have unstructured data, Amazon EMR will help you run map-reduce jobs. Both of these solutions scale by adding nodes to handle larger workloads.

If you need a primary database, you have two options: Amazon DynamoDB, a scale-out NoSQL solution that provides simple key-value lookup, and Amazon RDS, a single-node SQL solution alternative that provides MySQL, SQL Server, or Oracle with space up to 1TB.

Up until now, Amazon users were out of luck if they wanted a scale-out SQL primary database that could scale data beyond 1TB (the maximum query load that RDS provides) or had query loads that were higher than a single node could handle.

This is where Clustrix comes in. Clustrix provides a scale-out SQL primary database that can grow by simply adding nodes. It speaks MySQL protocol on the wire, making the move from MySQL seamless.

Clustrix provides:

  • Near linear transaction scale – add nodes to handle more transactions
  • Fast real-time analytic capabilities – add nodes to make analytics and reporting faster
  • High availability – within a cluster, you can keep running even if you lose a node, and you can set up a DR cluster across geographies

This means that when you finally outgrow RDS, you don’t have to move out of AWS or shard. You can simply use Clustrix. Better yet, you can start off on Clustrix so you never have to switch as your application grows.