Clustrix is the leading scale-out SQL database engineered for the cloud. With Clustrix, you can scale transaction throughput, run real-time analytics, and simplify operations.
Clustrix uses a combination of intelligent data distribution and distributed query processing so you can scale out by simply adding nodes as your Clustrix database needs to grow. A node is simply a computer with processor, memory, and local SSD storage.
Clustrix distributed database, while written from the ground up, appears like a single MySQL database instance to your application simplifying the user experience. Clustrix also provides fault tolerance for high availability and self-managing operations.
For a deep dive into our technology, visit detailed pages on Clustrix Concepts.
SQL, MySQL, and ACID Guarantees
Clustrix supports standard SQL and provides ACID guarantees. You get all the guarantees you have come to expect from your database so you know your business-critical data is safe and secure. Clustrix has extensive SQL support including:
Standard SQL constructs including:
- Joins, sub-queries, outer joins, group by, etc.
- Stored procedures
- Foreign keys
- Partitioned tables
- Temporary tables
- Online schema changes
Clustrix database is on the wire compatible with MySQL (SQL and MySQL extensions), which means that you can use your existing application code and connectors with Clustrix. We also fully support MySQL replication protocols, allowing Clustrix to serve as master and slave to your existing MySQL deployment.
Intelligent Data Distribution
Clustrix achieves scale-out by using the Distributed Shared Nothing Architecture—the only architecture known to scale linearly.
Clustrix splits and distributes data evenly across nodes with each slice having copies on other nodes. Even data distribution is maintained as nodes are added, removed, or data is inserted unevenly. You don’t need to shard or worry about the data distribution. For more details see data distribution.
Clustrix unique query evaluation model brings the query to the data. Combined with our intelligent data distribution, this drives massively parallel processing for all query types. The query is split into query fragments which run across nodes to compute the query in parallel. For more details see evaluation model.
High Availability, Fault Tolerance and Disaster Recovery
Clustrix has a rich array of options to ensure High Availability for your business critical application.
Within a cluster fault tolerance and online schema changes ensure that your cluster stays up as drives and nodes fail and developers update their schema.
Across the clusters fast parallel backup ensure you can backup your cluster periodically while it is running. Also replication across geography ensures you can backup from one cluster to another. For more details see consistency, fault tolerance and availability.
Within a Cluster
As shown in Intelligent Data distribution section, Clustrix maintains multiple copies of all your data. In case of failure, extra copies are automatically generated to replenish the lost ones. This fault tolerance in face of hardware failures ensures you have high availability with no interaction required.
Online schema changes
Clustrix supports agile development with online schema changes that let’s you change the schema of your tables without taking the database down.
Clustrix supports serial MySQL replication which is used to move from existing MySQL to Clustrix. In production Clustrix supports replication between clusters for disaster recovery and high availability.
Clustrix also supports fast parallel backup in addition to standard MySQL backup. This is multiple times faster than serial backup. The backup is a transactionally consistent global copy. Due to multi-version concurrency control, we can back a consistent snapshot of the database.