Our Technology

Designed for Internet-Scale Applications

The Sierra Clustered Database Engine is the revolutionary technology at the heart of the Clustrix Clustered Database System. This fully distributed and parallel architecture was designed from the ground up to provide the enormous scalability, reliability, availability, and performance that Internet-scale applications require.

Unlike other database architectures that fall short in one or more critical areas, the Sierra Clustered Database Engine provides the full spectrum of database capabilities—from the flexible data model and scalability of key-value store architectures through the fully relational, ACID-compliant functionality of SQL.

Sierra Clustered Database Engine: Fully Distributed and Parallel Architecture

Whether it contains three nodes or hundreds of nodes, a Clustrix cluster appears to applications as a single database—although the data is intelligently spread across the nodes. Most traditional database architectures do not provide sufficient concurrency to scale a single database across nodes and still deliver high performance. To accomplish this, a fundamental change was required in query resolution.

The Sierra Clustered Database Engine is the shared-nothing environment that ties the independent, high performance data stores into a global, single-instance database. It processes many simultaneous queries with maximum concurrency, as well as a single query with maximum parallelism. It does this by moving the query to the data, not the data to the query. Data movement and global locking are minimized while concurrency and parallelism are maximized.

To execute queries with maximum efficiency, concurrency, and parallelism, the following Sierra components work together closely:

Sierra Parallel Planner: Breaks queries into fragments and assigns the fragments to the appropriate nodes for processing in parallel.

Sierra Distributed Execution Engine: Processes queries and returns them to the originating nodes. Processing queries in a distributed fashion—eliminating the bottleneck of central coordination common in database clusters—enables the system to scale in a linear fashion.

Scaling is Easy, Non-Disruptive, and Linear

Because the Sierra Clustered Database Engine software resides on every node of a Clustrix Distributed Database, scaling is as simple and easy as adding another CLX 4000 node—even while applications are running. Data is automatically moved to new nodes to rebalance the cluster. With each additional node, the database scales incrementally.

Resources

Sierra Clustered Database Engine Whitepaper