In my previous posts I introduced two new conflict detection
functions, NDB$EPOCH and NDB$EPOCH_TRANS, without explaining how
these functions actually detect conflicts. To simplify the
explanation I'll initially consider two circularly replicating
MySQL Servers, A and B, rather than two replicating Clusters, but
the principles are the same.
Commit ordering
Avoiding conflicts requires that data be modified on only one
Server at a time. This can be done by defining Master/Slave roles
or Active/Passive partitions etc. Where this is not done, and
data can be …
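To make the problem concrete, here is a worked example of the
kind of conflict these functions detect (table and column names
are hypothetical) :

  -- Both servers start with the same row in table t : (pk=1, x=1)
  -- On Server A :
  UPDATE t SET x = 2 WHERE pk = 1;
  -- Concurrently on Server B, before A's change has replicated :
  UPDATE t SET x = 3 WHERE pk = 1;
  -- Each UPDATE then replicates to the other server, so A ends up
  -- with x = 3 and B with x = 2 : the servers have silently
  -- diverged even though replication itself reported no errors.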
I've already described Justin Swanhart's Flexviews project as
something I think is cool. Since then Justin appears to
have been working more on Shard-Query, which I also think is cool, perhaps
even more so than Flexviews.
On the page linked above, Shard-Query is described using the
following statements :
"Shard-Query is a distributed parallel query engine for
MySQL"
"ShardQuery is a PHP class which is intended to make working with
a partitioned dataset easier""ParallelPipelining - MPP
distributed query engines runs fragments of queries in parallel,
combining the results at the end. Like map/reduce except it
speaks SQL directly."
The things I like from the above description :
- Distributed …
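To illustrate the 'runs fragments of queries in parallel' idea,
here is a conceptual sketch of how an aggregate over a sharded
table can be decomposed. This is my own illustration, not
Shard-Query's actual rewrite rules, and the table names are
hypothetical :

  -- Original query, as the application writes it :
  SELECT status, COUNT(*) FROM orders GROUP BY status;
  -- Step 1 : run a partial aggregate on every shard, in parallel
  SELECT status, COUNT(*) AS partial_count
  FROM orders GROUP BY status;
  -- Step 2 : combine the collected partial results at the end
  SELECT status, SUM(partial_count)
  FROM all_shard_results GROUP BY status;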
In my last post I described the motivation for the new NDB$EPOCH
conflict detection function in MySQL
Cluster. This function detects when a row has been
concurrently updated on two asynchronously replicating MySQL
Cluster databases, and takes steps to keep the databases in
alignment.
With NDB$EPOCH, conflicts are detected and handled at row
granularity, as opposed to column granularity, since row is the
granularity of the epoch metadata used to detect conflicts.
Dealing with conflicts on a row-by-row basis has implications for
schema and application design. The NDB$EPOCH_TRANS function
extends NDB$EPOCH, giving …
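For reference, both functions are selected per-table via the
mysql.ndb_replication table, and conflicting rows can optionally
be recorded in an exceptions table. A minimal sketch, assuming a
table test.t1 with an integer primary key pk (binlog_type 7
requests the full-row, update-as-update binlog format these
functions need) :

  INSERT INTO mysql.ndb_replication
    (db, table_name, server_id, binlog_type, conflict_fn)
  VALUES
    ('test', 't1', 0, 7, 'NDB$EPOCH_TRANS()');
  -- Optional exceptions table, named <table>$EX, in which
  -- conflicting operations are recorded :
  CREATE TABLE test.t1$EX (
    server_id        INT UNSIGNED,
    master_server_id INT UNSIGNED,
    master_epoch     BIGINT UNSIGNED,
    count            INT UNSIGNED,
    pk               INT NOT NULL,  -- copy of the base table's primary key
    PRIMARY KEY (server_id, master_server_id, master_epoch, count)
  ) ENGINE=NDB;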
tl;dr : New 'automatic' optimistic conflict detection functions
are available, giving the best of both optimistic and pessimistic
replication on the same data.
MySQL replication supports a number of topologies, and one of the
most interesting is an active-active, or master-master topology,
where two or more Servers accept read and write traffic, with
asynchronous replication between them.
This topology has a number of attractions, including :
- Potentially higher availability
- Potentially low impact on read/write latency
- Service availability insensitive to replication failures
- Conceptually simple
However, data consistency is hard to maintain in this
environment. Data, and access to it, must usually be partitioned …
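For the simple two-server case, the topology itself is just
circular replication, with each server pointing at the other. A
minimal sketch (host names, credentials and binlog coordinates
are placeholders) :

  -- On server B, replicate from A :
  CHANGE MASTER TO
    MASTER_HOST = 'serverA',
    MASTER_USER = 'repl',
    MASTER_PASSWORD = '...',
    MASTER_LOG_FILE = 'binlog.000001',
    MASTER_LOG_POS = 4;
  START SLAVE;
  -- On server A, replicate from B in the same way, closing the
  -- circle. Each server must have a distinct server_id so that
  -- its own events are discarded when they arrive back round.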
Unlike most other MySQL storage engines, Ndb does not perform all
of its work in the MySQLD process. The Ndb table handler maps
Storage Engine Api calls onto NdbApi calls, which eventually result in
communication with data nodes. In terms of layers, we have SQL
-> Handler Api -> NdbApi -> Communication. At each of
these layer boundaries, the mapping from operations at the
upper layer to operations at the lower layer is non-trivial,
based on runtime state, statistics, optimisations etc.
The MySQL status variables can be used to understand the
behaviour of the MySQL Server in terms of user commands
processed, and also how these map to some of the Storage Engine
Handler Api calls.
Status variables tracking user commands start with …
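As an illustration, counters from each layer can be sampled
before and after running a workload and compared. The variable
names below are from the MySQL manual (the Ndb-specific counters
available depend on the Cluster version) :

  SHOW GLOBAL STATUS LIKE 'Com_select';     -- user commands, SQL layer
  SHOW GLOBAL STATUS LIKE 'Handler_read%';  -- Handler Api calls
  SHOW GLOBAL STATUS LIKE 'Ndb_%';          -- Ndb-specific counters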
Most people looking at a diagram showing the Cluster architecture
soon want to know if the system can scale online. Api nodes such
as MySQLD processes can be added online, and the storage capacity
of existing data nodes can be increased online, but it was not
always possible to add new data nodes to the cluster without an
initial system restart requiring a backup and restore.
An online add node and data repartitioning feature was finally
implemented in MySQL Cluster 7.0. It's not clear how often users
actually do scale their Clusters online, but it certainly is a
cool thing to be able to do.
There are two parts to the feature :
- Online add an empty data node to an existing cluster
- Online rebalance existing data across the existing and new data nodes
Adding an empty data node to a cluster sounds trivial, but is
actually fairly complex given the cluster's …
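For reference, once the new (empty) data nodes are in the cluster
configuration and started, the rebalancing half of the feature is
driven per-table from SQL. A sketch, assuming the new nodes have
ids 3 and 4 and a table mydb.t1 :

  -- In the ndb_mgm management client, form a node group from
  -- the new data nodes :
  --   ndb_mgm> CREATE NODEGROUP 3,4
  -- Then repartition each table across all node groups :
  ALTER ONLINE TABLE mydb.t1 REORGANIZE PARTITION;
  -- Finally reclaim the space freed on the original nodes :
  OPTIMIZE TABLE mydb.t1;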
MySQL Cluster distributes rows amongst the data nodes in a
cluster, and also provides data replication. How does this work?
What are the tradeoffs?
Table fragments
Tables are 'horizontally fragmented' into table fragments, each
containing a disjoint subset of the rows of the table. The union
of rows in all table fragments is the set of rows in the table.
Rows are always identified by their primary key. Tables with no
primary key are given a hidden primary key by MySQLD.
By default, one table fragment is created for each data node in
the cluster at the time the table is created.
Node groups and Fragment replicas
The data nodes in a cluster are logically divided into Node
groups. The size of each Node group is controlled by the
NoOfReplicas parameter. All data nodes in a Node group store the
same data. In other words, where the NoOfReplicas parameter is
two or greater, each …
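A worked example may help : assume four data nodes (1,2,3,4) and
NoOfReplicas=2. The exact fragment placement below is
illustrative; the real layout of any table can be inspected with
the ndb_desc tool shipped with MySQL Cluster :

  CREATE TABLE mydb.t1 (id INT PRIMARY KEY, val VARCHAR(32)) ENGINE=NDB;
  -- 4 data nodes, NoOfReplicas = 2
  --   Node group 0 : nodes 1,2     Node group 1 : nodes 3,4
  -- t1 gets 4 fragments by default, spread across the node groups,
  -- each replicated on every node in its node group, e.g. :
  --   F0 -> NG0 (primary on node 1, backup on node 2)
  --   F1 -> NG1 (primary on node 3, backup on node 4)
  --   F2 -> NG0 (primary on node 2, backup on node 1)
  --   F3 -> NG1 (primary on node 4, backup on node 3)
  --   shell> ndb_desc -d mydb t1 -p   -- shows the actual layout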
When Sun Microsystems bought MySQL AB in 2008 (or did MySQL buy
Sun?), most of the MySQL team merged with the existing Database
Technology Group (DBTG) within Sun. The DBTG had been busy
working on JavaDB, Postgres and other DB related projects as well
as 'High Availability DB' (HADB), which was Sun's name for the
database formerly known as Clustra.
Clustra originated as a University research project which spun
out into a startup company and was then acquired by Sun around
the dot-com era. A number of technical papers describing aspects
of Clustra's design and history can be found online, and Clustra
is in many ways similar to Ndb Cluster, beyond their shared
Scandinavian roots. Both are
shared-nothing parallel databases originally aimed at the
Telecoms market, supporting high availability and horizontal
scalability. Clustra has an impressive feature set and …
One thing that has puzzled me about MySQL Server is that it
became famous for sharded scale-out deployments in well-known web
sites and yet has no visible support for such deployments. The
MySQL killer feature for some time has been built-in asynchronous
replication and gigabytes of blogs have been written about how to
setup, use, debug and optimise replication, but when it comes to
'sharding' there is nothing built in. Perhaps to have attempted
to implement something would have artificially constrained users'
imaginations, whereas having no support at all has allowed 1,000
solutions to sprout? Perhaps there just wasn't MySQL developer
bandwidth available, or perhaps it just wasn't the best use of
the available time. In any case, it remains unclaimed territory
to this day.
On first hearing of the Federated storage engine some years ago, …
Most software people are aware of the ACID acronym, coined by
Härder and Reuter to describe the transaction properties
identified by Jim Gray. With the growth of the web and open
source, the scaling and complexity constraints imposed on DBMS
implementations supporting ACID are more visible, and new (or at
least new terms for known) compromises and tradeoffs are being
discussed widely. The better known NoSQL
systems are giving insight by example into particular choices of
tradeoffs.
Working at MySQL, I have often been surprised at the variety of
potential alternatives when implementing a DBMS, and the number
of applications which don't need the full set of ACID letters in
the strictest form. The original MySQL storage engine, MyISAM,
is one of the first and most successful examples of an 'ACID
remix'. The people …