MySQL: The Real Grid Database, Mark Callaghan, Chip Turner
A tremendous amount of work was also done by Wei Li and Gene Pang.
Google has a large MySQL deployment, and they enhance it as needed.
MySQL@Google: too many queries, transactions, data, and rapid growth. Real workload with OLTP and reporting. Workload at Google is *critical*.
The well known solution is to deploy a “grid database”:
- use many replicas to scale read performance
- shard your data over many masters to scale write performance (horizontal partitioning of data)
- sharding is easy, resharding is hard
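The three points above can be illustrated with a minimal sketch. It is not how Google's deployment is implemented: the `Shard` and `Grid` classes, the host names, and the hash-modulo placement rule are all assumptions made for illustration. Writes for a key go to the master of the shard that owns it, reads rotate over that shard's replicas, and `moved_fraction` shows why resharding is hard under this naive scheme.

```python
import hashlib
import itertools


def shard_index(key, num_shards):
    """Map a key to a shard with a stable hash (hypothetical placement rule)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


class Shard:
    """One master for writes plus replicas that serve reads."""

    def __init__(self, master, replicas):
        self.master = master
        self._replicas = itertools.cycle(replicas)  # simple round-robin

    def read_host(self):
        return next(self._replicas)


class Grid:
    """Routes each key to the shard that owns it."""

    def __init__(self, shards):
        self.shards = shards

    def host_for(self, key, write=False):
        shard = self.shards[shard_index(key, len(self.shards))]
        return shard.master if write else shard.read_host()


def moved_fraction(keys, old_n, new_n):
    """Fraction of keys whose owning shard changes when the shard count changes."""
    moved = sum(1 for k in keys if shard_index(k, old_n) != shard_index(k, new_n))
    return moved / len(keys)


if __name__ == "__main__":
    # Host names are made up for the example.
    grid = Grid([
        Shard("master0", ["replica0a", "replica0b"]),
        Shard("master1", ["replica1a", "replica1b"]),
    ])
    print(grid.host_for("user:42", write=True))
    print(grid.host_for("user:42"))
    keys = ["user:%d" % i for i in range(10000)]
    print(moved_fraction(keys, 4, 5))
```

With a uniform hash, going from 4 to 5 shards under modulo placement relocates roughly four keys in five, since a key stays put only when its hash gives the same remainder for both counts. That bulk data movement is the hard part of resharding; schemes such as consistent hashing or a lookup directory are common ways to reduce it, though the talk notes do not say which approach is used.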
Use a large number of small servers, so that not much capacity is lost when a server fails, and support as many servers as possible with a few DBAs.
Manageability is important at Google - make all tasks scriptable. Gives you time to solve more interesting problems, and …