Planet MySQL

Displaying posts with tag: partitioning (reset)

Aug

2023

The Ultimate Guide to MySQL Partitions

Posted by MySQL Performance Blog on Sat 05 Aug 2023 15:39 UTC
Tags:

innodb, partitioning, tables, partition, hash, range, Insight for DBAs, MySQL

This blog was originally published in July 2017 and was updated in August 2023.

It’s a pretty common question around here, so let’s see what we can do about that.

So, What is MySQL Partitioning?

Partitioning is a way in which a database (MySQL in this case) splits its actual data down into separate tables but still gets treated as a single table by the SQL layer.

When partitioning in MySQL, it’s a good idea to find a natural partition key. You want to ensure that table lookups go to the correct partition or group of partitions. This means that all SELECT, UPDATE, and DELETE should include that column in the WHERE clause. Otherwise, the storage engine does a scatter-gather and queries ALL partitions in a UNION that is not concurrent.

Generally, you must add the partition key into the primary key along …

[Read more]

Mar

2021

Designing a Thread Pipeline for optimal database throughput with high IPC and low CPU cache misses

Posted by Mikael Ronström on Tue 16 Mar 2021 17:04 UTC
Tags:

MySQL Cluster, partitioning, NDB Cluster, ndb, MySQL, MySQL NDB Cluster, Computer Architecture, RonDB, Thread Architecture

There are a couple of questions about the blog post on automatic thread configuration in RonDB. Rather than providing an extensive answer in the comment section I thought it was better to answer the questions in a separate blog. In addition I performed a set of microbenchmarks to verify my expectations.

The first question was the following:

ScyllaDB also does something similar with execution stages for better instruction cache. Is this similar to what you're implementing for instruction cache on separating the threads?

The second question is:

I don't really get the functional separation of threads. To execute a query, your data will now go through a pipeline of …

[Read more]

May

2020

Automate MySQL Partitioning using events and procedures

Posted by Ted Wennmark on Wed 20 May 2020 06:59 UTC
Tags:

Events, partitioning, Automation, procedures, MySQL

The purpose of this post if to show how you can automate the creation and pruning of partitioned tables. If you want to read about partitioning I recommend reading our manual.

In short partitioning makes it possible to spread your individual tables across a file system according to the partition rules you specify.

Reasons for partition your tables might be:
- Insert performance, smaller index trees for stable insert throughput.
- Select performance, only read data from selected (aka partitioning pruning) partitions.
- Delete performance, drop partitioning is must quicker than doing range deletes of old data.

Partitioning definitions is part for the CREATE/ALTER table statements, in …

[Read more]

Aug

2017

Automatic Partition Maintenance in MySQL and MariaDB: Part 3

Posted by Geoff Montee on Thu 31 Aug 2017 21:53 UTC
Tags:

partitioning, mariadb, MySQL

In part 1 and part 2 of this blog series, I showed how a DBA could configure MySQL or MariaDB to automatically drop old partitions. Some readers mentioned that they would also like for new partitions to be automatically created. In this blog post, I will show a stored procedure that can automatically create new partitions, and I will tie ... Read More

Aug

2017

Automatically Dropping Old Partitions in MySQL and MariaDB: Part 2

Posted by Geoff Montee on Wed 30 Aug 2017 21:10 UTC
Tags:

partitioning, mariadb, MySQL

In a previous blog post, I showed how a DBA could configure MySQL or MariaDB to automatically drop old partitions. Some readers provided some feedback on some issues that they’ve run into while doing similar operations. Specifically: It can sometimes help to maintain an empty first partition when partitioning by dates, since partition pruning cannot always eliminate the first partition. ... Read More

Aug

2017

Automatically Dropping Old Partitions in MySQL and MariaDB

Posted by Geoff Montee on Wed 23 Aug 2017 19:16 UTC
Tags:

partitioning, mariadb, MySQL

A MariaDB Support customer recently asked how they could automatically drop old partitions after 6 months. MariaDB and MySQL do not have a mechanism to do this automatically out-of-the-box, but it is not too difficult to create a custom stored procedure and an event to call the procedure on the desired schedule. In this blog post, I will show one ... Read More

Feb

2017

When order of appearance of indexes matters in MySQL

Posted by Rick Pizzi on Thu 23 Feb 2017 15:04 UTC
Tags:

Replication, bug, partitioning, debugging, optimizer

Sometimes MySQL surprises you in ways you would have never imagined.

Would you think that the order in which the indexes appear in a table matters?
It does. Mind you, not the order of the columns - the order of the indexes.
MySQL optimizer can, in specific circumstances, take different paths, sometimes with nefarious effects.

Please consider the following table:

CREATE TABLE `mypartitionedtable ` (
  `HASH_ID` char(64) NOT NULL,
  `RAW_DATA` mediumblob NOT NULL,
`EXPIRE_DATE` timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,
  KEY `EXPIRE_DATE_IX` (`EXPIRE_DATE`),
  KEY `HASH_ID_IX` (`HASH_ID`)
) ENGINE=TokuDB DEFAULT CHARSET=latin1 ROW_FORMAT=TOKUDB_UNCOMPRESSED
/*!50100 PARTITION BY RANGE (UNIX_TIMESTAMP(EXPIRE_DATE))
(PARTITION p2005 VALUES LESS THAN (1487847600) ENGINE = …

[Read more]

Jan

2017

What is the default sharding key in MySQL Cluster?

Posted by Abdel-Mawla Gharieb on Mon 09 Jan 2017 08:51 UTC
Tags:

MySQL Cluster, partitioning, sharding, MySQL, Abdel-Mawla Gharieb, High Availability (HA)

MySQL Cluster does an automatic sharding/partitioning to the tables across data nodes, enabling databases to scale horizontally to serve read and write-intensive workloads, but what is the default sharding key used in partitioning the data?
According to the recent update (Oct, 2016) of the MySQL Cluster white paper, primary key is the default sharding key:

By default, sharding is based on hashing of the primary key, which generally leads to a more even distribution of data and queries across the cluster than alternative approaches such as range partitioning.

However, that is not the case in all MySQL Cluster versions so far!
In this post, I’ll do some test cases on MySQL Cluster (of 4 datanodes) to confirm the default sharding key.

Testing on MySQL Cluster 7.2.26 …

[Read more]

Dec

2016

Partitions number in MySQL Cluster

Posted by Abdel-Mawla Gharieb on Fri 16 Dec 2016 08:30 UTC
Tags:

MySQL Cluster, partitioning, sharding, MySQL, Abdel-Mawla Gharieb, High Availability (HA)

As stated in the MySQL Cluster documentation:

Partition. This is a portion of the data stored by the cluster. There are as many cluster partitions as nodes participating in the cluster. Each node is responsible for keeping at least one copy of any partitions assigned to it (that is, at least one replica) available to the cluster.

According to my understanding for the previous paragraph, if we have a cluster of 6 datanodes we should have 6 partitions for each NDB table. I claim that this is not true for all cases – at least, after the introduction of ndbmtd (Multi-Threaded Daemon) in MySQL Cluster 7.2 .
In this post, I’ll do some …

[Read more]

Sep

2016

If enforcing data retention via DROP PARTITION, better watch your auto_increment counter

Posted by Rick Pizzi on Tue 27 Sep 2016 14:40 UTC
Tags:

bug, partitioning, auto_increment, MySQL

One of the main tasks that any DBA has to deal with is certainly data retention. Especially when your developers like to store logging and tracking information inside the database. This is not something that I would recommend (don't try it at home!) but when you're late to the party (read: someone had taken this path before you joined) , then you'll have to live with it.

Data retention in MySQL is usually applied by partitioning the table by RANGE on a selected column, usually containing the row creation timestamp. New partitions are created in advance for the coming days or weeks or months, and a script is used that will run nightly and drop partitions that are older than a set retention.

There are many tools that can be used to automate data retention; I have chosen the excellent pdb-parted, a nice and cozy perl script that you can find in the PalominoDB repository on GitHub (since PalominoDB is no longer in existence, …

[Read more]

Top Authors

Oracle MySQL Blogs

Vendor Blogs

MySQL Links