Planet MySQL

Displaying posts with tag: Statistics

Feb

2018

Catching Slow and Frequent Queries with ProxySQL

Posted by MySQL Performance Blog on Tue 27 Feb 2018 21:54 UTC
Tags:

monitoring, Statistics, query, slow queries, Insight for DBAs, MySQL, proxysql, stats_mysql_query_digest

In this blog post, I’ll look at how to catch slow and frequent queries with ProxySQL.

More and more people are using ProxySQL because it is a great tool and it can help DBAs a lot. But many people do not realize that it is more powerful than it looks. It has many features and possibilities. I am going to show you one of my favorite “tricks” / use cases.

There are plenty of blog posts explaining how ProxySQL works. I am not going to do that again. Instead, let’s jump straight to the point. There is a table in ProxySQL called “stats.stats_mysql_query_digest”. It is one of my favorite tables because it records all the queries that ran through ProxySQL. Without collecting any queries on the MySQL server, I can find …
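As a rough illustration of the idea, here is a minimal sketch of ranking query digests by total time and by execution count. It is not taken from the original post. The in-memory SQLite table is only a stand-in that mimics a subset of the stats_mysql_query_digest columns (digest_text, count_star, sum_time, with sum_time in microseconds), the rows are made up, and the helper top_digests is hypothetical. On a real instance, a similar SELECT would be issued through the ProxySQL admin interface instead.

```python
import sqlite3

# Stand-in for the ProxySQL admin interface: an in-memory SQLite table with a
# subset of the stats_mysql_query_digest columns. All rows are made up.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE stats_mysql_query_digest ("
    "digest_text TEXT, count_star INTEGER, sum_time INTEGER)"
)
conn.executemany(
    "INSERT INTO stats_mysql_query_digest VALUES (?, ?, ?)",
    [
        ("SELECT * FROM orders WHERE customer_id = ?", 120, 9_600_000),
        ("SELECT name FROM users WHERE id = ?", 50_000, 2_500_000),
        ("UPDATE sessions SET last_seen = ? WHERE id = ?", 8_000, 400_000),
    ],
)


def top_digests(order_by, limit=2):
    """Return the top digests ranked by 'sum_time' or 'count_star' (hypothetical helper)."""
    # Only allow known column names, since the column is interpolated into the SQL text.
    if order_by not in {"sum_time", "count_star"}:
        raise ValueError(f"unsupported ordering: {order_by}")
    query = (
        "SELECT digest_text, count_star, sum_time, "
        "sum_time / count_star AS avg_time_us "
        "FROM stats_mysql_query_digest "
        f"ORDER BY {order_by} DESC LIMIT ?"
    )
    return conn.execute(query, (limit,)).fetchall()


slowest = top_digests("sum_time")          # queries that consumed the most total time
most_frequent = top_digests("count_star")  # queries that ran most often

for digest_text, count_star, sum_time, avg_time_us in slowest:
    print(f"{sum_time:>10} us total, {count_star:>6} runs, {avg_time_us:>6} us avg: {digest_text}")
```

Ranking by both columns is useful because a query can be expensive either by being slow per execution or by being cheap but run very often.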

Feb

2018

Understand Your Prometheus Exporters with Percona Monitoring and Management (PMM)

Posted by MySQL Performance Blog on Tue 20 Feb 2018 22:40 UTC
Tags:

Statistics, metrics, database monitoring, MySQL, Prometheus, Percona Monitoring and Management, PMM, exporters

In this blog post, I will look at the new dashboards in Percona Monitoring and Management (PMM) for Prometheus exporters.

Percona Monitoring and Management (PMM) uses Prometheus exporters to capture metrics data from the system it monitors. Those Prometheus exporters are an important part of your monitoring infrastructure, and understanding their performance and other operational details is critical for well-implemented monitoring.

To help you with this we’ve added a number of new dashboards to Percona Monitoring and Management.

The Prometheus Exporters Overview dashboard provides a high-level overview of your installed Prometheus exporter …

Oct

2017

Big Dataset: All Reddit Comments – Analyzing with ClickHouse

Posted by MySQL Performance Blog on Tue 03 Oct 2017 00:11 UTC
Tags:

postgresql, Statistics, mongodb, metrics, database monitoring, MySQL, MySQL 8.0, ClickHouse, Yandex ClickHouse

In this blog, I’ll use ClickHouse and Tabix to look at a new very large dataset for research.

It is hard to come across interesting datasets, especially big ones (and by big I mean one billion rows or more). Previously, I’ve used the on-time airline performance data available from the Bureau of Transportation Statistics. Another recent example is the NYC Taxi and Uber Trips data, with over one billion records.

However, today I wanted to mention an interesting dataset I found recently that has been available since 2015. This is Reddit’s comments and submissions dataset, made possible thanks to Reddit’s generous API. The …

Sep

2017

Updating InnoDB Table Statistics Manually

Posted by Sveta Smirnova of MySQL Performance Blog on Mon 11 Sep 2017 19:00 UTC
Tags:

innodb, monitoring, Statistics, database monitoring, Insight for DBAs, Insight for Developers, MySQL, InnoDB tables

In this post, we will discuss how to fix cardinality for InnoDB tables manually.

As a support engineer, I often see situations when the cardinality of a table is not correct. When InnoDB calculates the cardinality of an index, it does not scan the full table by default. Instead it looks at random pages, as determined by options innodb_stats_sample_pages, innodb_stats_transient_sample_pages and innodb_stats_persistent_sample_pages, or …

Jul

2017

Multi-Threaded Slave Statistics

Posted by MySQL Performance Blog on Wed 19 Jul 2017 17:02 UTC
Tags:

Replication, Statistics, Insight for DBAs, MySQL, MTS, multi-threaded slave

In this blog post, I’ll talk about the multi-threaded slave statistics printed in the MySQL error log file.

MySQL version 5.6 and later allows you to execute replicated events using parallel threads. This feature is called Multi-Threaded Slave (MTS), and to enable it you need to modify the slave_parallel_workers variable to a value greater than 1.

Recently, a few customers asked about the meaning of some new statistics printed in their error log files when they enable MTS. These messages look similar to the example below:

[Note] Multi-threaded slave statistics for channel '': seconds elapsed = 123; events assigned = 57345; worker queues filled over overrun level = 0; waited due a Worker queue full = 0; waited due the total size = 0; waited at clock conflicts = 0 waited (count) …
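To make the counters in such a line easier to work with, here is a minimal parsing sketch. It is not part of the original post. The function name parse_mts_stats and the regular expression are assumptions based only on the "name = number" layout visible in the excerpt, and the sample string is the excerpt's line cut at the point where it is truncated.

```python
import re

# The log line from the excerpt, up to the point where the excerpt is truncated.
SAMPLE = (
    "[Note] Multi-threaded slave statistics for channel '': "
    "seconds elapsed = 123; events assigned = 57345; "
    "worker queues filled over overrun level = 0; "
    "waited due a Worker queue full = 0; waited due the total size = 0; "
    "waited at clock conflicts = 0"
)

# A counter name (letters, spaces, parentheses) followed by '=' and an integer.
PAIR = re.compile(r"([A-Za-z][A-Za-z ()]*?)\s*=\s*(\d+)")


def parse_mts_stats(line):
    """Return {counter name: int} for every 'name = number' pair in the line."""
    # Drop the prefix up to the channel name so it cannot leak into the first key.
    _, _, body = line.partition("': ")
    return {name: int(value) for name, value in PAIR.findall(body or line)}


stats = parse_mts_stats(SAMPLE)
# Average number of events handed to workers per second over the reporting interval.
events_per_second = stats["events assigned"] / stats["seconds elapsed"]

for name, value in stats.items():
    print(f"{name}: {value}")
print(f"events per second: {events_per_second:.1f}")
```

A regular expression is used instead of splitting on semicolons because the excerpt shows that not every counter is separated by one.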

Dec

2014

Some Notes on Index Statistics in InnoDB

Posted by MySQL Server Dev Team on Mon 15 Dec 2014 13:04 UTC
Tags:

innodb, Statistics, optimizer, Performance, persistent statistics

In MySQL 5.6 we introduced a huge improvement in the way that index and table statistics are gathered by InnoDB and subsequently used by the Optimizer during query optimization: Persistent Statistics. Some aspects of the way that Persistent Statistics work could be improved further though, and we’d really like your input on that.

How much to sample?

The statistics are gathered by picking some pages semi-randomly, analyzing them, and deriving some conclusions about the entire table and/or index from those analyzed pages. The number of pages sampled can be specified on a per-table basis with the STATS_SAMPLE_PAGES clause. For example:

ALTER TABLE t STATS_SAMPLE_PAGES=500;

This way …

Jul

2013

What the Mean Really Means

Posted by Brendan Gregg on Fri 12 Jul 2013 17:07 UTC
Tags:

Uncategorized, Statistics, Performance, frequencytrail, visualizations, averages

When analyzing response time, or latency, you need much more information than an average provides. The average, commonly the arithmetic mean, shows the index of central tendency. But, as I found in earlier posts, the tendency is often not central, but may be skewed by outliers, or split by multiple modes. How often these factors occur was determined quantitatively, using tests and a survey of hundreds of production servers and different types of latency: over 95% had six-sigma outliers, and at least 20% had multiple modes. While these numerical results are useful, nothing beats a visualization, such as a histogram, …

Jul

2013

Modes and Modality

Posted by Brendan Gregg on Mon 08 Jul 2013 19:59 UTC
Tags:

Statistics, Performance, frequencytrail, visualizations

It is a truth universally acknowledged that the average is the index of central tendency. But what if the tendency isn’t central?

I’ve worked many performance issues where the latency or response time was multimodal, and higher-latency modes turned out to be the cause of the problem. Their existence isn’t shown by the average – the arithmetic mean; it could only be seen by examining the distribution as a histogram, density plot, heat map, or frequency trail. Once you know that more than one mode is present, it’s often straightforward to determine what causes the slower mode, by seeing what parameters of …
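A small sketch can make the point concrete. It is not from the original post, and the latency values are invented for illustration: a fast mode around 1 to 2 ms and a slow mode around 10 to 11 ms. The mean lands between the two modes, where no request actually falls, while a simple text histogram shows both modes at once.

```python
from collections import Counter
from statistics import mean

# Made-up latencies in milliseconds: a fast mode (1-2 ms) and a slow mode (10-11 ms).
latencies_ms = [1] * 50 + [2] * 30 + [10] * 15 + [11] * 5

avg = mean(latencies_ms)
histogram = Counter(latencies_ms)

# Count how many samples sit within 1 ms of the mean.
near_avg = sum(1 for x in latencies_ms if abs(x - avg) <= 1)

# Text histogram: one '#' per sample, one row per observed latency value.
for value in sorted(histogram):
    print(f"{value:>3} ms | {'#' * histogram[value]}")
print(f"mean = {avg} ms, samples within 1 ms of the mean: {near_avg}")
```

With this data the mean describes a latency that no request experienced, which is exactly the situation where looking at the distribution pays off.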

Jul

2012

Statistical functions in MySQL

Posted by Robert Eisele on Sat 14 Jul 2012 01:39 UTC
Tags:

PHP, udf, Statistics, Analysis, MySQL, infusion

Even with a growing market of specialized NoSQL databases, the relevance of the traditional RDBMS doesn't decline, especially when it comes to calculating aggregates over complex data sets that cannot be processed as a batch, MapReduce-style. MySQL already ships with a handful of aggregate functions that can be useful for statistical analysis. The best known of these are certainly:
