Planet MySQL

Displaying posts with tag: Multimaster

Aug

2019

10 Reasons Why Tungsten Clustering Beats the DIY Approach for Geo-Distributed MySQL Deployments

Posted by Continuent on Thu 29 Aug 2019 20:51 UTC
Tags: High Availability, maintenance, proxy, Clustering, connector, aws, Disaster Recovery, MySQL, replicator, multisite, Multimaster, Multimaster MySQL, Geo-Distributed

Why does the DIY approach fail to deliver vs. the Tungsten Clustering solution for geo-distributed MySQL multimaster deployments?

Before we dive into the 10 reasons, note why commercially-supported enterprise software is less risky and in fact less costly:

The labor time spent building and maintaining a DIY solution costs more than a supported solution that just works.
There is documentation, training, support, so your mission-critical process is never dependent upon an irreplaceable individual.

Tungsten Clustering is a complete solution, comprising the Replicator, Manager and Connector components.
- With DIY, you must first decide the architecture, then select the individual tools to handle each layer of the topology. …


Jun

2019

What on Earth is a Split-Brain Scenario in a MySQL Database Cluster?

Posted by Continuent on Mon 24 Jun 2019 14:14 UTC
Tags: ha, High Availability, tungsten, failsafe, MySQL, Multimaster, Mastering Tungsten Clustering, Tungsten Connector, Split-Brain

Overview

The Skinny

In this blog post we will define what a split-brain scenario means in a MySQL database cluster, and then explore how a Tungsten MySQL database cluster reacts to a split-brain situation.

Agenda

What’s Here?

Define the term “split-brain”
Briefly explore how the Tungsten Manager works to monitor the cluster health and prevent data corruption in the event of a network partition
Also explore how the Tungsten Connector works to route writes
Describe how a Tungsten MySQL database cluster reacts to a split-brain situation
Illustrate various testing and recovery procedures

Split-Brain: Definition and Impact

Sounds scary, and it is!

A split-brain occurs when a MySQL database cluster which normally has …


May

2019

The Important Role of a Tungsten Rollback Error

Posted by Continuent on Fri 24 May 2019 21:01 UTC
Tags: monitoring, ha, High Availability, Architecture, Nagios, MySQL, multisite, Multimaster, NRPE, Mastering Tungsten Replicator

The Question

Recently, a customer asked us:

What is the meaning of this error message found in trepsvc.log?

2019/05/14 01:48:04.973 | mysql02.prod.example.com | [east - binlog-to-q-0] INFO pipeline.SingleThreadStageTask Performing rollback of possible partial transaction: seqno=(unavailable)

Simple Overview

The Skinny

This message is an indication that we are dropping any uncommitted or incomplete data read from the MySQL binary logs due to a pending error.

The Answer

Safety First

This error is often seen before another error and is an indication that we are rolling back anything uncommitted, for safety. On a master this is normally very little and would likely be internal transactions in the trep_commit_seqno table, for example.
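To spot this message when scanning trepsvc.log, a small parser can help. The following is only a sketch: the regular expression is inferred from the single sample line quoted above, so other log entries may not match it, and the function name parse_rollback is hypothetical rather than part of any Tungsten tooling.

```python
import re

# Layout inferred from the one sample line in this post; other
# trepsvc.log entries may be formatted differently.
LOG_LINE = re.compile(
    r"^(?P<timestamp>\d{4}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}\.\d{3}) \| "
    r"(?P<host>\S+) \| "
    r"\[(?P<service>\S+) - (?P<thread>[^\]]+)\] "
    r"(?P<level>[A-Z]+)\s+"
    r"(?P<logger>\S+) "
    r"(?P<message>.*)$"
)

ROLLBACK_MARKER = "Performing rollback of possible partial transaction"


def parse_rollback(line):
    """Return the parsed fields if the line is a rollback notice, else None."""
    match = LOG_LINE.match(line.strip())
    if match is None or ROLLBACK_MARKER not in match.group("message"):
        return None
    fields = match.groupdict()
    # The sample reports seqno as "(unavailable)", so keep it as text.
    _, found, seqno = fields["message"].rpartition("seqno=")
    fields["seqno"] = seqno if found else None
    return fields


sample = (
    "2019/05/14 01:48:04.973 | mysql02.prod.example.com | "
    "[east - binlog-to-q-0] INFO pipeline.SingleThreadStageTask "
    "Performing rollback of possible partial transaction: seqno=(unavailable)"
)
print(parse_rollback(sample))
```

Since the rollback notice usually precedes another error, the timestamp and service fields are the useful ones for finding the entry that follows it.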

As you may know, with the replicator we always extract complete transactions, and so this particular message is …


May

2019

Understanding Cross-Site Replication in a Tungsten Composite Multi-Master Cluster for MySQL, MariaDB and Percona Server

Posted by Continuent on Wed 22 May 2019 18:42 UTC
Tags: Replication, monitoring, ha, High Availability, Architecture, site, MySQL, multisite, v6, Multi Master, Multimaster, Mastering Tungsten Clustering, CMM, Cross, Cross-site, MSMM

Overview

The Skinny

In this blog post we will discuss how the managed cross-site replication streams work in a Composite Multi-Master Tungsten Cluster for MySQL, MariaDB and Percona Server.

Agenda

What’s Here?

Briefly explore how managed cross-site replication works in a Tungsten Composite Multi-Master Cluster
Describe the reasons why the default design was chosen
Explain the pros and cons of changing the configuration
Examine how to change the configuration of the managed cross-site replicators

Cross-Site Replication

A Very Brief Summary

In a standard Composite Multi-Master (CMM) deployment, the managed cross-site replicators pull Transaction History Logs (THL) from every remote cluster’s current master node. …


May

2019

Performance Tuning Tungsten Replication to MySQL

Posted by Continuent on Tue 21 May 2019 15:57 UTC
Tags: monitoring, ha, High Availability, Architecture, Nagios, MySQL, multisite, Multimaster, NRPE, Mastering Tungsten Replicator

The Question

Recently, a customer asked us:

Why would Tungsten Replicator be slow to apply to MySQL?

The Answer

Performance Tuning 101

When you run trepctl status on a slave and see a value like this:
appliedLatency : 7332.394
it is almost always because the target database cannot keep up with the applier.

This means that we often need to look first to the database layer for the solution.
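As a rough illustration, the check above can be scripted by pulling the appliedLatency value out of captured trepctl status text. This is a minimal sketch: the function names and the 60-second threshold are hypothetical, and the parser assumes the "name : value" layout shown above.

```python
import re

# Hypothetical alert threshold in seconds; choose one that fits your workload.
LATENCY_THRESHOLD_SECONDS = 60.0

APPLIED_LATENCY = re.compile(
    r"^\s*appliedLatency\s*:\s*(-?\d+(?:\.\d+)?)\s*$", re.MULTILINE
)


def applied_latency(status_text):
    """Extract the appliedLatency value from `trepctl status` text, or None."""
    match = APPLIED_LATENCY.search(status_text)
    return float(match.group(1)) if match else None


def is_lagging(status_text, threshold=LATENCY_THRESHOLD_SECONDS):
    """True when appliedLatency is present and above the threshold."""
    latency = applied_latency(status_text)
    return latency is not None and latency > threshold


sample = "appliedLatency : 7332.394"
print(applied_latency(sample), is_lagging(sample))
```

The functions take text rather than running the command themselves, so they can be fed output captured however suits your environment.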

Here are some of the things to think about when dealing with this issue:

Architecture and Environment
√ Are you on bare metal?
√ Using the cloud?
√ Dev or Prod?
√ Network speed and latency?
√ Distance the data needs to travel?
√ Network round trip times? Is the …


May

2019

Troubleshooting Data Differences in a MySQL Database Cluster

Posted by Continuent on Thu 09 May 2019 16:40 UTC
Tags: Replication, Architecture, Percona, MySQL, replicator, Tungsten replicator, pt-table-checksum, multisite, pt-table-sync, Multimaster, Mastering Tungsten Replicator, Tungsten Clustering

Overview

The Skinny

From time to time we are asked how to check whether there are data discrepancies between Master/Slave nodes within a MySQL (or MariaDB) cluster that’s managed with Tungsten Clustering. This is always a challenging task, not least because we hope and believe that our replication mechanism would avoid such occurrences. That said, factors outside of our control can appear to “corrupt” data, such as inadvertent execution of DML against a slave using a root-level user account.

Tungsten Replicator, the core replication component in our Tungsten Clustering solution for MySQL (& MariaDB), is just that, a replicator – it takes transactions from the binary logs and replicates them around. The replicator isn’t a data synchronisation tool in that respect, the …


May

2019

SSH Differences Between Staging and INI Configuration Methods

Posted by Continuent on Tue 07 May 2019 19:54 UTC
Tags: monitoring, ha, High Availability, Architecture, Nagios, MySQL, multisite, Multimaster, NRPE, Mastering Tungsten Clustering

The Question

Recently, a customer asked us:

If we move to using the INI configuration method instead of staging, would password-less SSH still be required?

The Answer

The answer is both “Yes” and “No”

No, for installation and updates/upgrades specifically. Since INI-based configurations force the tpm command to act upon the local host only for installs and updates/upgrades, password-less SSH is not required.

Yes, because there are certain commands that do rely upon password-less SSH to function. These are:

tungsten_provision_slave
prov-sl.sh
multi_trepctl
tpm diag (pre-6.0.5)
tpm diag --hosts (>= 6.0.5)
Any tpm-based backup and restore operations that involve a remote node
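The list above can be read as a small decision table. The sketch below encodes one interpretation of it: that tpm diag always needs password-less SSH before 6.0.5, and from 6.0.5 onward needs it only when --hosts is given. The function names are hypothetical, and the backup and restore case is left out because whether it involves a remote node cannot be told from the command line alone.

```python
# Commands the list above names as relying on password-less SSH.
SSH_DEPENDENT_COMMANDS = {"tungsten_provision_slave", "prov-sl.sh", "multi_trepctl"}


def parse_version(version):
    """Turn a dotted version such as '6.0.5' into a comparable tuple."""
    return tuple(int(part) for part in version.split("."))


def needs_passwordless_ssh(argv, version):
    """Assumed reading of the list: does this command line need SSH?

    argv is the command as a list of strings; version is the installed
    Tungsten version. Backup/restore operations are not modeled here.
    """
    if not argv:
        return False
    if argv[0] in SSH_DEPENDENT_COMMANDS:
        return True
    if argv[:2] == ["tpm", "diag"]:
        if parse_version(version) < (6, 0, 5):
            return True
        return "--hosts" in argv[2:]
    return False


print(needs_passwordless_ssh(["tpm", "diag"], "6.0.4"))
print(needs_passwordless_ssh(["tpm", "update"], "6.0.5"))
```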

Summary

The Wrap-Up

In …


Apr

2019

How to Integrate Tungsten Clustering Monitoring Tools with PagerDuty Alerts

Posted by Continuent on Tue 23 Apr 2019 20:07 UTC
Tags: monitoring, ha, High Availability, Architecture, Nagios, MySQL, multisite, Multimaster, NRPE, Mastering Tungsten Clustering

Overview

The Skinny

In this blog post we will discuss how to best integrate various Continuent-bundled cluster monitoring solutions with PagerDuty (pagerduty.com), a popular alerting service.

Agenda

What’s Here?

Briefly explore the bundled cluster monitoring tools
Describe the procedure for establishing alerting via PagerDuty
Examine some of the multiple monitoring tools included with the Continuent Tungsten Clustering software, and provide examples of how to send an email to PagerDuty from each of the tools.

Exploring the Bundled Cluster Monitoring Tools

A Brief Summary

Continuent provides multiple methods out of the box to monitor the cluster health. The most popular is the suite of Nagios/NRPE scripts (i.e. cluster-home/bin/check_tungsten_*). We also have Zabbix scripts (i.e. cluster-home/bin/zabbix_tungsten_*). Additionally, there is …


Oct

2018

Tungsten Clustering versus AWS RDS/MySQL

Posted by Continuent on Thu 25 Oct 2018 15:35 UTC
Tags: cloud, High Availability, aws, Disaster Recovery, RDS, MySQL, multisite, Multimaster, MySQL High Availability And Disaster Recovery

Enterprises require high availability for their business-critical applications. Even the smallest unplanned outage, or even a planned maintenance operation, can cause lost sales and productivity and erode customer confidence. Additionally, updating and retrieving data needs to be robust to keep up with user demand.

Let’s take a look at how Tungsten Clustering helps enterprises keep their data available and globally scalable, and compare it to Amazon’s RDS running MySQL (RDS/MySQL).

Replicas and Failover

What does RDS do?

Having multiple copies of a database is ideal for high availability. RDS/MySQL approaches this with “Multi-AZ” deployments. The term “Multi-AZ” here is a bit confusing, as enabling this simply means a single “failover replica” will be created in a different availability zone from the primary database instance. …


Oct

2018

No-Downtime Cluster Software Upgrades

Posted by Continuent on Tue 23 Oct 2018 14:30 UTC
Tags: monitoring, ha, High Availability, Architecture, Nagios, MySQL, multisite, Multimaster, NRPE, Mastering Tungsten Clustering

One important way to protect your data is to keep your Tungsten Clustering software up-to-date.

A standard cluster deployment uses three nodes, which allows for no-downtime upgrades along with the ability to have a fully available cluster during maintenance.

Please note that with only two database cluster nodes, there is a window of vulnerability created by leaving zero failover candidates available when the lone slave is taken down for service.
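The arithmetic behind the two-node warning is simple enough to sketch. The helper below is purely illustrative (the name failover_candidates is hypothetical) and assumes one master per cluster, with every remaining online slave counting as a failover candidate.

```python
def failover_candidates(total_nodes, slaves_in_maintenance=1):
    """Slaves still available to take over while others are down for service."""
    slaves = total_nodes - 1  # assumes exactly one master in the cluster
    return max(slaves - slaves_in_maintenance, 0)


for nodes in (3, 2):
    print(nodes, "nodes ->", failover_candidates(nodes), "candidate(s) during maintenance")
```

With three nodes one candidate remains while a slave is serviced; with two nodes none does, which is the window of vulnerability described above.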

The Best Practices: Staging

Performing a No-Downtime Upgrade for a Staging Deployment

When upgrading a Staging-style deployment, all nodes are upgraded at once in parallel via the tools/tpm update command run from inside the staging directory on the staging host.

No Master switch happens, and all layers are restarted to use the new code. …

