The Question Recently, a customer asked us:
After importing a new section of user data into our Tungsten
cluster, we are seeing perpetually rising replication lag. We are
sitting at 8.5hrs estimated convergence time after importing
around 50 million rows and this lag is climbing continuously. We
are currently migrating some of our users from a NoSQL database
into our Tungsten cluster. We have a procedure to write out a
bunch of CSV files after translating our old data into columns
and then we recursively send them to the write master using the
mysql client. Specifically our import SQL is doing LOAD
DATA LOCAL INFILE
and the reading in a large CSV file to
do the import. We have 20k records per CSV file and we have 12
workers which insert them in parallel.
Simple Overview The Skinny
In cases like this, the slaves are having trouble with the database unable to keep up with the apply stage …
[Read more]