In TokuMX 1.5, which is right around the corner, the big feature will be partitioned collections. This feature is similar to partitioned tables in Oracle, MySQL, SQL Server, and Postgres. A question many have is “why should I use partitioned tables?” In short, it’s complicated. The answer depends on your workload, your schema, and your database of choice. For example, this …[Read more...]
May 19, 2014 By Severalnines
Thanks to everyone who attended and participated in last week’s joint webinar on ClusterControl 1.2.6! We had great questions from participants (thank you), most of which are transcribed below with our answers to them.
If you missed the sessions or would like to watch the webinar again & browse through the slides, they are now available online.
Webinar topics discussed:
MongoDB, TokuMX and MySQL use log files with high-value data. For
MongoDB this is the journal, which uses direct IO. For MySQL these
are the binlog, relay log and InnoDB redo log, all of which use
buffered IO by default. The InnoDB redo log uses 512 bytes as the
"page size" and the replication logs have no notion of page
The minimum size for a write to a storage device is either the sector size or filesystem page size. The sector size is either 512 or 4096 bytes today. In the future it will be 4096 or larger. The filesystem …
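To make the minimum write size concrete, here is a small Python sketch of how a log write gets rounded up to the write unit. It is only an illustration: the function names and the 200-byte payload are hypothetical, and it ignores read-modify-write and anything else the filesystem or device may do.

```python
def padded_write_size(payload_bytes: int, unit_bytes: int) -> int:
    """Round a write up to the next multiple of the minimum write unit.

    unit_bytes stands in for the sector size or filesystem page size
    (512 or 4096 bytes in the text above). Hypothetical helper.
    """
    if unit_bytes <= 0:
        raise ValueError("unit_bytes must be positive")
    if payload_bytes <= 0:
        return 0
    # Ceiling division, then scale back up to bytes.
    return -(-payload_bytes // unit_bytes) * unit_bytes


def write_amplification(payload_bytes: int, unit_bytes: int) -> float:
    """Bytes sent to the device divided by the bytes the log needed."""
    if payload_bytes <= 0:
        raise ValueError("payload_bytes must be positive")
    return padded_write_size(payload_bytes, unit_bytes) / payload_bytes


if __name__ == "__main__":
    payload = 200  # hypothetical size of one small log record, in bytes
    for unit in (512, 4096):
        size = padded_write_size(payload, unit)
        ratio = write_amplification(payload, unit)
        print(f"unit={unit}: device write={size} bytes, amplification={ratio:.2f}x")
```

Under these assumptions, a small log record costs the same device write whether it is 1 byte or a full unit, which is why the unit size matters for logs that are synced frequently.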
Background: If you did not read my first blog post about why I am sharing my thoughts on the benchmarks published by Mark Callaghan on Small Datum you may want to skim through it now for a little context: “Thoughts on Small Datum – Part 1”
“As the database gets used, shards can grow at an uneven rate and one shard might carry a majority of the load. MongoDB corrects this by balancing shards, but because of MongoDB’s lack of concurrency this operation can stall the database unacceptably.”–John Partridge.
I have interviewed John Partridge, President & CEO of Tokutek, Inc.
Q1. Tokutek recently announced that it has eliminated performance issues with MongoDB sharding. What was the problem?
John …[Read more...]
May 7, 2014 By Severalnines
Following the release of ClusterControl 1.2.6 a couple of weeks ago, we are now looking forward to demonstrating this latest version of the product on Tuesday next week, May 13th.
This release contains key new features (along with performance improvements and bug fixes), which we will be demonstrating live during the webinar. …[Read more...]
Yesterday I learnt an important lesson about an important
difference between NoSQL and MySQL, at least when it comes to the
marketing and hype.
I saw a tweet from the marketing team of one of the NoSQL leaders:
Most people would apparently just draw their conclusion from the tweet's text; however, I actually clicked the link and couldn't believe my eyes:
I guess that in …
This continues my work on using a very simple workload (read-only, fetch 1 document by PK, database larger than RAM, fast storage) to understand how to get more QPS from TokuMX and MongoDB. I need to get more random read IOPS from storage to get more QPS from the DBMS. Beyond getting more throughput, I ran tests to understand the impact of filesystem readahead on MongoDB and TokuMX and the impact of the read page size on TokuMX and InnoDB. My results are based on fast flash storage that can do more than 100,000 4KB reads/second. Be …[Read more...]
The title of this post should really be, “Maybe He Should Try Taking a Walk in Your Shoes.”
The he I’m referring to is economist and author, Tim Harford. The you is the people who use NewSQL and NoSQL approaches to mine big data with database platforms like MySQL and MongoDB (or, preferably, our high-performance distributions of them, …[Read more...]