Since starting the innotop and mysqltoolkit projects on Sourceforge, I have learned a lot about how to use source control more effectively -- especially how branching and tagging can be used. Still, I have limited experience. I want to package all the tools in MySQL Toolkit together and release them in one archive, but I don't know the best way to do it; every idea seems to have drawbacks. Read on for the details, and if you have suggestions, would you please leave comments for me?
MySQL Table Checksum 1.1.6 enhances chunking, adds features and fixes bugs. The chunking functionality is where I continue to put most of my effort. This release's behavior is incompatible with the last release, and it will probably change again in the future.
Thanks to everyone who has been helping me chase down bugs, including one user who sent me a major patch! It's a great feeling to get a patch.
Dathan says that a 128K stripe size is ideal for InnoDB:
128K - this is really good for INNODB, you’ll see a huge boost in responsiveness by making your Stripe Size 128K. I had a 64K stripe size, and I was blown away by the improvement of 128K
I’ve only benchmarked 64k and 1M. Google is using a 1M stripe size. 1M was about 5x slower for our workload. I’m going to have to benchmark 128k.
Interesting that he recommends RAID 10. We’re using RAID 0 and just using a Redundant Array of Inexpensive Servers. If one fails we just promote another one as master.
Announced at the 2007 Conference as MySQL Applications of the
Year - #1 in 3G Mobile Entertainment, Amp’d Mobile is no longer
the poster boy.
Amp’d Mobile Implodes: Burns $360 million, Declares Bankruptcy. Wow, that’s news on a Sunday.
I have some warm-up questions on how MySQL handles LOBs internally and then some questions on high availability. With HA especially, I feel like I have heard all or most of the solutions, yet each seems to fall short in one area that really hurts, and I keep stumbling over questions from people asking me which way to go. Finally, I have a little backup question for extra credit at the end.
So for the LOB questions, I am wondering whether MySQL (or any of the storage engines) stores the actual LOB data on separate data pages. I am also wondering whether the MySQL query cache has any special handling for LOBs, like ignoring all queries that fetch LOB data. I guess there is a setting for the maximum size of a result set that may go into the query cache, which could be employed to prevent a few large LOBs from filling up the query cache, but then again you …
The part of software building I called essence is the mental
crafting of the conceptual construct; the part I called accident
is its implementation process --Frederick Brooks
Data warehousing involves considerable amounts of accidental and essential complexity. While being a satisfying pursuit when successful, data
warehousing is also decidedly prone to failure. You need to learn
to manage the essential complexity and eliminate the
accidental.
Data warehousing deals with accidental complexity resulting from
availability of …
We've been running a replication poll and we've got some answers, so I thought I would comment a little on the results and on what our future plans for replication are as a result of the feedback. As I commented in the previous post, there are some items that require a significant development effort, but the feedback we got helps us to prioritize.
The top five items from the poll stand out, so I will comment on each of them in turn. The results of the poll were (when this post was written):
| Online check that Master and Slave tables are consistent | 45.4% |
| Multi-source replication: replicating from … |
Disclaimer - views expressed in this blog (and this entry)
are my own and do not necessarily reflect the views of MySQL
AB
Ever since I wrote my blog entry about Google Gears and the query tool for the
browser embedded offline Google Gears database service, I have been
wondering how MySQL might fit in here.
I have heard an idea to write a MySQL storage engine for SQLite and although I
do not think this is necessarily a bad idea, I don't think it
will be immediately useful for typical …
jag.singh@dwoptimize.com
Optimizing performance and usability of large data warehouses
using parallelism, partitions, and aggregate summaries, in
conjunction with dimensional modeling, across the complete
application stack comprising ETL, databases, and
reporting.
This is a personal blog with personal opinions.
Banner picture adapted from wikimedia.