Wikipedia is the greatest encyclopedia which has ever existed, because everyone can contribute to the massive knowledge corpus. Analyzing this data with computers is becoming more and more indispensable, as nobody can survey the information by hand anymore. In order to work with the data, we have to import it into MySQL and here is how it works.
8 Older Entries »
For me, the biggest news in the last 24 hours so far has been:
- SkySQL merges with Monty Program, developers of MariaDB. This of course affects me directly and leads to a change in affiliation in a few months.
- TokuDB goes opensource. I think this is really big news. Beyond just the fact that it can now be a storage engine in the main MariaDB tree, I love the work they’re doing to extend it to be an engine for MongoDB as well.
This is a nice blog post from Asher Feldman, Site Architect, Wikipedia on how Wikipedia Adopts MariaDB. If you’re using English or German Wikipedia, or using Wikidata, you’re currently being served by MariaDB 5.5.
I think MariaDB has had a great few weeks recently and the timeline of these events are important.
- 27 November 2012 – WiredTree Adds MariaDB for Faster MySQL Database Performance (well worth reading their motivations to switch)
- 29 November 2012 – Monty Program & SkySQL release the MariaDB Client Library for C & Java
- 4 December 2012 – MariaDB Foundation is announced, see ZDNet coverage.
- mid-December 2012 – Wikimedia Foundation starts migrating …
The Maatkit article on Wikipedia was removed some time ago, after being deemed not notable. I believe this is no longer the case. It’s hard to find a credible book published on MySQL in the last few years that doesn’t mention Maatkit, there’s quite a bit of blogging about it from MySQL experts and prominent community members, and the toolkit is certainly in wide use — it’s important enough that notable companies are supporting its development. It’s available through every major Unix-like operating system’s package repository. On Debian, it’s actually part of the mysql-client package, so if you install MySQL, you automatically get Maatkit too. I believe it’s probably the second most important set of MySQL command-line tools; the most …[Read more]
There has been a significant increase in interest in the Memcached, the open source distributed memory object-caching system, in recent months, as a number of vendors look to exploit its popularity in Web 2.0 and social networking environments.
Like Hadoop, which has become the focus of a number of commercial plays, it would appear that the time is right for commercialization of Memcached. But what is it, here did it come from, and what are the chances for vendors to rake in serious cash? Here are the details.
What is it?
Pronounced mem-cash-dee, Memcached was originally created by Danga Interactive (the developer of LiveJournal, which was acquired by Six Apart in 2005) to speed up the performance of dynamic Web applications by alleviating database load. Memcached has become an industry standard for improving the performance of dynamic websites.
The code is available from the …[Read more]
“When I was a child, my mother lectured me on the evils of
‘gossip.’ She held a feather pillow and said, ‘If I tear this
open, the feathers will fly to the four winds, and I could never
get them back in the pillow. That’s how it is when you spread
mean things about people.’ For me, that pillow is a metaphor for
Wikipedia.” — John Seigenthaler Sr.
So the future pulled up in her shiny big metaphor and we got in. In the beginning, our road trip made sense, all well-ordered highways and wholesome roadside attractions. Somewhere along the way, we hit bat country and the …[Read more]
Some recent news of interest:
Wikipedia is one of the world's most visited web sites (8th in the top 10, in fact), delivering an enormous breadth of content to an audience as vast as the internet.
But Wikipedia's evolved to become more than an on-line encyclopedia: they've become one of the world's largest search engines, they're a global source of real-time news, alongisde educational, political and health related content - and one of the world's most valuable brands and media properties.
Wikipedia's also a great example of a "redshift" application: a segment of the market that's growing faster than the technology industry's capacity to innovate. Technology companies have to pay special attention to such …[Read more]
8 Older Entries »