“Big Data” offers the potential for organizations to revolutionize their operations. With the volume of business data doubling every 1.2 years, analysts and business users are discovering very real benefits when integrating and analyzing data from multiple sources, enabling deeper insight into their customers, partners, and business processes.
As the world’s most popular open source database, and the most deployed database in the web and cloud, MySQL is a key component of many big data platforms, with Hadoop vendors estimating 80% of deployments are integrated with MySQL.
The new Guide to MySQL and Hadoop (http://www.mysql.com/why-mysql/white-papers/mysql_wp_hadoop.php) presents the tools enabling integration between the two data platforms, supporting the data lifecycle from[Read more...]
"Last night, the world tuned in to Twitter to share the election results as U.S. voters chose a president and settled many other campaigns. Throughout the day, people sent more than 31 million election-related Tweets (which contained certain key terms and relevant hashtags). And as results rolled in, we tracked the surge in election-related Tweets at 327,452 Tweets per minute (TPM). These numbers reflect the largest election-related Twitter conversation during our 6 years of existence, though they don’t capture the total volume of all Tweets yesterday."
"Last night, Twitter averaged about 9,965 TPS from 8:11pm to 9:11pm PT, with a[Read more...]
The on-demand webinar for the MySQL Cluster 7.3 Development Release is now available.
You can learn more about the design, implementation and getting started with all of the new MySQL Cluster 7.3 features from the comfort and convenience of your own device, including:
- Auto-installation of higher performance distributed, clusters
We received some great questions over the course of the webinar, and I wanted to share[Read more...]
The first Development Milestone and Early Access releases of MySQL Cluster 7.3 were announced just several weeks ago. To provide more detail and demonstrate the new features, Andrew Morgan and I will be hosting a live webinar (http://www.mysql.com/news-and-events/web-seminars/display-726.html) this coming Thursday 25th October at 0900 Pacific Time / 16.00 UTC
Even if you can't make the live webinar, it is still worth registering for the event (http://www.mysql.com/news-and-events/web-seminars/display-726.html) as you will receive a notification when the replay will be available, to view on-demand at your convenience
In the webinar, we will discuss the enhancements being previewed as part of MySQL Cluster 7.3, including:[Read more...]
The tutorial was organized as follows:
At this weeks MySQL Connect conference, Oracle previewed an exciting new wave of developments for MySQL Cluster (http://www.mysql.com/products/cluster/), further extending its simplicity and flexibility by expanding the range of use-cases, adding new NoSQL options, and automating configuration.
In this blog, I'll introduce you to the features being previewed.
Review the blogs listed below for more detail on each of the specific features discussed.
Save the date!: A live webinar is scheduled for Thursday 25th[Read more...]
In prior years a daily update from Open World was possible, but this year my schedule was too full to support it. This is my compendium of thoughts about MySQL Connect, JavaOne, and Open World 2012.
MySQL Connect was great – good sessions re-enforcing the positive investments Oracle is making in the product. I’ll leave to others to qualify changes in what elements of technology are opened or closed along the road to a better MySQL. The announcement of Connector/Python 1.0 GA on Saturday was great news and as a community we owe a lot to Greet Vanderkelen.[Read more...]
Tutorial authored by Craig Russell and JD Duncan
- native "NoSQL" access to the storage layer without going first through SQL transformations and parsing.
While the initial implementation is designed to plug and play with Node.js, the actual[Read more...]
In my three previous blogs I wrote about our implementation of Fractal Tree Indexes on MongoDB, showing a 10x insertion performance increase, a 268x query performance increase, and a comparison of covered indexes and clustered indexes. The benchmarks show the difference that rich and efficient indexing can make to your MongoDB workload.
It’s one thing for us to benchmark MongoDB + TokuDB and another to measure real world performance. If you are looking for a way to improve the performance or[Read more...]
Last week I wrote about our 10x insertion performance increase with MongoDB. We’ve continued our experimental integration of Fractal Tree® Indexes into MongoDB, adding support for clustered indexes. A clustered index stores all non-index fields as the “value” portion of the index, as opposed to a standard MongoDB index that stores a pointer to the document data. The benefit is that indexed lookups can immediately return any requested values instead of needing to do an additional lookup (and potential disk IOs) for the requested fields.
To create a clustered index you just need to add[Read more...]
The challenge of handling massive data processing workloads has spawned many new innovations and techniques in the database world, from indexing innovations like our Fractal Tree® technology to a myriad of “NoSQL” solutions (here is our Chief Scientist’s perspective). Among the most popular and widely adopted NoSQL solutions is MongoDB and we became curious if our Fractal Tree indexing could offer some advantage when combined with it. The answer seems to be a strong “yes”.
Earlier in the summer we kicked off a small side project and here’s what we did: we implemented a “version 2” IndexInterface as a Fractal Tree index and ran some benchmarks. Note that our integration only affects MongoDB’s secondary indexes;[Read more...]
What is Ansible?
Ansible is a configuration management and deployment system, like Puppet, Capistrano, Fabric, and Chef. Its aim is to be radically simple and let you use your existing scripts to help with cluster configuration and software deployment whenever possible. Here are the ways that Ansible differentiates itself.
Ansible does not include a client/server architecture with pull-based clients (although in more recent versions, it does include pull-based configuration and deployment). Rather, it uses pre-existing network infrastructure: SSH. Every company has SSH installed on their cluster servers, and Ansible simply rides on top of this infrastructure to get the code and configuration out to the nodes.
… Ya se que estás piantao, piantao, piantao…
For my lastest blog, a review of the MySQL, NoSQL and Cloud Conference, I’ll continue to use the tango metaphor. Balada para un loco (ballad for a crazy one) is a Piazzola classic and explains what I think of Santiago Lertora from Binlogic for single handedly putting together this event; he had to be piantao (slang for ‘crazy’) to pursue his vision to kick start the Open Source database community in South America into becoming as active as it is in the US and Europe. He was able to gather some renowned speakers such as our own Martin Farach-Colton, Sheeri Cabral from Mozilla, Max Mether and Massimo Brignoli from SkySQL, Colin Charles from Monty Program, Alejandro Kojima[Read more...]
The MySQL Cluster engineering team recently ran a live webinar, available now on-demand (http://www.mysql.com/news-and-events/on-demand-webinars/display-od-716.html) demonstrating the ClusterJ and ClusterJPA NoSQL APIs for MySQL Cluster (http://www.mysql.com/products/cluster/), and how these can be used in building real-time, high scale Java-based services that require continuous availability.
Attendees asked a number of great questions during the webinar, and I thought it would be useful to share those here, so others are also able to learn more about the Java NoSQL APIs.
First, a little bit about why we developed these APIs and why they are interesting to Java developers.
ClusterJ and Cluster JPA
ClusterJ is a Java interface to MySQL Cluster that provides either a static or dynamic domain object model, similar to the data model used[Read more...]
Hosted by Binlogic, the MySQL/NoSQL/Cloud Latin American Conference at the Hilton Buenos Aires in Argentina, June 26-28th, will bring together key members of the of open source database community for two intense days of technical talks and tutorials on popular open source databases like MySQL, MariaDB, and Drizzle; NoSQL databases such as MongoDB and CouchDB; and related technologies such as the Soir and Sphinx search engines.
The event will also feature keynotes from industry leaders, and numerous opportunities for networking and socializing with one’s peers.
Already, over 250 Latin American DBAs, programmers, and system admins responsible for deploying databases in the enterprise and cloud, have registered for[Read more...]
The signal-to-noise ratio in the NoSQL world has made it hard to figure out what’s going on, or even who has something new. For all the talk of performance in the NoSQL world, much of the most exciting part of what’s new is really not about performance at all.
Take for example, MongoDB, which has a really great data model and MapReduce has a very handy scripting language. These are genuine and probably long-lasting contributions. Their innovation is all about finding a new language to use for interacting with data. They are about NoSQL.
The confusion comes, for me, when we get to the performance side of the equation. I explore this in detail in an article I did for Datanami recently – http://www.datanami.com/datanami/2012-05-22/the_sound_and_the_nosql_fury.html.
NoSQL and NewSQL database technologies pose a long-term competitive threat to MySQL’s position as the default database for Web applications, according to a new report published by 451 Research.
The report, MySQL vs. NoSQL and NewSQL: 2011-2015, examines the competitive dynamic between MySQL and the emerging NoSQL non-relational, and NewSQL relational database technologies.
It concludes that while the current impact of NoSQL and NewSQL database technologies on MySQL is minimal, they pose a long-term competitive threat due to their adoption for new development projects. The report includes market sizing and growth estimates, with the key findings as follows:
• NoSQL software vendors generated revenue* of $20m in 2011. NoSQL software revenue is expected to rapidly grow at a CAGR of 82% to reach $215m by[Read more...]
MongoDB Backup types and strategies are neatly explained in its documentation, which you can check here. In case you are not familiar with MongoDB backup types and strategies, please have a look at its documentation.
What I am describing here is a simple script which we are using since months to take MongoDB backup and transfer it over to our Backup server. Here are few things its doing: