Planet MySQL

Displaying posts with tag: hadoop

Oct

2013

New MySQL features, related technologies at Percona Live London

Posted by Alexander Rubin of MySQL Performance Blog on Wed 23 Oct 2013 04:01 UTC
Tags:

Oracle, hadoop, facebook, mariadb, TokuDB, mongodb, cassandra, percona live, Events and Announcements, MySQL, Percona Live London, MySQL 5.6 new features

The upcoming Percona Live London conference, November 11-12, features quite a number of talks about the latest MySQL features and related technologies. There will be lots of talks about the new MySQL 5.6 features:

Opening keynote highlights MySQL 5.6 new features.
New InnoDB Compression talk will cover the new compression algorithm, implemented by Facebook and included in MySQL 5.6.
New …

[Read more]

Sep

2013

SQL to Hadoop and back again, Part 1: Basic data interchange techniques

Posted by MC Brown on Wed 25 Sep 2013 11:47 UTC
Tags:

Articles, Databases, hadoop, datamining, MySQL, ibmdeveloperworks

I’ve got a new article, which is part of a new three-part series, on moving data between SQL and Hadoop, both the export to Hadoop and importing processed content back into an SQL store.

In this first one, we look at the basic mechanics and considerations before you start the migration of data, such as the data format, content, and export techniques.
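To make the data-format consideration concrete, here is a minimal sketch of a delimited-text export. It is not taken from the article: it assumes a tab-delimited, one-row-per-line target format, uses Python's built-in `sqlite3` module as a stand-in for the SQL store, and the `notes` table, the `\N` NULL marker, and the helper names are all hypothetical.

```python
import io
import sqlite3


def clean_field(value):
    """Render one column value as text that is safe for a tab-delimited line."""
    if value is None:
        return "\\N"  # assumed NULL marker; pick whatever the target expects
    text = str(value)
    # Replace characters that would break the one-row-per-line layout.
    return text.replace("\t", " ").replace("\r", " ").replace("\n", " ")


def export_rows(conn, query, out):
    """Write every row returned by `query` to `out`, one tab-delimited line per row."""
    count = 0
    for row in conn.execute(query):
        out.write("\t".join(clean_field(v) for v in row) + "\n")
        count += 1
    return count


def demo():
    """Build a tiny in-memory table (hypothetical data) and export it as text."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE notes (id INTEGER, body TEXT)")
    conn.executemany(
        "INSERT INTO notes VALUES (?, ?)",
        [(1, "plain text"), (2, "line one\nline two"), (3, None)],
    )
    buf = io.StringIO()
    export_rows(conn, "SELECT id, body FROM notes ORDER BY id", buf)
    return buf.getvalue()
```

Calling `demo()` returns three lines of text, with the embedded newline in the second row flattened to a space and the NULL in the third row written as `\N`. Flattening is only one possible choice; escaping the characters instead would preserve the original content at the cost of a decoding step later.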

Read: SQL to Hadoop and back again, Part 1: Basic data interchange techniques

Sep

2013

Data Analytics at NBCUniversal. Interview with Matthew Eric Bassett.

Posted by Roberto V. Zicari on Mon 23 Sep 2013 14:48 UTC
Tags:

Open Source, Uncategorized, Python, amazon, cloud, analytics, hadoop, MapReduce, big data, NoSQL, MySQL, cloud stores, nosql databases, relational databases, Amazon's EC, Elastic MapReduce, Matthew Eric Bassett, NBCUniversal

“The most valuable thing I’ve learned in this role is that judicious use of a little bit of knowledge can go a long way. I’ve seen colleagues and other companies get caught up in the “Big Data” craze by spending hundreds of thousands of pounds sterling on a Hadoop cluster that sees a few megabytes [...]

Sep

2013

Percona Live London 2013: an insider’s view of the schedule

Posted by MySQL Performance Blog on Wed 18 Sep 2013 05:00 UTC
Tags:

Benchmarks, hadoop, Hive, percona live, Events and Announcements, Hardware and Storage, Insight for DBAs, Insight for Developers, MySQL, PerconaLive, Percona XtraBackup, PLUK, Percona Live London 2013, PLUK13

With the close of call for papers earlier this month, the Percona Live London conference committee was in full swing this past week reviewing all of the many submissions for November’s Percona Live London MySQL Conference.

The submissions are far-ranging and cover some really interesting topics, making the lineup for Percona Live London really strong! What the committee looks for in a submission is how much “value” a talk will bring to the conference – that is to say, it needs to be far more than a product demo. As such, real-world experiences are receiving much more favorable reviews, along with talks that cover methodologies the attendees will …

[Read more]

Sep

2013

MySQL webinar: ‘Introduction to open source column stores’

Posted by Justin Swanhart of MySQL Performance Blog on Thu 12 Sep 2013 21:39 UTC
Tags:

olap, analytics, hadoop, Infobright, luciddb, MonetDB, column stores, Justin Swanhart, MySQL, Impala, MySQL Webinars

Join me Wednesday, September 18 at 10 a.m. PDT for an hour-long webinar where I will introduce the basic concepts behind column store technology. The webinar’s title is: “Introduction to open source column stores.”

What will be discussed?

This webinar will talk about Infobright, LucidDB, MonetDB, Hadoop (Impala) and other column stores.

I will compare features between major column stores (both open and closed source).
Some benchmarks will be used to demonstrate the basic performance characteristics of the open source column stores.
There will be a question and answer session to ask me anything you like about column stores (you can also ask in the …

[Read more]

Aug

2013

Copying MySQL Data to Hadoop with Minimal Loss of Blood Part 1

Posted by Dave Stokes on Tue 27 Aug 2013 20:40 UTC
Tags:

hadoop, big data, MySQL

Ask ten DBAs for a definition of ‘Big Data’ and you will get more than ten replies. And the majority of those replies will lead you to Hadoop. Hadoop has been the most prominent of the big data frameworks in the open source world. Over 80% of the Hadoop instances in the world are fed their data from MySQL[1]. But Hadoop is made up of many parts, some confusing and many that do not play nicely with each other. It is analogous to being given a pile of automotive parts from different models and trying to come up with a car at the end of the day. So what do you do if you want to copy some of your relational data into Hadoop and want to avoid the equivalent of scraped knuckles? The answer is Bigtop, and what follows is a way to get a one-node-does-all system running so you can experiment with Hadoop, Map/Reduce, Hive, and all the other parts.

Bigtop is an Apache Project self …

[Read more]

Aug

2013

Big Data with MySQL and Hadoop at MySQL Connect 2013

Posted by Alexander Rubin of MySQL Performance Blog on Thu 08 Aug 2013 10:00 UTC
Tags:

hadoop, big data, sqoop, Hive, MySQL, flume, MySQL Connect 2013, Alexander Rubin

I will be talking about Big Data with MySQL and Hadoop at MySQL Connect 2013 (Sept. 21-22) in San Francisco as well as at Percona University in Washington, DC (September 12, 2013). Apache Hadoop is a very popular Big Data solution, and nowadays we can easily integrate it with MySQL. I will start with a brief introduction to Apache Hadoop and its components (HDFS, Map/Reduce, Hive, HBase/HCatalog, Flume, Sqoop, etc.). Next I will show 2 major Big Data scenarios:

From file to Hadoop to MySQL. This is an example of an “ELT” process: Extract data from an external source; Load data into Hadoop; Transform data/Analyze data; Extract results to MySQL. It is similar to the original Data Warehouse ETL …

[Read more]

Jul

2013

MySQL and Hadoop integration

Posted by Alexander Rubin of MySQL Performance Blog on Thu 11 Jul 2013 10:00 UTC
Tags:

hadoop, sqoop, Hive, Insight for DBAs, MySQL, Apache Hadoop, Data Science, no sql

Dolphin and Elephant: an Introduction

This post is intended for MySQL DBAs or Sysadmins who need to start using Apache Hadoop and want to integrate those 2 solutions. In this post I will cover some basic information about Hadoop, focusing on Hive as well as MySQL and Hadoop/Hive integration.

First of all, if you have been dealing with MySQL or any other relational database for most of your professional life (like I have), Hadoop may look different. Very different. Apparently, Hadoop is the opposite of any relational database. Unlike the database where we have a set of tables and indexes, Hadoop works with a set of text files. And… there are no indexes at all. And yes, this may be shocking, but all scans are sequential (full “table” scans in MySQL terms).
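The difference between the two access patterns can be sketched in a few lines. This is a toy illustration in plain Python, not Hadoop or MySQL code; the tab-delimited sample data and the function names are hypothetical.

```python
def scan_lines(lines, wanted_user):
    """Full sequential scan: every line is read and parsed, since there is no index."""
    matches = []
    for line in lines:
        user, action = line.rstrip("\n").split("\t")
        if user == wanted_user:
            matches.append(action)
    return matches


def build_index(lines):
    """Index-style access: one pass builds a lookup table, so later queries skip the scan."""
    index = {}
    for line in lines:
        user, action = line.rstrip("\n").split("\t")
        index.setdefault(user, []).append(action)
    return index


# Hypothetical "text file" contents: one tab-delimited record per line.
LOG = ["alice\tlogin\n", "bob\tlogin\n", "alice\tlogout\n"]
```

`scan_lines(LOG, "alice")` touches all three records to find two matches, while `build_index(LOG)["alice"]` returns the same result from a single dictionary lookup once the index exists. The scan costs more per query but needs no up-front structure, which is the trade-off the paragraph above describes.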

So, when does Hadoop make sense?

First, Hadoop is great if you need to …

[Read more]

Jul

2013

On Oracle NoSQL Database – Interview with Dave Segleau.

Posted by Roberto V. Zicari on Tue 02 Jul 2013 07:18 UTC
Tags:

Java, Open Source, Uncategorized, couchdb, hadoop, big data, NoSQL, mongodb, cassandra, riak, MySQL, MySQL 5.6, Amazon-Dynamo, nosql databases, Oracle NoSQL Database, document stores, New and old Data stores, Dave Segleau, LinkedIn Voldemort, Yammer, YCSB benchmark

“We went down the path of building Oracle NoSQL database because of explicit request from some of our largest Oracle Berkeley DB installations that wanted to move away from maintaining home grown sharding implementations and very much wanted an out of box technology that can replicate the robustness of what they had built “out of [...]

Jun

2013

What technologies are you running alongside MySQL?

Posted by MySQL Performance Blog on Wed 19 Jun 2013 10:00 UTC
Tags:

lucene, memcached, sphinx, hadoop, hbase, mongodb, RethinkDB, cassandra, redis, riak, couchbase, MySQL, Tarantool, polls, Inhouse Developed Technology

In many environments MySQL is not the only technology used to store in-process data.

Quite frequently, especially with large-scale or complicated applications, we use MySQL alongside other technologies for certain tasks such as reporting and caching, as well as the main data store for portions of the application.

What technologies for data storage and processing do you use alongside MySQL in your environment? Please feel free to elaborate in the comments about your use case and experiences!


[Read more]
