Planet MySQL

Displaying posts with tag: hadoop (reset)

May

2014

Webinar-on-Demand: Set Up & Operate Open Source Oracle Replication

Posted by Petri Virsunen of Continuent on Fri 30 May 2014 18:41 UTC
Tags:

Oracle, hadoop, vertica, MySQL, Continuent Tungsten, Continuent Tungsten Replicator

Oracle's expensive and complex replication makes it difficult to build cost-effective applications that move data in real-time to data warehouses (Oracle, Hadoop, Vertica) and popular databases like MySQL. Fortunately, Continuent Tungsten offers a solution.In this virtual course, you will learn how Continuent Tungsten solves problems with Oracle replication at a fraction of the cost of other

May

2014

Continuent at Hadoop Summit

Posted by MC Brown on Fri 30 May 2014 09:04 UTC
Tags:

Oracle, hadoop, continuent, big data, MySQL, Presentations and Conferences

I’m pleased to say that Continuent will be at the Hadoop Summit in San Jose next week (3-5 June). Sadly I will not be attending as I’m taking an exam next week, but my colleagues Robert Hodges, Eero Teerikorpi and Petri Versunen will be there to answer any questions you have about Continuent products, and, of course, Hadoop replication support built into Tungsten Replicator 3.0.

If you are at the conference, please go along and say hi to the team. And, as always, if there are any questions please let them or me know.

Filed under: Presentations and Conferences Tagged: big data, continuent, …

[Read more]

May

2014

Webinar-on-demand: Set up & operate real-time data loading into Hadoop

Posted by Petri Virsunen of Continuent on Thu 29 May 2014 19:32 UTC
Tags:

Oracle, hadoop, mysql replication, big data, MySQL, Continuent Tungsten, Continuent Tungsten Replicator

Getting data into Hadoop is not difficult, but it is complex if you want to load 'live' or semi-live data into your Hadoop cluster from your Oracle and MySQL databases. There are plenty of solutions available, from manually dumping and loading to the good and bad sides of using a tool like Sqoop. Neither are easy and both prone to the problems of lag between the moment you perform the dump and

May

2014

Real-Time Data Movement: The Key to Enabling Live Analytics With Hadoop

Posted by MC Brown on Thu 22 May 2014 20:40 UTC
Tags:

Oracle, Articles, Databases, hadoop, big data, MySQL

An article about moving data into Hadoop in real-time has just been published over at DBTA, written by me and my CEO Robert Hodges.

In the article I talk about one of the major issues for all people deploying databases in the modern heterogenous world – how do we move and migrate data effectively between entirely different database systems in a way that is efficient and usable. How do you get the data you need to the database you need it in. If your source is a transactional database, how does that data get moved into Hadoop in a way that makes the data usable to be queried by Hive, Impala or HBase?

You can read the full article here: Real-Time Data Movement: The Key to Enabling Live Analytics With Hadoop

Filed under: …

[Read more]

May

2014

Archival and Analytics - Importing MySQL data into Hadoop Cluster using Sqoop

Posted by Severalnines on Fri 16 May 2014 04:46 UTC
Tags:

Other, analytics, hadoop, mariadb, sqoop, galera, MySQL, archival

May 16, 2014 By Severalnines

We won’t bore you with buzzwords like volume, velocity and variety. This post is for MySQL users who want to get their hands dirty with Hadoop, so roll up your sleeves and prepare for work. Why would you ever want to move MySQL data into Hadoop? One good reason is archival and analytics. You might not want to delete old data, but rather move it into Hadoop and make it available for further analysis at a later stage.

In this post, we are going to deploy a Hadoop Cluster and export data in bulk from a Galera Cluster using Apache Sqoop. Sqoop is a well-proven approach for bulk data loading from a relational database into Hadoop File System. There is also Hadoop Applier available from …

[Read more]

May

2014

Cross your Fingers for Tech14, see you at OSCON

Posted by MC Brown on Thu 15 May 2014 21:09 UTC
Tags:

Oracle, Conferences, Databases, hadoop, continuent, big data, UKOUG, MySQL, Presentations and Conferences

So I’ve submitted my talks for the Tech14 UK Oracle User Group conference which is in Liverpool this year. I’m not going to give away the topics, but you can imagine they are going to be about data translation and movement and how to get your various databases talking together.

I can also say, after having seen other submissions for talks this year (as I’m helping to judge), that the conference is shaping up to be very interesting. There’s a good spread of different topics this year, but I know from having talked to the organisers that they are looking for more submissions in the areas of Operating Systems, Engineered Systems and Development (mobile and cloud).

If you’ve got a paper, presentation, or idea for one that you think would be useful, …

[Read more]

May

2014

Continuent Delivers Real-Time Data to Cloudera | Business Wire

Posted by Petri Virsunen of Continuent on Tue 06 May 2014 15:53 UTC
Tags:

Oracle, hadoop, data warehouse, cloudera, mariadb, MySQL, Continuent Tungsten, Continuent Tungsten Replicator, database replication

SAN JOSE, CA– May 6, 2014 – Continuent, Inc., a leading provider of open source database clustering and replication solutions, today announced that their recently announced Tungsten Replicator 3.0 solution has been certified by Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop™. Continuent Tungsten Replicator 3.0 enables organizations to quickly and easily

May

2014

Setup & operate Tungsten webinar series

Posted by Petri Virsunen of Continuent on Thu 01 May 2014 22:16 UTC
Tags:

Oracle, Clustering, hadoop, data warehouse, aws, vertica, GoldenGate, MySQL, Continuent Tungsten, Continuent Tungsten Replicator, database replication, database cluster

Don't miss your opportunity to learn about Continuent Tungsten via our free "Setup & Operate" webcast series. These free webcasts include live presentations and interactive Q&A.Webcast OverviewsSetup & Operate Tungsten ReplicatorMay 15th, 10:00 am PDTTungsten Replicator is an innovative and reliable tool that can solve your most complex replication problems. We will introduce Replicator

Apr

2014

See you at ICTexpo Helsinki 2014

Posted by Petri Virsunen of Continuent on Tue 22 Apr 2014 21:08 UTC
Tags:

cloud, Clustering, hadoop, MySQL, Continuent Tungsten, Continuent Tungsten Replicator, database replication, ICTexpo2014

ICTexpo Helsinki 2014 offers two effective days full of innovations, inspiration and information - the biggest professional IT show in the Nordics with large scale of solutions to help you to take your business to the next level. Continuent will be exhibiting in Red Hat Village [booth 5f31], which gathers the most significant enterprise level companies from the Open Source ecosystem in Finland

Apr

2014

Using Apache Hadoop and Impala together with MySQL for data analysis

Posted by Alexander Rubin of MySQL Performance Blog on Mon 21 Apr 2014 13:43 UTC
Tags:

scalability, hadoop, Hive, MySQL, Performance, Impala, Data Science

Apache Hadoop is commonly used for data analysis. It is fast for data loads and scalable. In a previous post I showed how to integrate MySQL with Hadoop. In this post I will show how to export a table from MySQL to Hadoop, load the data to Cloudera Impala (columnar format) and run a reporting on top of that. For the examples below I will use the “ontime flight performance” data from my previous post (Increasing MySQL performance with parallel query execution). I’ve used the Cloudera Manager v.4 to install Apache Hadoop and Impala. For this test …

[Read more]

Top Authors

Oracle MySQL Blogs

Vendor Blogs

MySQL Links