Planet MySQL

Displaying posts with tag: cloudera

Jun

2016

Posted by Zack Urlocker on Fri 03 Jun 2016 13:26 UTC
Tags:

Technology, Business, Red Hat, SugarCRM, JBoss, acquia, redmonk, cloudera, agile, disruption, datastax, MySQL, Kent Beck, monktoberfest, Software Paradox

Stephen O'Grady at RedMonk has launched a new podcast called Hark. In the second episode, he and Agile programming guru Kent Beck have a thoughtful discussion of the ideas in O'Grady's book "The Software Paradox." Even though software is "eating the world" and has become more widespread and strategic, its economic value appears to be declining rapidly. Certainly, we've seen a shift in the …


Apr

2016

Rosetta Stone: MySQL, Pig and Spark (Basics)

Posted by Todd Farmer on Wed 13 Apr 2016 20:27 UTC
Tags:

planetmysql, hadoop, cloudera, sqoop, Pig, MySQL, spark

In a world where new data processing languages appear every day, it can be helpful to have tutorials explaining language characteristics in detail from the ground up. This blog post is not such a tutorial. It also isn’t a tutorial on getting started with MySQL or Hadoop, nor is it a list of best practices for the various languages I’ll reference here – there are bound to be better ways to accomplish certain tasks, and where a choice was required, I’ve emphasized clarity and readability over performance. Finally, this isn’t meant to be a quickstart for SQL experts to access Hadoop – there are a number of SQL interfaces to Hadoop such as Impala or Hive that make Hadoop incredibly accessible to those with existing SQL skills.

Instead, this post is a pale equivalent of the …


Jan

2016

How to Deploy a Cluster

Posted by Valerie Parham-Thompson of The Pythian Group on Tue 05 Jan 2016 18:15 UTC
Tags:

cluster, hadoop, cloudera, big data, Hive, Technical Track, co-op

In this blog post I will talk about how to deploy a cluster, the methods I tried, and my solution to the prerequisites problem.

I’m fairly new to the big data field. While learning about Hadoop, I kept hearing about “clusters”: deploying a cluster, installing some services on the namenode, some on the datanode, and so on. I also heard about Cloudera Manager, which helps deploy services on a cluster, so I set up a VM and followed several tutorials, including the Cloudera documentation, to install Cloudera Manager. However, every time I reached the “cluster installation” step, my installation failed. I later found out that a Cloudera Manager installation has several prerequisites, and the missing prerequisites were the reason for the failure.

Deploy a Cluster

Though I discuss 3 other methods in detail, ultimately I recommend method …


Apr

2015

Introducing VMware Continuent 4.0 – MySQL Clustering and Real-time Replication to Data Warehouses

Posted by Petri Virsunen of Continuent on Fri 17 Apr 2015 19:37 UTC
Tags:

Oracle, VMWare, continuent, mysql replication, cloudera, mapr, MySQL, HortonWorks, Apache Hadoop, mysql disaster recovery, mysql high availability, Pivotal, Amazon Redshift, HP Vertica, vCloud Air

It’s with great pleasure we announce the general availability of VMware Continuent 4.0 – a new suite of solutions for clustering and replication of MySQL to data warehouses.

VMware Continuent enables enterprises running business-critical database applications to achieve commercial-grade high availability (HA), globally redundant disaster recovery (DR) and performance scaling. The new suite

Jul

2014

Hadoop BoF Session at OSCON

Posted by MC Brown on Fri 18 Jul 2014 10:26 UTC
Tags:

oscon, hadoop, continuent, cloudera, big data, MySQL, Presentations and Conferences, oscon2014

I have a BoF session at OSCON next week:

Migrating Data from MySQL and Oracle into Hadoop

The session is at 7pm Tuesday night – look for rooms D135 and/or D137/138.

Correction: We are now in E144 on Tuesday with the Hadoop get together first at 7pm, and the Data Migration to follow at 8pm.

I’m actually going to be joined by Gwen Shapira from Cloudera, who has a BoF session on Hadoop next door at the same time, along with Eric Herman from Booking.com. We’ll use the opportunity to talk all things Hadoop, but particularly the ingestion of data from MySQL and other databases into the Hadoop datastore.

As always, it’d be great to meet anybody interested in Hadoop at the BoF. Please come along and introduce yourselves, and …


May

2014

Continuent Delivers Real-Time Data to Cloudera | Business Wire

Posted by Petri Virsunen of Continuent on Tue 06 May 2014 15:53 UTC
Tags:

Oracle, hadoop, data warehouse, cloudera, mariadb, MySQL, Continuent Tungsten, Continuent Tungsten Replicator, database replication

SAN JOSE, CA – May 6, 2014 – Continuent, Inc., a leading provider of open source database clustering and replication solutions, today announced that its recently announced Tungsten Replicator 3.0 solution has been certified by Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop™. Continuent Tungsten Replicator 3.0 enables organizations to quickly and easily

Apr

2014

Tungsten Replicator 3.0 is Cloudera Enterprise 5 Certified

Posted by MC Brown on Wed 02 Apr 2014 20:16 UTC
Tags:

Replication, hadoop, continuent, cloudera, big data, MySQL, Coalface, tungsten-replicator, Cloudera Enterprise

One of the key platforms I’ve been testing on for MySQL to Hadoop replication has been Cloudera. That choice was largely driven by customer requirements, but Cloudera is also one of the easiest ways to get started with Hadoop.

I’m even more pleased to announce that Tungsten Replicator 3.0 is certified for use on the new Cloudera Enterprise 5 platform. That means we’re confident that you can replicate your data from MySQL to Cloudera 5 and have it work without causing problems or difficulties with Hadoop loading and materialisation.

Cloudera is a great product, and we’re very happy to be working so effectively with the new Cloudera Enterprise 5. Cloudera …


Nov

2012

Typical “Big” Data Architecture

Posted by Venu Anuganti on Fri 30 Nov 2012 22:15 UTC
Tags:

postgresql, sql, database, scalability, ETL, hadoop, data warehouse, MapReduce, hbase, reporting, cloudera, NoSQL, vertica, Hive, bigdata, MySQL, SAS, Big Data Architecture, Big Data Warehouse, Data Architecture, Impala, NoSQL and BigData, Data Analytics, Data Science, kognitio, druid

Here is the typical “Big” data architecture that covers most components involved in the data pipeline. More or less, we have the same architecture in production in a number of places[...]

Jan

2012

CAOS Theory Podcast 2012.01.20

Posted by The 451 Group on Fri 20 Jan 2012 20:24 UTC
Tags:

Oracle, Linux, podcast, opensource, solaris, caostheory, matt aslett, open-source, The 451 Group, the451group, OpenSolaris, M&A, Sun Microsystems, Systems Management, jay lyman, caos theory, cloudera, big data, Mergers and acquisitions, NoSQL, devops, newsql, MySQL, Apache Hadoop

Topics for this podcast:

*Hadoop v1.0 and year ahead
*Oracle-Cloudera deal for more Hadoop
*Oracle’s ‘Sun spot’ with Solaris
*Open Source M&A outlook for 2012
*Our new MySQL/NoSQL/NewSQL survey

iTunes or direct download (28:49, 4.9MB)

Jan

2012

OSSCube adds one more Cloudera Certified Developer in its Armor

Posted by Sonali Minocha on Fri 06 Jan 2012 09:16 UTC
Tags:

hadoop, cloudera

OSSCube now has one more Cloudera Certified Developer for Apache Hadoop. Rakesh Kumar has become a Cloudera Certified Developer through the CCDH, the industry's only certification for software developers on Hadoop. He passed the Cloudera Certified Developer for Apache Hadoop exam after going through a rigorous training program.

Rakesh is also a MySQL certified DBA and Cluster DBA and has trained several engineers for Zend Certification Examinations.

