Planet MySQL

Displaying posts with tag: MapReduce (reset)

Jul

2016

The Uber Engineering Tech Stack, Part II: The Edge and Beyond

Posted by Uber Engineering on Thu 21 Jul 2016 16:09 UTC
Tags:

Open Source, javascript, database, Python, mobile, data, hadoop, MapReduce, git, big data, soa, cassandra, go, riak, Hive, node.js, MySQL, node, elasticsearch, Kafka, flask, General Engineering, d3.js, IPython, Jupyter, Mapbox, Marketplace, NPM, React, Ringpop, Uber Data, UberEATS, UberRUSH

Uber Engineering

Uber’s mission is transportation as reliable as running water, everywhere, for everyone. Last time, we talked about the foundation that powers Uber Engineering. Now, we’ll explore the parts of the stack that face riders and drivers, starting …

The post The Uber Engineering Tech Stack, Part II: The Edge and Beyond appeared first on Uber Engineering Blog.

May

2015

What’s the latest with Hadoop

Posted by Hovhannes Avoyan on Tue 26 May 2015 07:52 UTC
Tags:

Java, sql, Python, cloud computing, hadoop, MapReduce, big data, internet of things, Industry Info, Forrester Research

The Big Data explosion in recent years has created a vast number of new technologies in the area of data processing, storage, and management. One of the biggest names to appear on the scene is Hadoop. In case you need a quick review, Hadoop is a Big Data storage system that takes in large amounts of data from servers and breaks it into smaller, manageable chunks. The technology is complex but at a high level the Hadoop ecosystem essentially takes a “divide and conquer” approach to processing Big Data instead of processing data in tables, as in a relational database like Oracle or MySQL.

One projection expects …

[Read more]

Nov

2014

On Hadoop RDBMS. Interview with Monte Zweben.

Posted by Roberto V. Zicari on Sun 02 Nov 2014 18:15 UTC
Tags:

Java, Uncategorized, sql, RDBMS, analytics, hadoop, MapReduce, big data, NoSQL, nosql databases, relational databases, key value store, Monte Zweben, Splice Machine

“HBase and Hadoop are the only technologies proven to scale to dozens of petabytes on commodity servers, currently being used by companies such as Facebook, Twitter, Adobe and Salesforce.com.”–Monte Zweben.

Is it possible to turn Hadoop into a RDBMS? On this topic, I have interviewed Monte Zweben, Co-Founder and Chief Executive Officer of Splice Machine.

RVZ

Q1. What are the main challenges of applications and operational analytics that support real-time, interactive queries on data updated in real-time for Big Data?

Monte Zweben: Let’s break down “real-time, interactive queries on data updated in real-time for Big Data”. “Real-time, interactive queries” means that results need to be returned in milliseconds to a few seconds.
For “Data updated in real-time” to happen, …

[Read more]

Sep

2013

Data Analytics at NBCUniversal. Interview with Matthew Eric Bassett.

Posted by Roberto V. Zicari on Mon 23 Sep 2013 14:48 UTC
Tags:

Open Source, Uncategorized, Python, amazon, cloud, analytics, hadoop, MapReduce, big data, NoSQL, MySQL, cloud stores, nosql databases, relational databases, Amazon's EC, Elastic MapReduce, Matthew Eric Bassett, NBCUniversal

“The most valuable thing I’ve learned in this role is that judicious use of a little bit of knowledge can go a long way. I’ve seen colleagues and other companies get caught up in the “Big Data” craze by spend hundreds of thousands of pounds sterling on a Hadoop cluster that sees a few megabytes [...]

Dec

2012

On Big Data, Analytics and Hadoop. Interview with Daniel Abadi.

Posted by Roberto V. Zicari on Wed 05 Dec 2012 16:49 UTC
Tags:

Uncategorized, sql, Google, analytics, hadoop, MapReduce, big data, daniel abadi, NoSQL, hadapt, MySQL, Google BigTable, nosql databases, relational databases, Yale University

“Some people even think that “Hadoop” and “Big Data” are synonymous (though this is an over-characterization). Unfortunately, Hadoop was designed based on a paper by Google in 2004 which was focused on use cases involving unstructured data (e.g. extracting words and phrases from Webpages in order to create Google’s Web index). Since it was not [...]

Nov

2012

Typical “Big” Data Architecture

Posted by Venu Anuganti on Fri 30 Nov 2012 22:15 UTC
Tags:

postgresql, sql, database, scalability, ETL, hadoop, data warehouse, MapReduce, hbase, reporting, cloudera, NoSQL, vertica, Hive, bigdata, MySQL, SAS, Big Data Architecture, Big Data Warehouse, Data Architecture, Impala, NoSQL and BigData, Data Analytics, Data Science, kognitio, druid

Here is the typical “Big” data architecture, that covers most components involved in the data pipeline. More or less, we have the same architecture in production in number of places[...]

Jul

2012

MySQL and Hadoop

Posted by Oracle MySQL Group on Thu 26 Jul 2012 11:50 UTC
Tags:

hadoop, MapReduce, MySQL, hdfs

Introduction

"Improving MySQL performance using Hadoop" was the talk which I and Manish Kumar gave at Java One & Oracle Develop 2012, India. Based on the response and interest of the audience, we decided to summarize the talk in a blog post. The slides of this talk can be found here. They also include a screen-cast of a live Hadoop system pulling data from MySQL and working on the popular 'word count' problem.

MySQL and Hadoop have been popularly considered as 'Friends with benefits' and our talk was aimed at showing how!

The benefits of MySQL to developers are the speed, reliability, data integrity and …

[Read more]

Feb

2012

A super-set of MySQL for Big Data. Interview with John Busch, Schooner.

Posted by Roberto V. Zicari on Mon 20 Feb 2012 09:28 UTC
Tags:

Oracle, Uncategorized, sql, innodb, memcached, hadoop, MapReduce, big data, mariadb, schooner, NoSQL, voltdb, MySQL, nosql databases, Apache Hadoop, SchoonerSQL, John Busch, Schooner Information Technology

“Legacy MySQL does not scale well on a single node, which forces granular sharding and explicit application code changes to make them sharding-aware and results in low utilization of severs”– Dr. John Busch, Schooner Information Technology A super-set of MySQL suitable for Big Data? On this subject, I have interviewed Dr. John Busch, Founder, Chairman, [...]

Aug

2011

451 CAOS Links 2011.08.23

Posted by The 451 Group on Tue 23 Aug 2011 16:32 UTC
Tags:

links, Linux, microsoft, opensource, Mozilla, Red Hat, 451 group, 451caostheory, 451group, caostheory, matt aslett, mattaslett, matthew aslett, matthewaslett, open-source, The 451 Group, the451group, hadoop, jay lyman, MapReduce, twitter, engine yard, bootstrap, NoSQL, mongodb, rapid7, Gluster, cloudbees, HortonWorks, cabinet office, CS2C, GlusterFS, MongoHQ, Orchestra

Engine Yard acquires Orchestra. Red Hat considers NoSQL move. And more.

# Engine Yard announced a definitive agreement to acquire Orchestra, bringing PHP expertise to the Engine Yard platform.

# Red Hat’s CEO indicated the company is interested in a NoSQL or Hadoop acquisition.

# Gluster announced Apache Hadoop compatibility in the next GlusterFS release.

# Microsoft signed an agreement with China Standard Software Co (CS2C) to …

[Read more]

Jul

2011

451 CAOS Links 2011.07.01

Posted by The 451 Group on Fri 01 Jul 2011 13:50 UTC
Tags:

links, enterprisedb, Linux, Apache, microsoft, Yahoo, opensource, benchmark, Pentaho, Red Hat, 451 group, 451caostheory, 451group, caostheory, matt aslett, mattaslett, matthew aslett, matthewaslett, open-source, The 451 Group, the451group, Platform, JasperSoft, VMWare, MapReduce, talend, android, cloudera, likewise, karmasphere, skysql, basho, rockmelt, mapr, beyondtrust, CASH music, donald j rippert, HortonWorks, Platfora, SCM Express, shadow-soft, sin, stackIQ, Xamarin

A herd of Hadoop announcements. Rockmelt raises $30m. And more.

A herd of Hadoop announcements
# Yahoo! and Benchmark Capital confirmed the formation of Hortonworks, an independent company focused on the development and support of Apache Hadoop.

# Cloudera announced the availability of Cloudera Enterprise 3.5 and the launch of Cloudera SCM Express, based on the new Service and Configuration Manager in Cloudera Enterprise 3.5.

# MapR …

[Read more]

Top Authors

Oracle MySQL Blogs

Vendor Blogs

MySQL Links