Home |  MySQL Buzz |  FAQ |  Feeds |  Submit your blog feed |  Feedback |  Archive |  Aggregate feed RSS 2.0 English Deutsch Español Français Italiano 日本語 Русский Português 中文
Previous 30 Newer Entries Showing entries 61 to 90 of 124 Next 30 Older Entries

Displaying posts with tag: hadoop (reset)

CAOS Theory Podcast 2011.11.11
+0 Vote Up -0Vote Down

Topics for this podcast:

*Continuent extends MySQL replication to Oracle Database
*CFEngine updates server automation software
*Devops moving mainstream
*Neo Technology integrates with Spring
*451 CAOS report from Hadoop World

iTunes or direct download (26:56, 4.6MB)

OSSCube is now the World’s First Cloudera / Hadoop Training Partner
+0 Vote Up -0Vote Down

We are proud to share that OSSCube has now entered into a partnership with Cloudera, as the official training partner for Cloudera Developer Training for Apache Hadoop. The pride is, essentially, in the fact that we are the World’s First Cloudera / Hadoop Training Partner.

Cloudera provides enterprises a powerful new data platform built on the popular Apache Hadoop open source software package.  Hadoop is a powerful data management platform for consolidating your data, storing your information inexpensively and reliably and understanding large, heterogeneous data sets in order to better comprehend the data deluge.

OSSCube believes this partnership will deliver first-class



  [Read more...]
OSSCube is now the World’s First Cloudera / Hadoop Training Partner
+0 Vote Up -0Vote Down

We are proud to share that OSSCube has now entered into a partnership with Cloudera, as the official training partner for Cloudera Developer Training for Apache Hadoop. The pride is, essentially, in the fact that we are the World’s First Cloudera / Hadoop Training Partner.

Cloudera provides enterprises a powerful new data platform built on the popular Apache Hadoop open source software package.  Hadoop is a powerful data management platform for consolidating your data, storing your information inexpensively and reliably and understanding large, heterogeneous data sets in order to better comprehend the data deluge.

OSSCube believes this partnership will deliver first-class Cloudera /



  [Read more...]
451 CAOS Links 2011.10.18
+0 Vote Up -0Vote Down

DOCOMO adopts, invests in Couchbase. Apache Cassandra reaches 1.0. And more.

# DOCOMO Innovations adopted Couchbase as DOCOMO Capital invested in the NoSQL database vendor.

# The Apache Software Foundation announced Apache Cassandra v1.0.

# Nuxeo announced the availability of Nuxeo Cloud.

# SGI formed a distribution relationship with Cloudera and

  [Read more...]
451 CAOS Links 2011.10.07
+0 Vote Up -0Vote Down

OpenStack Foundation. New Pentaho CEO. And more.

# Rackspace announced its intention to form an independent OpenStack Foundation.

# HP has chosen Ubuntu as the lead host and guest operating system for its Public Cloud.

# Pentaho appointed Quentin Gallivan as its new CEO.

# Hortonworks continued the discussion about contributions to Apache Hadoop.

# Bob Bickel explained why CloudBees is not, itself, open source.

# Google

  [Read more...]
Webinar: NoSQL, NewSQL, Hadoop and the future of Big Data management
+0 Vote Up -0Vote Down

Join me for a webinar where I discuss how the recent changes and trends in big data management effect the enterprise.  This event is sponsored by Red Rock and RockSolid.

Overview:

It is an exciting and interesting time to be involved in data. More change of influence has occurred in the database management in the last 18 months than has occurred in the last 18 years. New technologies such as NoSQL & Hadoop and radical redesigns of existing technologies, like NewSQL , will change dramatically how we manage data moving forward. 

These technologies bring with them possibilities both in terms of the scale of data



  [Read more...]
451 CAOS Links 2011.09.23
+0 Vote Up -0Vote Down

Red Hat revenue up 28% in Q2. Funding for NoSQL vendors. And more.

# Red Hat reported net income of $40m in the second quarter on revenue up 28% to $281.3m.

# 10gen raised $20m in funding, while DataStax closed an $11m series B round, while also releasing its DataStax Enterprise and Community products. Additionally Neo Technology

  [Read more...]
What is the biggest challenge for Big Data?
+0 Vote Up -0Vote Down

Often I think about challenges that organizations face with “Big Data”.  While Big Data is a generic and over used term, what I am really referring to is an organizations ability to disseminate, understand and ultimately benefit from increasing volumes of data.  It is almost without question that in the future customers will be won/lost, competitive advantage will be gained/forfeited and businesses will succeed/fail based on their ability to leverage their data assets.

It may be surprising what I think are the near term challenges.  Largely I don’t think these are purely technical.  There are enough wheels in motion now to almost guarantee that data accessibility will continue to improve at pace in-line with the increase in data volume.  Sure, there will continue to be lots of interesting innovation with technology, but

  [Read more...]
NSA, Accumulo & Hadoop
+0 Vote Up -0Vote Down

Reading yesterday that the NSA has submitted a proposal to Apache to incubate their Accumulo platform.  This, according to the description, is a key/value store built over Hadoop which appears to provide similar function to HBase except it provides “cell level access labels” to allow fine grained access control.  This is something you would expect as a requirement for many applications built at government agencies like the NSA.  But this also is very important for organizations in health care and law enforcement etc where strict control is required to large volumes of privacy sensitive data.

An interesting part of this is how it highlights the acceptance of Hadoop.

  [Read more...]
Hadoops Everywhere
+2 Vote Up -0Vote Down

We don’t pay enough attention to Hadoop.

By “we” I mean DBAs, the rest of the world is paying plenty of attention to Hadoop. Recently, I started asking my customers and fellow DBAs about Hadoop adoption in their company. Turns out that many of them have Hadoop. Hadoop shows up in large companies and small ones, in established industries and in startups. Its everywhere.

The way Hadoop shows up in all companies, and the way DBAs don’t pay Hadoop much attention, reminds me a lot of how MySQL started showing up in the enterprise. It didn’t start by DBAs showing up one morning and telling their managers:
“There’s this new open source database. Its not as stable as Oracle and it doesn’t have all the features we need, but man – its going to save us tons of money, and its pretty simple to manage.”

Nope,


  [Read more...]
451 CAOS Links 2011.08.23
+0 Vote Up -0Vote Down

Engine Yard acquires Orchestra. Red Hat considers NoSQL move. And more.

# Engine Yard announced a definitive agreement to acquire Orchestra, bringing PHP expertise to the Engine Yard platform.

# Red Hat’s CEO indicated the company is interested in a NoSQL or Hadoop acquisition.

# Gluster announced Apache Hadoop compatibility in the next GlusterFS release.

# Microsoft signed an agreement with China Standard Software Co (CS2C) to support CS2C

  [Read more...]
Red Hat considering NoSQL/Hadoop acquisition
+0 Vote Up -0Vote Down

InternetNews.com yesterday published an article based on an interview with Red Hat CEO Jim Whitehurst asking the question “Is Red Hat Interested in the Database Market?”

In truth there was no real need to ask the question, as Whitehurst’s comments made it pretty clear that Red Hat is interested in the database market, and specifically the NoSQL database market.

“When I say I don’t want to be a database company, I’m saying that I don’t want to be a SQL database company,” Whitehurst said.

In case the implications of that statement were not entirely clear, he later added:

“But we would be very interested in a NoSQL type database or Hadoop type thing,” Whitehurst said.

  [Read more...]
Reply to The Future of the NoSQL, SQL, and RDBMS Markets
+0 Vote Up -0Vote Down

Conor O'Mahony over at IBM wrote a good post on a favorite topic of mine “The Future of the NoSQL, SQL, and RDBMS Markets”.  If this is of interest to you then I suggest you read his original post.  I replied in the comments but thought I would also repost my reply here.

-----------------------------------------------------------------------------------------------

Hi Connor, I wish it was as simple as SQL & RDBMS is good for this and NoSQL is good for that.  For me at least, the waters are much muddier than that.

The benefit of SQL & RDBMS is

  [Read more...]
451 CAOS Links 2011.08.09
+0 Vote Up -0Vote Down

Opscode appoints a new CEO. SugarCRM gains a new CFO. And more.

# Opscode named Mitch Hill as CEO, with Jesse Robbins becoming Chief Community Officer.

# SugarCRM claimed billings up 58% in Q2 and appointed a new CFO.

# Tasktop released Tasktop Dev 2.1 and announced Tasktop Sync 1.0.

# Pentaho delivered improved support for Hadoop and various NoSQL database

  [Read more...]
451 CAOS Links 2011.08.05
+0 Vote Up -0Vote Down

Google and Microsoft trade patent claims. Actuate announces Q2 results. And more.

# Google accused Microsoft, Oracle, Apple and other companies of organising a hostile patent campaign against Android. That prompted Microsoft executives to claim that Microsoft invited Google to be involved in the CPTN purchase of Novell’s patents. However, Google explained that joining CPTN might have decreased its ability to defend itself against potential patent claims.

# Actuate announced its Q2

  [Read more...]
IA Ventures - Jobs shout out
+0 Vote Up -0Vote Down

My friends over at IA Ventures are looking both for an Analyst and for an Associate to their team.  If Big Data, New York and start-ups is in your blood then I can’t think of a better VC to be involved in. 

From the IA blog:

"IA Ventures funds early-stage Big Data companies creating competitive advantage through data and we’re looking for two start-up junkies to join our team – one full-time associate / community manager and one full time analyst. Because there are only four of us (we’re a start-up ourselves, in fact), we’ll need you to help us investigate companies, learn about industries, develop investment theses, perform internal operations, organize

  [Read more...]
Realtime Data Pipelines
+0 Vote Up -0Vote Down

In life there are really two major types of data analytics.  Firstly, we don’t know what we want to know – so we need analytics to tell us what is interesting.  This is broadly called discovery.  Secondly, we already know what we want to know – we just need analytics to tell us this information, often repeatedly and as quickly as possible.  This is called anything from reporting or dashboarding through more general data transformation and so on.

Typically we are using the same techniques to achieve this.  We shove lots of data into a repository of some from (SQL, MPP SQL, NoSQL, HDFS etc) then run queries/ jobs/ processes across that data to retrieve the information we care about.  

Now this makes sense for data discovery.  If we don’t know what we want to know, having lots of data in a big pile that we can slice and dice

  [Read more...]
HPCC vs Hadoop at a glance
+0 Vote Up -0Vote Down

Update

Since this article was written, HPCC has undergone a number of significant changes and updates. This addresses some of the critique voiced in this blog post, such as the license (updated from AGPL to Apache 2.0) and integration with other tools. For more information, refer to the comments placed by Flavio Villanustre and Azana Baksh.

The original article can be read unaltered below:

Yesterday I noticed this tweet by Andrei Savu: . This prompted me to read the related GigaOM article and then check out the  [Read more...]
Measuring the scalability of SQL and NoSQL systems.
+0 Vote Up -0Vote Down
“Our experience from PNUTS also tells that these systems are hard to build: performance, but also scaleout, elasticity, failure handling, replication. You can’t afford to take any of these for granted when choosing a system. We wanted to find a way to call these out.” – Adam Silberstein and Raghu Ramakrishnan, Yahoo! Research. ___________________________________ A [...]
451 CAOS Links 2011.05.24
+0 Vote Up -0Vote Down

Shuttleworth opens a can of worms. Fedora 15. IBM commits to Hadoop. And more.

# Mark Shuttleworth shared his thoughts on companies, contributor agreements and free software, prompting responses from Simon Phipps, and Dave Neary.

# The Fedora Project launched Fedora 15.

# IBM launched a new version of its BigInsight software, based on Hadoop, committed $100m to “big data”.

  [Read more...]
451 CAOS Links 2011.05.10
+0 Vote Up -0Vote Down

EMC launches Greenplum HD. DataStax releases Brisk. And more.

# EMC launched its Greenplum HD Hadoop distribution, with the support of Jaspersoft, Pentaho, and SnapLogic, among others.

# DataStax released its

  [Read more...]
451 CAOS Links 2011.05.03
+0 Vote Up -0Vote Down

Novell sold to Attachmate. Barnes & Noble throws the book at Microsoft. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# Novell closed its acquisition by Attachmate and its patent sale to CPTN.

# Attachmate’s CEO discussed the company’s plans for SUSE Linux.

# Barnes & Noble


  [Read more...]
451 CAOS Links 2011.04.12
+0 Vote Up -0Vote Down

Groklaw declares victory. Cloudera updates Hadoop distro. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# Groklaw claimed victory, will stop publishing new articles on May 16.

# Cloudera released version 3 of its Hadoop distribution.

# VoltDB released version 1.3 of its open source distributed in-memory database.

# Black Duck grew sales by 51% in Q1.

# eXo and Convertigo partnered to add


  [Read more...]
451 CAOS Links 2011.03.25
+0 Vote Up -0Vote Down

Red Hat grows revenue 20%+. Google withholding Honeycomb source code. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# Red Hat reported Q4 revenue up 25% to $245m, FY revenue up 22% to $909m

# Google is withholding the source code to Honeycomb for the foreseeable future.

# Rick Clark explained why he left Rackspace amid concerns that the company is exerting too much control over OpenStack.

# DataStax launched Brisk, a Hadoop/Hive


  [Read more...]
Who/What to acquire next
+1 Vote Up -0Vote Down

Well as predicted, with Aster Data recently being picked up by Teradata most of the key new generation MPP distributed analytics vendors have been acquired (Aster Data, Vertica, Netezza & Greenplum).  This had to happen and was expected to happen.  The MPP Analytics startup “revolution” is over and these technologies will now be integrated into the mainstream.

So what’s next?  As we now, if you are a massive multi-national software company it is a lot less risky to incrementally innovate and leave the development of “game changing” technologies to startups that can be acquired after

  [Read more...]
What’s hot in Big Data startups?
+0 Vote Up -0Vote Down

There are so, so many big data platforms in play at the moment it can be confusing for developers to know where to start.  For startups it used to be simple, MySQL, but dust clouds were created when all the NoSQL platforms started to crash the party 18 months or so ago.  But I do see the dust begin to settle and we are starting to see some market “leaders” appear.  A very unscientific approach is to list the technologies I hear about in the “big data startup” world on a daily basis.  These are, in no particular order:

  • MySQL - yes it is still very much hanging in there despite the Oracle acquisition.  MySQL has been helped by technologies such as AWS RDS and Xeround making it more digestible for big data startups who want
  [Read more...]
Q&A with Stephen Baker of "Final Jeopardy"
+0 Vote Up -0Vote Down

IBM's Watson natural language Question & Answer system made headlines recently with its primetime debut on Jeopardy.  Despite a few embarassing answers, Watson trounced top Jeopardy players Brad Rutter and Ken Jennings.  Watson is built from 90 IBM Power 750 IBM Linux servers with 16 terabytes of memory providing 80 Teraflops of processing power.  Watson is perhaps the most famous "Big Data" systems out there.  Watson's knowledge base

  [Read more...]
Free Hadoop class in Dallas
Employee +3 Vote Up -1Vote Down
Cloudera Instructor Tom Hanlon will be presenting a free class on Hadoop Tuesday March 15th on Dallas. Tom is a familiar face to MySQLers in the North Texas area having previous taught many MySQL classes. Pizza and Drinks will be provided.

This will be an excellent opportunity for MySQL DBAs to learn from booth a MySQL and Hadoop expert. Hadoop is a computational paradigm named Map/Reduce, where the application is divided into many small fragments of work which may be executed any node in the cluster.

Register Here as they may need to shift locations to find the anticipated crowd.
451 CAOS Links 2011.02.01
+0 Vote Up -0Vote Down

Hudson developers vote for Jenkins. SugarCRM turns cash flow positive. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# The Hudson developer community voted overwhelmingly to rename the project Jenkins, and will continue without Oracle.

# SugarCRM turned cash flow positive in 2010 as billings increased 52% year on year.

# BonitaSoft announced the release of version 5.4 of Bonita Open Solution.

# WANdisco became a sponsor of the Apache


  [Read more...]
Hadoop Cluster Setup on Debian Lenny
+0 Vote Up -0Vote Down

Today I will describe the setup of a Hadoop / HDSF multi-node cluster on Debian Lenny with a redundant Namenode using DRBD and Heartbeat, four Datanodes and Tasktracker, a Backup- Checkpointnode and Rack awareness.

Hadoop Cluster Setup on Debian Lenny purposes

This article descibes how to setup a hadoop (version 0.21.0) cluster on debian lenny (version 5.x). I will not describe how to use MapReduce.

general

Hadoop is a framework for distributed computing written in Java. The project includs the following subprojects:

  • HDFS: A distributed file system
  • MapReduce: A framework for distributed large data processing
list of references   [Read more...]
Previous 30 Newer Entries Showing entries 61 to 90 of 124 Next 30 Older Entries

Planet MySQL © 1995, 2014, Oracle Corporation and/or its affiliates   Legal Policies | Your Privacy Rights | Terms of Use

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.