Showing entries 1 to 30 of 70

Displaying posts with tag: Pentaho

The Data Day, A few days: April 22-26 2013

Pivotal launches. SkySQL and Monty Program merge. And much, much more

Our report on the changes in the MySQL ecosystem is now available for 451 clients and non-clients alike at bit.ly/451mysql

— Matt Aslett (@maslett) April 25, 2013

For 451 Research clients: VMware expands Serengeti’s horizons with updated Hadoop virtualization project bit.ly/17muQFI

— Matt Aslett (@maslett) April 26, 2013

For 451 Research clients: SkySQL, Monty Program merge to support MariaDB following formation of MariaDB Foundation bit.ly/10dsdjf

  [Read more...]
451 CAOS Links 2011.11.01

Appcelerator raises $15m. Hortonworks launches Data Platform. And more.

# Appcelerator raised $15m in a third round led by Mayfield Fund, Translink Capital and Red Hat.

# Modo Labs closed a $4m investment from Storm Ventures and New Magellan Ventures.

# Hortonworks launched its Hortonworks Data Platform Apache Hadoop distribution, as well as a new partner program. Eric Baldeschwieler put the

  [Read more...]
451 CAOS Links 2011.10.07

OpenStack Foundation. New Pentaho CEO. And more.

# Rackspace announced its intention to form an independent OpenStack Foundation.

# HP has chosen Ubuntu as the lead host and guest operating system for its Public Cloud.

# Pentaho appointed Quentin Gallivan as its new CEO.

# Hortonworks continued the discussion about contributions to Apache Hadoop.

# Bob Bickel explained why CloudBees is not, itself, open source.

# Google

  [Read more...]
Proposals for Codebits.EU

Codebits is an annual 3-day conference about software and, well, code. It's organized by SAPO and this year's edition is to be held on November 10 through 12 at the Pavilhão Atlântico, Sala Tejo in Lisbon, Portugal.

I've never attended SAPO Codebits before, but I heard good things about it from Datacharmer Giuseppe Maxia. The interesting thing about the way this conference is organized is that all proposals are available to the public, who can also vote on them. This year's proposals are looking very interesting already, with high

  [Read more...]
451 CAOS Links 2011.08.09

Opscode appoints a new CEO. SugarCRM gains a new CFO. And more.

# Opscode named Mitch Hill as CEO, with Jesse Robbins becoming Chief Community Officer.

# SugarCRM claimed billings up 58% in Q2 and appointed a new CFO.

# Tasktop released Tasktop Dev 2.1 and announced Tasktop Sync 1.0.

# Pentaho delivered improved support for Hadoop and various NoSQL database

  [Read more...]
Real-time streaming data aggregation

Dear Kettle users,

Most of you usually use a data integration engine to process data in a batch-oriented way. Pentaho Data Integration (Kettle) is typically deployed to run monthly, nightly or hourly workloads. Sometimes folks run micro-batches of work every minute or so. However, it’s less well known that our beloved transformation engine can also be used to stream data indefinitely (never ending) from a source to a target. This sort of data integration is sometimes referred to as being “streaming”, “real-time”, “near real-time”, “continuous” and so on. Typical examples of situations where you have a never-ending supply of data that needs to be processed the instant it becomes available are JMS (Java Message Service), RDBMS log sniffing, on-line fraud

  [Read more...]
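
To make the idea of aggregating a never-ending stream a little more concrete, here is a minimal sketch in plain Java. It is not how Kettle does it and it is not taken from the post: the class name, the fixed-size (tumbling) time window and the choice to sum values are all assumptions made for illustration. Rows are fed in one at a time, and a finished window is emitted as soon as a row arrives that belongs to a different window.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: sums values per fixed time window as rows stream in. */
final class TumblingWindowSketch {

    /** One finished window: its start time, row count and sum. */
    static final class WindowResult {
        final long windowStartMillis;
        final long rowCount;
        final double sum;

        WindowResult(long windowStartMillis, long rowCount, double sum) {
            this.windowStartMillis = windowStartMillis;
            this.rowCount = rowCount;
            this.sum = sum;
        }
    }

    private final long windowMillis;
    private final List<WindowResult> emitted = new ArrayList<>();
    private long currentWindowStart;
    private long rowCount;
    private double sum;

    TumblingWindowSketch(long windowMillis) {
        if (windowMillis <= 0) {
            throw new IllegalArgumentException("windowMillis must be positive");
        }
        this.windowMillis = windowMillis;
    }

    /** Feed one row; the open window is emitted when the row falls in a different window. */
    void accept(long timestampMillis, double value) {
        long windowStart = Math.floorDiv(timestampMillis, windowMillis) * windowMillis;
        if (rowCount > 0 && windowStart != currentWindowStart) {
            flush();
        }
        currentWindowStart = windowStart;
        rowCount++;
        sum += value;
    }

    /** Emit the open window, if it holds any rows, and reset the running totals. */
    void flush() {
        if (rowCount == 0) {
            return;
        }
        emitted.add(new WindowResult(currentWindowStart, rowCount, sum));
        rowCount = 0;
        sum = 0.0;
    }

    /** Windows emitted so far, oldest first. */
    List<WindowResult> emitted() {
        return emitted;
    }
}
```

Note that this sketch keeps only one open window, so a late or out-of-order row simply closes the current window and starts another. A real streaming setup would need a policy for late data and a timer to flush quiet windows.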
451 CAOS Links 2011.07.01

A herd of Hadoop announcements. Rockmelt raises $30m. And more.

A herd of Hadoop announcements
# Yahoo! and Benchmark Capital confirmed the formation of Hortonworks, an independent company focused on the development and support of Apache Hadoop.

# Cloudera announced the availability of Cloudera Enterprise 3.5 and the launch of Cloudera SCM Express, based on the new Service and Configuration Manager in Cloudera Enterprise 3.5.

# MapR


  [Read more...]
PDI Loading into LucidDB

By far, the most popular way for PDI users to load data into LucidDB is to use the PDI Streaming Loader. The streaming loader is a native PDI step that:

  • Enables high performance loading, directly over the network without the need for intermediate IO and shipping of data files.
  • Lets users choose more interesting (from a DW perspective) loading types for tables. In particular, in addition to simple INSERTs it allows for MERGE (aka UPSERT) and also UPDATE, all done in the same bulk loader.
  • Enables the metadata for the load to be managed, scheduled, and run in PDI.

However, we’ve had some known issues. In fact, until PDI 4.2 GA

  [Read more...]
HPCC vs Hadoop at a glance

Update

Since this article was written, HPCC has undergone a number of significant changes and updates. These address some of the criticism voiced in this blog post, such as the license (updated from AGPL to Apache 2.0) and integration with other tools. For more information, refer to the comments placed by Flavio Villanustre and Azana Baksh.

The original article can be read unaltered below:

Yesterday I noticed this tweet by Andrei Savu. This prompted me to read the related GigaOM article and then check out the

  [Read more...]
SQL access to CouchDB views

Following up on my first post on an alternative, more SQL-eee metadata driven approach to doing BI on Big Data, I’d like to share an example of how we can enable easy reporting on top of Big Data immediately for CouchDB users. We’re very keen on talking with CouchDB/Hive/other Big Data users about their Ad Hoc and BI needs; please visit the forum thread about the connector.

We’ve been working with some new potential customers on how to leverage their investment in Big Data (specifically Big Couch provided by the fine folks at Cloudant). In

  [Read more...]
451 CAOS Links 2011.05.10

EMC launches Greenplum HD. DataStax releases Brisk. And more.

# EMC launched its Greenplum HD Hadoop distribution, with the support of Jaspersoft, Pentaho, and SnapLogic, among others.

# DataStax released its

  [Read more...]
451 CAOS Links 2011.04.12

Groklaw declares victory. Cloudera updates Hadoop distro. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# Groklaw claimed victory, will stop publishing new articles on May 16.

# Cloudera released version 3 of its Hadoop distribution.

# VoltDB released version 1.3 of its open source distributed in-memory database.

# Black Duck grew sales by 51% in Q1.

# eXo and Convertigo partnered to add


  [Read more...]
451 CAOS Links 2011.03.22

Paranoid Android. Canonical and Gnome. A new OSI. And more.

Paranoid Android
If you are interested in the potential violation of the GPL by the Android kernel you have probably already immersed yourself in the numerous blog posts published on the topic. If not, start with Sean Hogle’s analysis or Bradley M Kuhn’s overview of the original allegations and work backwards from there, not forgetting a detour for the obligatory Microsoft connection. Linus Torvalds said the claim “seems totally bogus”. In the

  [Read more...]
Reading from MongoDB

Hi Folks,

Now that we’re blogging again I thought I might as well continue to do so.

Today we’re reading data from MongoDB with Pentaho Data Integration.  We haven’t had a lot of requests for MongoDB support so there is no step to read from it yet.  However, it is surprisingly simple to do with the “User Defined Java Class” step.

For the following sample to work you need to be on a recent 4.2.0-M1 build.  Get it from here.

Then download mongo-2.4.jar and put it in the libext/ folder of your PDI/Kettle distribution.

Then you can read from a collection with the following “User Defined Java Class” code:

import java.math.*;
import java.util.*;
import java.util.Map.Entry;
import
  [Read more...]
Kettle vs Oracle REF CURSOR

Dear Kettle fans,

PDI-200 has been out there for a while now. Jens created the feature request a little over 3 years ago. I guess the main thing blocking this issue was not so much a technical problem as a licensing and dependency one (Oracle JDBC dependency and distribution license).

However, now that we have the User Defined Java Class step we can work around those pesky problems. That is because the Java code in there only gets compiled and executed at runtime so it’s perfectly fine to create any sort of dependency in there you like.

The following transformation reads a set of rows from a stored procedure as described on this web page.

In short, our UDJC step executes the following code:

begin ? := sp_get_stocks(?);

  [Read more...]
Pentaho Kettle Solutions Overview

Dear Kettle friends,

As mentioned in my previous blog post, copies of our new book Pentaho Kettle Solutions are finally shipping. Roland, Jos and I worked really hard on it and, as you can probably imagine, we were really happy when we finally got the physical version of our book in our hands.

So let’s take a look at what’s in this book, what the concept behind it was and give you an overview of the content…

The concept

Given the fact that Maria’s book, called

  [Read more...]
Open Source BI -- Pentaho and Jaspersoft Part I

Hey DBAs! Are you seeking more efficient ways of sifting through your data to aid your business operations? Two popular Business Intelligence products that offer community open source editions are Pentaho and JasperSoft. And both work with MySQL.

Both are easy to download and install. Both will use a JDBC connector to connect to MySQL. But how easy are the two to configure and run a simple report against a running instance of MySQL?


Setting up a JDBC connection with JasperSoft or Pentaho is pretty much like using any other JDBC connection.




  [Read more...]
Book Review : Pentaho 3.2 Data Integration

Dear Kettle fans,

A few weeks ago, when I was stuck in the US after the MySQL User Conference, a new book was published by Packt Publishing.

That all by itself is something that is not too remarkable.  However, this time it’s a book about my brainchild Kettle. That makes this book very special to me. The full title is Pentaho 3.2 Data Integration : Beginner’s Guide (Amazon, Packt).  The title all by itself explains the purpose of this book: give the reader a quick-start when it comes to Pentaho Data Integration (Kettle).

The author María Carina

  [Read more...]
Slides from my MySQL UC 2010 presentation

As requested by a few fans out there, here are the slides of my presentation:

Pentaho Data Integration 4.0 and MySQL.pdf

I had a great time at the conference, met a lot of nice folks, friends, customers, partners and colleagues. After the conference, like so many of you, I was unable to get back home because of the Paul Simon singing Eyjafjallajökull volcano in Iceland.

So I ended up flying over to Orlando for a week of brutal PDI 4.0 RC1 hacking with the rest of the l33t super Pentaho development team. However, after 2+ weeks away from home, even a severe storm over Philadelphia couldn’t prevent me from getting home eventually.

Until next time,
Matt

451 CAOS Links 2010.04.23

The White House contributes to OSS. Growth for Pentaho and MuleSoft. And more.

# The White House released some of the custom code it has developed for whitehouse.gov as open source.

# Pentaho reported 229% bookings growth and 177% Enterprise Edition customer growth in Q2.

# MuleSoft grew bookings by 140% in Q1.

# The beta of Red Hat Enterprise Linux 6 is now available, without Xen.

# OpenCandy


  [Read more...]
MySQL User Conference 2010

Dear Kettle and MySQL fans,

Next week I’ll be strolling around the MySQL user conference in Santa Clara.  Even better, I’ll be presenting Tuesday afternoon (3:05pm).  The topic is Pentaho Data Integration 4.0 and MySQL.

The presentation will show you what the world’s most popular open source data integration tool can do for a MySQL user.  It will include practical examples and will showcase the latest improvements present in the brand new version 4.0.

Even more than the presentation itself, I’m looking forward to meeting you all over there.  The regular crowd, MySQL users, Pentaho partners, folks from

  [Read more...]
Investing in Disruption


I'm an advisor, investor and board member to several startup software companies including Revolution Computing, Pentaho and, most recently, Erply, a new Software as a Service (SaaS) company. One of the common threads I look for is the opportunity to disrupt a large market.

One of the things that made MySQL successful was



  [Read more...]
451 CAOS Links 2010.03.23

Mårten Mickos joins Eucalyptus. Novell rejects Elliott. Perspectives on OSBC. And more.

# Mårten Mickos was named CEO of Eucalyptus Systems.

# Novell’s board rejected Elliott’s takeover proposal as inadequate and will review other alternatives.

# North Bridge Venture Partners published the results of its Future of Open Source survey.

# Rob Bearden was appointed executive chairman of the board of Pentaho.

# The Eclipse Foundation


  [Read more...]
Writing another book: Pentaho Kettle Solutions

Last year, at about this time of the year, I was well involved in the process of writing the book "Pentaho Solutions: Business Intelligence and Data Warehousing with Pentaho and MySQL" for Wiley. To date, "Pentaho Solutions" is still the only all-round book on the open source Pentaho Business Intelligence suite.

It was an extremely interesting project to participate in, full of new experiences.

  [Read more...]
451 CAOS Links 2010.02.02

Oracle’s plans for Sun’s OSS. The UK’s updated OSS strategy. And more.

Oracle’s plans for Sun’s OSS
# Oracle’s MySQL strategy slide.

# eWeek reported that database thought leaders are divided on Oracle MySQL.

# Savio Rodrigues and Computerworld on Oracle’s plans for MySQL, other open source assets.

# Zack Urlocker is leaving Oracle/Sun/MySQL.

# Red Hat’s Mark Little



  [Read more...]
Encrypt PDI passwords

PDI has a basic obfuscation method that makes it difficult for casual observers to lift passwords for DB connections. I have customers that maintain different versions of a “shared.xml” file that define different physical connections to databases (think development, QA/testing, and production).

In order to generate the different shared.xml files, a user usually has to (per Matt Casters’ comment below, there is a utility that allows users to do this outside of Spoon) open up PDI, create the connections, save them, and then sometimes copy and paste the sections needed to create their “dev” or “production” version of shared.xml. Many times this is just to generate the password, as they can hand-edit the other pieces (hostname, schema, etc.).

I just committed a

  [Read more...]
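
For readers wondering what “basic obfuscation” means in practice, here is a small, hedged illustration in Java. It is explicitly not PDI’s algorithm and the key below is made up; it only shows the general idea of a reversible transform that keeps a password from being readable at a glance in a file like shared.xml. The class and method names are hypothetical.

```java
import java.nio.charset.StandardCharsets;

/** Illustrative only: reversible XOR-and-hex obfuscation. NOT PDI's algorithm, NOT encryption. */
final class ObfuscationSketch {

    // Hypothetical fixed key; anyone who knows it can reverse the obfuscation.
    private static final byte[] KEY = "not-a-real-pdi-seed".getBytes(StandardCharsets.UTF_8);

    /** XOR each UTF-8 byte with the repeating key and return lowercase hex. */
    static String obfuscate(String plain) {
        byte[] in = plain.getBytes(StandardCharsets.UTF_8);
        StringBuilder hex = new StringBuilder(in.length * 2);
        for (int i = 0; i < in.length; i++) {
            int b = (in[i] ^ KEY[i % KEY.length]) & 0xFF;
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }

    /** Reverse of obfuscate: parse the hex pairs and XOR with the same key. */
    static String reveal(String hex) {
        byte[] out = new byte[hex.length() / 2];
        for (int i = 0; i < out.length; i++) {
            int b = Integer.parseInt(hex.substring(2 * i, 2 * i + 2), 16);
            out[i] = (byte) (b ^ KEY[i % KEY.length]);
        }
        return new String(out, StandardCharsets.UTF_8);
    }
}
```

Because the key is fixed and the transform is trivially reversible, this kind of scheme only deters casual reading. It should not be treated as protection against anyone with access to the tool or the file.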
Re-Introducing UDJC

Dear Kettle fans,

Daniel & I had a lot of fun in Orlando last week. Among other things we worked on the User Defined Java Class (UDJC) step. If you have a bit of Java experience, this step allows you to quickly write your own plugin in a step. This step is available in recent builds of Pentaho Data Integration (Kettle) version 4.

Now, how does this work? Well, let’s take Roland Bouman’s example: the calculation of the date of Easter. In this blog post, Roland explains how to calculate Easter in MySQL and Kettle using JavaScript. OK, so what if you want this calculation to be really fast in Kettle? Well, then you can turn to pure Java to do the job…

import java.util.*;
  [Read more...]
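
The excerpt cuts off before the post’s own Java. As a stand-alone sketch of the kind of pure-Java Easter calculation being described, here is the widely published “anonymous Gregorian” computus written as a plain class. This is not the code from the post and it is not wrapped as a UDJC step; the class and method names are made up for illustration.

```java
import java.time.LocalDate;

/** Illustrative only: Easter Sunday in the Gregorian calendar (anonymous Gregorian algorithm). */
final class EasterSketch {

    /** Returns the date of Easter Sunday for the given Gregorian year. */
    static LocalDate easterSunday(int year) {
        int a = year % 19;                          // position in the 19-year lunar cycle
        int b = year / 100;                         // century
        int c = year % 100;                         // year within the century
        int d = b / 4;
        int e = b % 4;
        int f = (b + 8) / 25;
        int g = (b - f + 1) / 3;
        int h = (19 * a + b - d - g + 15) % 30;     // offset to the paschal full moon
        int i = c / 4;
        int k = c % 4;
        int l = (32 + 2 * e + 2 * i - h - k) % 7;   // days to the following Sunday
        int m = (a + 11 * h + 22 * l) / 451;        // correction term
        int month = (h + l - 7 * m + 114) / 31;     // 3 = March, 4 = April
        int day = ((h + l - 7 * m + 114) % 31) + 1;
        return LocalDate.of(year, month, day);
    }

    public static void main(String[] args) {
        System.out.println(easterSunday(2010));     // prints 2010-04-04
    }
}
```

Everything here is integer arithmetic on a single input value, which is the sort of per-row work that benefits from compiled Java rather than an interpreted script.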
A guide to The 451 Group’s open source software coverage

Regular visitors to the 451 CAOS Theory blog will be well aware of The 451 Group’s CAOS (Commercial Adoption of Open Source) research service and our CAOS long-form reports.

They are probably less aware of the open source coverage that The 451 Group provides on a day-to-day and week-to-week basis, however, and I thought it would be worthwhile to provide some examples of The 451 Group’s ongoing open source coverage by highlighting a few recent reports.

The company’s core services are 451 Market Insight Service, which delivers daily insight into emerging enterprise IT markets, and 451 TechDealmaker, a forward-looking weekly

  [Read more...]
A review of Pentaho Solutions by Roland Bouman and Jos van Dongen

Pentaho Solutions

Pentaho Solutions, Business Intelligence and Data Warehousing with Pentaho and MySQL. By Roland Bouman and Jos van Dongen, Wiley 2009. Page count: about 570 pages. (Here’s a link to the publisher’s site.)

The book is big in part because it’s about a GUI tool, so there is the requisite number of screenshots (but not too many). It is structured into four parts, each on a different topic.

The first part is 4

  [Read more...]
451 CAOS Links 2009.11.06

Funambol acquires Zapatec. Open source gains Closure. And more.

For the latest on Oracle’s acquisition of MySQL via Sun, see Everything you always wanted to know about MySQL but were afraid to ask

# Funambol acquired Zapatec, an AJAX web 2.0 frameworks vendor.

# The top ten issues facing open source users, according to Mark Radcliffe.

# Google open sourced its Closure


  [Read more...]

Planet MySQL © 1995, 2014, Oracle Corporation and/or its affiliates

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.