Planet MySQL
Showing entries 1 to 4

Displaying posts with tag: data quality

Data Cleaner 2

Dear Kettle friends,

Some time ago while I visited the nice folks from Human Inference in Arnhem, I ran into Kasper Sørensen, the lead developer of DataCleaner.

DataCleaner is an open source data quality tool released (like Kettle) under the LGPL license. It is essentially to blame for the lack of a profiling tool inside Kettle. That is because having DataCleaner available to our users was enough to push the …

Part 2: Comparing Numerics in Pentaho Data Integration

As a follow-up to my previous post about comparing numeric values, I've since discovered a little more about the problem. To restate it: certain numeric field values that should be equal are being detected as different in the Filter rows step. Accurate comparisons matter because they are a frequent task in data quality analysis.

Originally, I assumed this had something to do with JDBC. However, since I can reproduce the issue without any SQL, I'm sure this has nothing to do with the version of the …

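The excerpt stops before it names a cause, so the following is not the author's diagnosis. It is a minimal sketch, in Python rather than Pentaho Data Integration, of one common way that values which should be equal compare as different: binary floating-point rounding. The helper name `values_match` and the tolerance values are hypothetical, chosen only for illustration.

```python
from decimal import Decimal
import math


def values_match(a: float, b: float,
                 rel_tol: float = 1e-9, abs_tol: float = 1e-12) -> bool:
    """Compare two floats with a tolerance instead of exact equality.

    Hypothetical helper; the tolerances are illustrative assumptions.
    """
    return math.isclose(a, b, rel_tol=rel_tol, abs_tol=abs_tol)


# Two values that "should" be equal but are stored as binary doubles.
a = 0.1 + 0.2
b = 0.3

# Exact comparison of the doubles.
exact_equal = (a == b)

# Tolerance-based comparison of the same doubles.
tolerant_equal = values_match(a, b)

# The same arithmetic done with exact decimal values.
decimal_equal = (Decimal("0.1") + Decimal("0.2") == Decimal("0.3"))

print(exact_equal, tolerant_equal, decimal_equal)
```

If floating-point rounding is what is happening, the usual remedies are to compare within a tolerance or to carry the values in an exact decimal type from the start.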
451 CAOS Links 2009.08.04

OIN offers cash for patents. CentOS crisis averted. Microsoft denies GPL violation. And more.

Follow 451 CAOS Links live @caostheory on Twitter and
“Tracking the open source news wires, so you don’t have to.”

# Open Invention Network offered individual inventors cash for patents, and acquired patents from V_Graph.

# The H Open reported that the management problems at CentOS are now resolved. …

SQL_MODE and MySQL Data Quality

As my former boss will attest, I have a reputation for being a bit of a data quality zealot. The storage of data that is unfit for use leads to many problems, but I suppose that’s another subject for another day.

It’s tough enough to manage data quality problems introduced by source code errors, system failures, and requirements misunderstandings. But a default installation of MySQL introduces a new and exciting way to give us data quality evangelists fits: it allows unfit data to be inserted in the database. That’s the bad news. The good news is that by making a simple configuration change you can prevent this, and override the setting when …

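The excerpt is cut off before it shows the setting, so the following is only a sketch of the kind of change described. It assumes the option in question is MySQL's `sql_mode` system variable, as the post title suggests, and that a strict mode such as `STRICT_ALL_TABLES` is the intended value; the post itself may recommend a different combination of modes.

```ini
# Server option file (for example my.cnf).
# The mode chosen here is an assumption, not taken from the post.
[mysqld]
sql_mode = STRICT_ALL_TABLES
```

With a strict mode set server-wide, a single connection can still relax it for itself with `SET SESSION sql_mode = '';`, which is one way to override the setting when needed.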

Planet MySQL © 1995, 2015, Oracle Corporation and/or its affiliates

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.