Showing entries 1 to 2
Displaying posts with tag: Spreadsheet (reset)
Why Oracle’s donation of OpenOffice disappoints

While Oracle deserves some praise for its donation of OpenOffice.org code to the Apache Foundation, it is disappointing again to see a legitimate open source market contender that has been marginalized by miscommunication and mismanagement of the project by a large vendor.

OpenOffice.org, warts and all, was probably the most significant competition for Microsoft Office for years and in many ways demonstrated the advantages of open source, helping usher in wider use of it, as well as greater usability. OO.o was in fact my reason for originally investigating and moving to open source software more than a decade ago. Regardless of past mismanagement of community and technology, that competitive factor has been diminished greatly since Oracle took ownership of OO.o. Now, after prompting a fork — as has …

[Read more]
Parse nasty XLS with dynamic ETL

Dear Kettle friends,

Last year, right after the summer in version 4.1 of Pentaho Data Integration, we introduced the notion of dynamically inserted ETL metadata (Youtube video here).  Since then we received a lot of positive feedback on this functionality which encouraged me to extend it to a few more steps. Already with support for “CSV Input” and “Select Values” we could do a lot of dynamic things.  However, we can clearly do a lot better by extending our initiative to a few more steps: “Microsoft Excel Input” (which can also read ODS by the way), “Row Normalizer” and “Row De-normalizer”.

Below I’ll describe an actual (obfuscated) example that you will probably recognize as it is equally hideous as simple in it’s horrible complexity.

Take a look at this file:

Let’s assume that this spreadsheet …

[Read more]
Showing entries 1 to 2