I spoke to Daniel Abadi this morning about the HadoopDB announcement that came out a couple of days ago. This has surely been a busy time for Daniel and his team at Yale, as HadoopDB has been getting a lot of interest, which I expect will continue to build.
Some notes from our discussion:
- HadoopDB is primarily focused on high scalability and the availability required at that scale. Daniel questions whether current MPP databases can truly scale past 100 nodes, whereas Hadoop has real examples running on 3000+ nodes.
- HadoopDB, like many MPP analytical database platforms, uses shared-nothing relational databases as processing units, in this case Postgres. Unlike other MPP databases, HadoopDB uses Hadoop as the …