Which Hadoop API version should I use?

Posted by Niels Basjes on Stack Overflow
Published on 2010-05-18T07:16:40Z Indexed on 2010/05/18 17:20 UTC

The latest Hadoop Studio labels the Hadoop 0.18 API "Stable" and the Hadoop 0.20 API "Unstable".

The Yahoo distribution is based on 0.20 (with Yahoo patches) and is apparently "the way to go". Cloudera states that its 0.20 distribution (with Cloudera patches) is also stable.

We will start coding a new Hadoop project in the next few weeks. Which API should we use, and which Hadoop distribution (Apache, Cloudera, Yahoo, ...)?

Thanks for your insights.

© Stack Overflow or respective owner