Hadoop 0.2: How to read outputs from TextOutputFormat?

Posted by S.N on Stack Overflow See other posts from Stack Overflow or by S.N
Published on 2010-04-09T08:46:42Z Indexed on 2010/04/13 1:02 UTC
Read the original article Hit count: 409

Filed under:
|

My reducer class produces outputs with TextOutputFormat (the default OutputFormat given by Job). I like to consume this outputs after the MapReduce job complete to aggregate the outputs. In addition to this, I like to write out the aggregated information with TextInputFormat so that the output from this process can be consumed by the next iteration of MapReduce task. Can anyone give me an example on how to write & read with TextFormat? By the way, the reason why I am using TextFormat, rather Sequence, is the interoperability. The outputs should be consumed by any software.

© Stack Overflow or respective owner

Related posts about hadoop

Related posts about mapreduce