How to use Cassandra's MapReduce with or without Pig?

Posted by UltimateBrent on Stack Overflow
Published on 2010-04-29T00:17:18Z Indexed on 2010/04/29 0:27 UTC

Can someone explain how MapReduce works with Cassandra 0.6? I've read through the word count example, but I don't quite follow what happens on the Cassandra end versus the "client" end.

https://svn.apache.org/repos/asf/cassandra/trunk/contrib/word_count/

For instance, say I'm using Python and Pycassa: how would I load a new MapReduce function and then call it? Does my MapReduce function have to be Java that's installed on the Cassandra server? If so, how do I call it from Pycassa?

There's also mention of Pig making this all easier, but I'm a complete Hadoop noob, so that didn't really help.

Your answer can use Thrift or whatever; I only mentioned Pycassa to denote the client side. I'm just trying to understand the difference between what runs in the Cassandra cluster and what runs on the server making the requests.

© Stack Overflow or respective owner
