Does changing the default HDFS replication factor from 3 affect mapper performance?
Posted by liamf on Server Fault
Published on 2011-06-29T15:57:24Z

I have an HDFS/Hadoop cluster set up and am looking into tuning it.
I wonder whether raising the HDFS replication factor above its default of 3 would improve mapper performance, at the obvious expense of more disk space used.
My reasoning is that if each block is already replicated to more nodes, map tasks can run on more nodes in parallel without any data having to be streamed or copied over the network.
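To put rough numbers on that intuition, here is a minimal back-of-the-envelope sketch in Python. It is not how the Hadoop scheduler actually works: it assumes replicas of a block sit on distinct nodes chosen uniformly at random (ignoring rack-aware placement) and that a fixed number of nodes have a free map slot at the moment a task is scheduled. The function name `local_probability` and the cluster sizes are made up for illustration.

```python
from math import comb


def local_probability(num_nodes: int, free_nodes: int, replication: int) -> float:
    """Chance that at least one replica of a block lives on a node with a free map slot.

    Assumptions (illustrative only): replicas are on distinct nodes picked
    uniformly at random, and exactly `free_nodes` of the `num_nodes` nodes
    currently have a free slot.
    """
    if not 0 <= free_nodes <= num_nodes:
        raise ValueError("free_nodes must be between 0 and num_nodes")
    if not 1 <= replication <= num_nodes:
        raise ValueError("replication must be between 1 and num_nodes")
    # Probability that every replica landed on a busy node:
    # choose all replicas from the (num_nodes - free_nodes) busy nodes.
    all_busy = comb(num_nodes - free_nodes, replication) / comb(num_nodes, replication)
    return 1.0 - all_busy


# Hypothetical cluster: 20 nodes, 5 of them with a free map slot right now.
for r in (1, 2, 3, 5, 8):
    print(f"replication={r}: P(data-local slot available) = {local_probability(20, 5, r):.3f}")
```

Under these assumptions the chance of finding a data-local slot rises with the replication factor, but with diminishing returns, and the model says nothing about the extra write traffic and disk usage that higher replication costs.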
Anyone got any opinions?