Linux bonded Interfaces hanging periodically

Posted by David on Server Fault See other posts from Server Fault or by David
Published on 2012-03-20T18:13:29Z Indexed on 2012/03/20 23:32 UTC
Read the original article Hit count: 219

Filed under:
|
|
|
|

I've several hosts that are showing problems with connectivity. When working from the command line, for example, typing is frozen for a second or so, then recovers - then it does it again.

The most egregious example host would freeze (input) for 15-30 seconds, then recover and go out 5 seconds later. Switching cables didn't do anything - but removing one of the physical cables caused everything to clear up instantly (which why I think this is a network problem).

Looking at the network I couldn't see any packets floating that would explain this.

These ethernet interfaces (Gigabit Dell) were working normally previously, but since we moved the systems - and put them on a new set of switches - this has been a problem on multiple theoretically identically-configured hosts.

The original switches were an HP Procurve 1810-24G and an HP Procurve 1800-24G connected with LLDP; the new switches are both Cisco SG 200-26, which I understand are rebranded Linksys switches.

Is this caused by a problem with the switches? Is it the switch configurations? Are the Cisco switches incapable of handling this?

I don't see where the configuration is located; I searched the usual /etc/sysconfig/network/devices but there's nothing in there about options (like mii polling) and nothing about the method of balancing the two. Searching scripts, I can't find anything in /etc/init.d/network either.

The hosts are almost all Red Hat Enterprise Linux 5.x systems (5.6, 5.7) but some are Ubuntu Server 10.04.3 Lucid Lynx. I need help with both if it comes to that.

UPDATE: We're also seeing some problems with servers on the original switches.

The HP switches and the Cisco switches are also interconnected (temporarily); there is a cable run from one switch to the next. Pings on any of these hosts show about one ICMP packet out of every 5-6 getting dropped (timed out). Could there be an interaction between the two switches?

Oh, and the hosts are using bonding with Balance-RR as the method.

© Server Fault or respective owner

Related posts about ubuntu

Related posts about networking