Ubuntu Server with mdadm, DRBD, OCFS2 and KVM hangs under heavy file reads

Posted by Stefano Annese on Server Fault
Published on 2011-11-18T08:31:02Z Indexed on 2011/11/18 9:55 UTC

I have deployed four Ubuntu 10.04 servers, paired two by two into two clusters. On each side we have software RAID1 disks, DRBD 8 and OCFS2, and on top of that some KVM virtual machines run from qcow2 disks.

I followed this: Link

Corosync is used only for DRBD and OCFS2; the KVM machines are started manually.

When it works, it works well: good performance and good I/O. At some point, however, one of the two clusters started hanging. We then tried running with just one server of that cluster turned on, and it hangs in the same way. It seems to happen when a heavy READ load occurs in one of the virtual machines, that is, during the rsync backup. When this happens, the virtual machines are no longer reachable, and the physical server still responds to ping with good latency, but neither the console screen nor SSH is available.

All we can do is force a shutdown (holding the power button) and restart. When the server comes back up, the RAID array on which DRBD relies is resyncing. We see this every time it hangs.
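For anyone wanting to confirm the resync after each forced restart, it shows up in /proc/mdstat. Below is a minimal Python sketch that extracts the resync or recovery progress per array. The sample text and the array names (md0, md1) are illustrative assumptions, not output captured from these servers, and the function name resync_status is hypothetical.

```python
import re

# Illustrative /proc/mdstat content; NOT captured from the servers in this post.
SAMPLE_MDSTAT = """\
Personalities : [raid1]
md0 : active raid1 sdb1[1] sda1[0]
      976630336 blocks [2/2] [UU]
      [==>..................]  resync = 12.6% (123456789/976630336) finish=85.3min speed=166000K/sec

md1 : active raid1 sdb2[1] sda2[0]
      1048512 blocks [2/2] [UU]

unused devices: <none>
"""

ARRAY_RE = re.compile(r"^(md\d+)\s*:")                      # line that starts an array entry
SYNC_RE = re.compile(r"(resync|recovery)\s*=\s*([\d.]+)%")  # progress line of that array


def resync_status(mdstat_text):
    """Return {array_name: (action, percent)} for arrays being resynced or rebuilt."""
    status = {}
    current = None
    for line in mdstat_text.splitlines():
        array = ARRAY_RE.match(line)
        if array:
            current = array.group(1)
            continue
        sync = SYNC_RE.search(line)
        if sync and current is not None:
            status[current] = (sync.group(1), float(sync.group(2)))
    return status


if __name__ == "__main__":
    # On a real host, read the live file instead of the sample:
    #   with open("/proc/mdstat") as f: text = f.read()
    print(resync_status(SAMPLE_MDSTAT))
```

An empty result means no array is currently resyncing; it says nothing about why the hang happened in the first place.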

After a couple of weeks of pain on one side, this morning the other cluster hung as well, even though it has a different motherboard, RAM and KVM instances. What the two have in common is the heavy reading during rsync and Western Digital RAID Edition disks on both sides.

Can anybody give me some pointers on how to solve this issue?

© Server Fault or respective owner
