I have a strange issue I've been struggling with the past month or two. We have a RHEL5 (5.8) system that is SAN-attached to CLARiiON storage. The admin claims that right after running updates on the server, it started logging QLA2 errors like these:
Everything I have found on the Internet about these sort of errors leads to faulty cabling. Well we have swapped every piece of fiber between the host and the CLARiiON array. And we swapped the connections on the back of the server and rezoned and the errors switched to the other HBA. So it's reasonably sure the problem exists outside the server? My only concern is that the errors started right after a "yum update". We do have a case open with EMC.
Has anyone run into this before? I'm hoping that if it is an issue with an update, someone has seen the problem also. If not, I'm sorry for taking up disk space and your time reading.
We have the same errors on an OVM 3.0.3 server (which is a similar kernel thant the uek in 5.8). Did you find the reason of your problem ? was it a problem outside of the server ? or was it related to HBA firmware/driver ?
To be honest, I never found out. I'm pretty sure the problem is outside but we swapped all the SAN infrastructure components and the problem still exists. I pretty much gave up when EMC said they had no other ideas...