11gR2 RAC node reboot problem
4-node RAC on AIX, OS level 7100-03, EMC storage
The clusterware and the database are both 11.2.0.4, with the latest PSU applied
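When the database runs batch operations and total storage read/write I/O exceeds 1 GB per second, the cluster alert log reports the following errors and then starts evicting the node: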
2016-05-06 13:54:43.546:
[cssd(23069096)]CRS-1612:Network communication with node nkhxdb02 (2) missing for 50% of timeout interval. Removal of this node from cluster in 14.737 seconds
2016-05-06 13:54:51.577:
[cssd(23069096)]CRS-1611:Network communication with node nkhxdb02 (2) missing for 75% of timeout interval. Removal of this node from cluster in 6.706 seconds
2016-05-06 13:54:55.583:
[cssd(23069096)]CRS-1610:Network communication with node nkhxdb02 (2) missing for 90% of timeout interval. Removal of this node from cluster in 2.700 seconds
2016-05-06 13:54:58.285:
[cssd(23069096)]CRS-1607:Node nkhxdb02 is being evicted in cluster incarnation 358005899; details at (:CSSNM00007:) in /u01/app/11.2.0/grid/log/nkhxdb01/cssd/ocssd.log.
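
To check whether the three warnings belong to one continuous heartbeat outage, each "Removal ... in N seconds" countdown can be added to its timestamp to see whether they all point at the same deadline. Below is a minimal Python sketch of that arithmetic. The function name `projected_evictions` and the regular expression are my own illustration, not part of any Oracle tool, and the parser assumes the timestamp-line-then-message-line layout shown above.

```python
import re
from datetime import datetime, timedelta

# Matches the CRS-1612 / CRS-1611 / CRS-1610 warnings as they appear above.
WARNING_RE = re.compile(
    r"CRS-(?P<code>161[012]):Network communication with node (?P<node>\S+) "
    r"\((?P<num>\d+)\) missing for (?P<pct>\d+)% of timeout interval\.\s+"
    r"Removal of this node from cluster in (?P<remaining>[\d.]+) seconds"
)
TS_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def projected_evictions(lines):
    """Return (node, percent_missed, projected_deadline) for each warning.

    Assumes each message line is preceded by a timestamp line ending in ':'.
    """
    results = []
    stamp = None
    for raw in lines:
        line = raw.strip()
        match = WARNING_RE.search(line)
        if match and stamp is not None:
            remaining = timedelta(seconds=float(match.group("remaining")))
            results.append(
                (match.group("node"), int(match.group("pct")), stamp + remaining)
            )
            continue
        try:
            # Timestamp lines look like "2016-05-06 13:54:43.546:"
            stamp = datetime.strptime(line.rstrip(":"), TS_FORMAT)
        except ValueError:
            pass  # not a timestamp line; keep the previous timestamp
    return results


SAMPLE_LOG = [
    "2016-05-06 13:54:43.546:",
    "[cssd(23069096)]CRS-1612:Network communication with node nkhxdb02 (2) missing for 50% of timeout interval. Removal of this node from cluster in 14.737 seconds",
    "2016-05-06 13:54:51.577:",
    "[cssd(23069096)]CRS-1611:Network communication with node nkhxdb02 (2) missing for 75% of timeout interval. Removal of this node from cluster in 6.706 seconds",
    "2016-05-06 13:54:55.583:",
    "[cssd(23069096)]CRS-1610:Network communication with node nkhxdb02 (2) missing for 90% of timeout interval. Removal of this node from cluster in 2.700 seconds",
]

for node, pct, deadline in projected_evictions(SAMPLE_LOG):
    print(node, f"{pct}%", deadline.time())
```

For the lines above, all three warnings project the same deadline, 13:54:58.283, which is about 2 ms before the CRS-1607 eviction message at 13:54:58.285. That suggests the network heartbeat from nkhxdb02 was missing for the whole interval rather than dropping out intermittently.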