We have an issue with Solaris 11.0 (SRU10.5). It is installed on a super micro system, with DDR write cache, SSD read cache and SAS disks. Every now and then these boxes end up in a "ZFS pool suspended" state, the OS still responds. As a consequence we need to reboot the box. Not yet able to capture a core dump or to reproduce the conditions/error. Upgrade to 11.1 not possible, due to some other bugs. Call logged at Oracle support.
Does someone recognise this type of problem? And maybe some workarounds?
Thanks for your reply. We did all the suggestions you made in your reply. However, still stuck. We are trying to find a way to mimic the issues and cause the problem so we do a crash dump. So far no luck. Suggestions still welcome.