1 Reply Latest reply: Dec 14, 2009 9:27 PM by 807557 RSS

    [Help!] SP Request to Reset Host due to Watchdog...

    807557
      A T6320 modual installed Solaris 10 u4, when ever I boot up the OS into a multi-user level, the system will reset automatically, but in single-user level, everything is OK. Please refer to the following information, it is about a process from poweron to boot then the OS reseted then boot again then reset again then the system gived up:

      ----------------------------------------------------------------------------------------------------
      sc> poweron
      Chassis | major: Host has been powered on

      sc> console

      Enter #. to return to ALOM. D one

      2008-08-06 02:55:58.709 0:0:0>INFO:

      2008-08-06 02:55:58.716 0:0:0> POST Passed all devices.

      2008-08-06 02:55:58.728 0:0:0>POST: Return to VBSC.

      2008-08-06 02:55:58.739 0:0:0>Master set ACK for vbsc runpost command and spin.. .

      Chassis | major: Host is running





      Sun Blade T6320 Server Module, No Keyboard

      Copyright 2007 Sun Microsystems, Inc. All rights reserved.

      OpenBoot 4.27.10, 16256 MB memory available, Serial #80085274.

      Ethernet address 0:14:4f:c6:1:1a, Host ID: 84c6011a.







      Boot device: /pci@0/pci@0/pci@2/LSILogic,sas@0/disk@0,0:a File and args: -k

      Loading kmdb...

      SunOS Release 5.10 Version Generic_120011-14 64-bit

      Copyright 1983-2007 Sun Microsystems, Inc. All rights reserved.

      Use is subject to license terms.

      WARNING: consconfig: cannot find driver for screen device

      Hostname: TL116

      NIS domain name is

      Aug 6 10:58:48 svc.startd[7]: svc:/network/nis/client:default: Method "/lib/svc/method/yp" failed with exit status 96.

      Aug 6 10:58:48 svc.startd[7]: network/nis/client:default misconfigured: transitioned to maintenance (see 'svcs -xv' for details)



      TL116 console login: Aug 6 10:58:59 TL116 sendmail[394]: My unqualified host name (TL116) unknown; sleeping for retry

      Aug 6 10:58:59 TL116 sendmail[393]: My unqualified host name (TL116) unknown; sleeping for retry

      Chassis | critical: SP Request to Reset Host due to Watchdog

      Chassis | critical: Host has been reset

      Chassis | critical: Host has been powered off

      Chassis | major: Host has been powered on

      0:0:0>

      0:0:0>Sun Blade T6320 Server Module POST 4.27.10 2007/12/07 11:14

      /export/delivery/delivery/4.27/4.27.10/post4.27.x/Niagara/glendale/integrated (root)

      0:0:0>Copyright 2007 Sun Microsystems, Inc. All rights reserved

      0:0:0>VBSC cmp 0 arg is: 0000ff00.ff00ffff

      0:0:0>POST enabling threads: 0000ff00.ff00ffff

      0:0:0>VBSC mode is: 00000000.00000001

      0:0:0>VBSC level is: 00000000.00000001

      0:0:0>VBSC selecting Normal mode, MAX Testing.

      0:0:0>VBSC setting verbosity level 2

      0:0:0>Basic Memory Tests....Done

      0:0:0>Test Memory....Done

      0:0:0>Setup POST Mailbox ....Done

      0:0:0>Master CPU Tests Basic....Done

      0:0:0>Init MMU.....

      0:0:0>L2 Tests....Done

      0:0:0>Extended CPU Tests....Done

      0:0:0>Scrub Memory....Done

      0:0:0>SPU CWQ Tests...Done

      0:0:0>MAU Tests...Done

      0:0:0>NCU Tests....Done

      0:0:0>Network Interface Unit Tests....Done

      0:0:0>Functional CPU Tests....Done

      0:0:0>Extended Memory Tests....Done

      2008-08-06 03:03:14.773 0:0:0>INFO:

      2008-08-06 03:03:14.779 0:0:0> POST Passed all devices.

      2008-08-06 03:03:14.788 0:0:0>POST: Return to VBSC.

      2008-08-06 03:03:14.797 0:0:0>Master set ACK for vbsc runpost command and spin...

      Chassis | major: Host is running





      Sun Blade T6320 Server Module, No Keyboard

      Copyright 2007 Sun Microsystems, Inc. All rights reserved.

      OpenBoot 4.27.10, 16256 MB memory available, Serial #80085274.

      Ethernet address 0:14:4f:c6:1:1a, Host ID: 84c6011a.







      Boot device: /pci@0/pci@0/pci@2/LSILogic,sas@0/disk@0,0:a File and args: -k

      Loading kmdb...

      SunOS Release 5.10 Version Generic_120011-14 64-bit

      Copyright 1983-2007 Sun Microsystems, Inc. All rights reserved.

      Use is subject to license terms.

      WARNING: consconfig: cannot find driver for screen device

      Hostname: TL116

      NIS domain name is

      Aug 6 11:05:54 svc.startd[7]: svc:/network/nis/client:default: Method "/lib/svc/method/yp" failed with exit status 96.

      Aug 6 11:05:54 svc.startd[7]: network/nis/client:default misconfigured: transitioned to maintenance (see 'svcs -xv' for details)



      TL116 console login: Aug 6 11:06:05 TL116 sendmail[393]: My unqualified host name (TL116) unknown; sleeping for retry

      Aug 6 11:06:05 TL116 sendmail[394]: My unqualified host name (TL116) unknown; sleeping for retry

      panic: failed to stop cpu0

      Cross trap sync timeout at cpu_sync.xword[0]: 0x100000000000000



      panic[cpu40]/thread=300019c3340: xt_sync: timeout



      000002a101e50f60 unix:xt_sync+17c (2636ffc09e, 2a101e51010, 0, 0, 2634d37fef, 2634d37fff)

      %l0-3: 0000000000000001 8000000000000000 0000000000000000 000002a101e51010

      %l4-7: 0000000001855800 000000000103cc00 0100000000000000 00000000022c411f

      000002a101e51050 unix:hat_unload_callback+808 (1, 2a101e51380, 0, 0, 0, 30000029b40)

      %l0-3: 0000000000000000 0000000000000001 000002a101e51380 000002a101e51380

      %l4-7: 000000039f696768 000003000295a768 0000000000000001 000000000000fe47

      000002a101e513c0 genunix:anon_private+20c (2a101e515b0, 6001d9799e0, fe478000, 6001e175210, 7001a5e9b80, 0)

      %l0-3: 0000000000000002 0000000000000000 000000000000000b 00000600063f1990

      %l4-7: 00000300018b0000 0000000000000001 0000000000000000 000007001c2dae80

      000002a101e514c0 genunix:segvn_faultpage+924 (600063f1990, 6001d9799e0, fe478000, 0, 600063f1990, 7c)

      %l0-3: 0000000000000001 000006001d987d80 000000000000000b 0000000000000000

      %l4-7: 0000000000000000 0000000000000002 0000000000000001 000006001d974e40

      ----------------------------------------------------------------------------------------------------

      I think the key is the watchdog, because according to the information, the auto reset was requested by the Watchdog:

      ----------------------------------------------------------------------------------------------------
      Chassis | critical: SP Request to Reset Host due to Watchdog
      ----------------------------------------------------------------------------------------------------

      Actually I don't know clearly what is the "Watchdog", so I don't know that:

      # In what situation the Watchdog will reset the system? #

      I can not find out where the problem possiblily be...

      Would any one of you please help me?