6 Replies Latest reply: Jul 22, 2011 2:29 AM by Soory RSS

    Blade 1500 Solaris 10 Issue

    810681
      Hi there

      my Blade 1500 /var/adm/messages keeps showing:


      *Jan 21 20:30:17 blade1500 picld[162]: [ID 845468 daemon.crit] SUNW_piclenvd: 'cpu' sensor temperature 96 outside safe operating limits (0...95).*


      However it passes all self-tests etc and the fans seem OK.

      If I idle for 30 minutes the temp drops to around 65, but just a single tap on an ssh connected session and the temp jumps up to 94/95!!

      Is this issue something to be concerned with? Faulty sensor or genuine issue?

      I have OBP 4.9.5 (I think)

      Thanks
        • 1. Re: Blade 1500 Solaris 10 Issue
          ora_tech
          Hi,

          Can u post the O/P of prtdiag -v command.

          thanks,
          X A H E E R
          • 2. Re: Blade 1500 Solaris 10 Issue
            810681
            Hi there.

            Here it is :)


            -bash-3.00# prtdiag -v
            System Configuration: Sun Microsystems sun4u Sun Blade 1500
            System clock frequency: 177 MHZ
            Memory size: 1GB

            ==================================== CPUs ====================================
            E$ CPU CPU
            CPU Freq Size Implementation Mask Status Location
            --- -------- ---------- --------------------- ----- ------ --------
            0 1062 MHz 1MB SUNW,UltraSPARC-IIIi 2.4 on-line MB/0

            ================================= IO Devices =================================
            Bus Freq Slot + Name +
            Type MHz Status Path Model
            ------ ---- ---------- ---------------------------- --------------------
            pci 33 MB isa/su (serial)
            okay /pci@1e,600000/isa@7/serial@0,3f8

            pci 33 MB isa/su (serial)
            okay /pci@1e,600000/isa@7/serial@0,2e8

            pci 33 MB isa/isadma (dma)
            okay /pci@1e,600000/isa@7/dma@0,0

            pci 33 MB pci10b9,5451 (sound)
            okay /pci@1e,600000/sound@8

            pci 33 MB pci10b9,5229 (ide)
            okay /pci@1e,600000/ide@d

            pci 33 MB/PCI0 pciclass,0c0010 (firewire)
            okay /pci@1e,600000/pci@2/firewire@b

            pci 66 MB pci108e,1647 (network)
            okay /pci@1f,700000/network@2

            pci 66 MB/PCI4 SUNW,XVR-100 (display) SUNW,375-3181
            okay /pci@1f,700000/SUNW,XVR-100@3


            ============================ Memory Configuration ============================
            Segment Table:
            -----------------------------------------------------------------------
            Base Address Size Interleave Factor Contains
            -----------------------------------------------------------------------
            0x0 1GB 4 BankIDs 0,1,2,3

            Bank Table:
            -----------------------------------------------------------
            Physical Location
            ID ControllerID GroupID Size Interleave Way
            -----------------------------------------------------------
            0 0 0 256MB 0,1,2,3
            1 0 1 256MB
            2 0 1 256MB
            3 0 0 256MB

            Memory Module Groups:
            --------------------------------------------------
            ControllerID GroupID Labels Status
            --------------------------------------------------
            0 0 MB/DIMM0
            0 0 MB/DIMM1
            0 1 MB/DIMM2
            0 1 MB/DIMM3

            ============================ Environmental Status ============================
            Fan Status:
            -------------------------------------------
            Location Sensor Status
            -------------------------------------------
            F2 CPU okay
            F1 Intake okay
            F0 Outtake okay

            Temperature sensors:
            -----------------------------------------
            Location Sensor Status
            -----------------------------------------
            MB/0 Die okay
            MB Ambient okay
            MB Ambient okay

            ================================ HW Revisions ================================
            ASIC Revisions:
            -------------------------------------------------------------------
            Path Device Status Revision
            -------------------------------------------------------------------
            /pci@1e,600000 pci108e,a801 okay 4
            /pci@1f,700000 pci108e,a801 okay 4

            System PROM revisions:
            ----------------------
            OBP 4.9.5 2003/09/15 14:00 Sun Blade 1500
            OBDIAG 4.9.5 2003/09/15 14:00
            -bash-3.00#



            Sometimes the "MB/0 Die okay" will show Warning instead of OK!

            Thanks for your help...
            • 3. Re: Blade 1500 Solaris 10 Issue
              810681
              I should also say the cpu is idling at 99/100% most of the time!

              If I cold boot the system in the morning I see the temp at 65... as soon as I do anything (even an ls) it will" jump" up to 90+ and stay there until I let the system idle for an hour or so and then it will go back down to 65. Most of the time however it is between 90-96 (never exceeds 96 for some reason).

              These messages occur many times a day and are clogging the logs!

              Thanks.

              from mpstat 10

              CPU minf mjf xcal intr ithr csw icsw migr smtx srw syscl usr sys wt idl
              0 0 0 0 270 41 212 0 0 0 0 44 0 0 0 100
              0 0 0 0 279 43 232 0 0 0 0 84 0 0 0 99
              0 0 0 0 271 42 213 0 0 0 0 48 0 0 0 99
              0 0 0 0 273 43 213 0 0 0 0 48 0 0 0 99
              0 0 0 0 280 51 208 0 0 0 0 44 0 2 0 98
              0 0 0 0 284 43 202 0 0 0 0 50 0 0 0 100
              0 0 0 0 270 41 200 0 0 0 0 44 0 0 0 99
              0 0 0 0 284 56 208 0 0 0 0 44 0 2 0 98
              0 0 0 0 270 41 197 0 0 0 0 47 0 0 0 99
              0 0 0 0 271 43 195 0 0 0 0 45 0 0 0 99
              0 0 0 0 270 41 196 0 0 0 0 44 0 0 0 99
              0 0 0 0 271 42 196 0 0 0 0 47 0 0 0 100
              0 0 0 0 269 41 195 0 0 0 0 44 0 0 0 99
              0 0 0 0 272 43 213 0 0 0 0 76 0 0 0 99
              0 0 0 0 270 41 197 0 0 0 0 47 0 0 0 99
              0 0 0 0 270 42 195 0 0 0 0 45 0 0 0 99
              0 0 0 0 268 40 194 0 0 0 0 44 0 0 0 100
              0 0 0 0 271 42 196 0 0 0 0 47 0 0 0 99
              0 0 0 0 281 53 199 0 0 0 0 46 0 0 0 99
              0 0 0 0 285 56 201 0 0 0 0 46 0 0 0 99
              CPU minf mjf xcal intr ithr csw icsw migr smtx srw syscl usr sys wt idl
              0 0 0 0 286 56 200 0 0 0 0 48 0 0 0 100
              0 0 0 0 284 55 199 0 0 0 0 47 0 0 0 99
              0 0 0 0 284 56 200 0 0 0 0 46 0 0 0 100
              0 0 0 0 285 56 200 0 0 0 0 47 0 0 0 99
              0 0 0 0 284 55 200 0 0 0 0 47 0 0 0 100

              Edited by: 807678 on 25-Jan-2011 05:35
              • 4. Re: Blade 1500 Solaris 10 Issue
                SteveS
                Hello,

                OBP 4.9.5 is, as you can see from the bottom of the prtdiag output, 8yrs out of date. The latest for that system is OBP 4.30.4.a + POST 4.30.4.a + OBDIAG 4.30.4.a as delivered with patch

                140686-02 "Hardware/PROM: Sun Blade 1500 (non Silver) Flash PROM Update"
                140687-02 "Hardware/PROM: Sun Blade 1500 Silver Flash PROM Update"

                The difference between the Silver and Non Silver versions are denoted (visually) by the colour of the plastic bubble where you see the Sun Logo. If it's a Silver/Grey then you have the SB 1500 Silver edition. If it's red, then you have the non-Silver edition.

                Given the OBP is so far out of date I'd start there. picld interacts with the HW and there have been many fixes introduced in the OBP (firmware) that need to be applied.

                You may also want to open up the unit and clean out any dust and rubbish from inside and around the fans. If we're unable to get clean airflow through the system it'll over heat quickly. Remember to follow ESD (Electrostatic Discharge) precautions so you don't destroy any hardware components.

                Finally, what OS do you have installed? (uname -a). As stated above picld talks to the HW so you may also want to verify the picld patch is up to date.

                If you still have issues after you've updated OBP, picld, and given the system a good clean then please raise a service request and we'll take a closer look at the hardware. It may be a sensor that's on it's way out.

                Regards,
                Steve
                • 5. Re: Blade 1500 Solaris 10 Issue
                  877113
                  Is there any way to get the flash update without a support contract? I am using a private property blade 1500 to prepare for an oracle certification exam and I am having similar troubles with the OBP.

                  Regards,
                  Razvan
                  • 6. Re: Blade 1500 Solaris 10 Issue
                    Soory
                    You need contract to download, patches, firmware and etc.