This discussion is archived
1 Reply Latest reply: Dec 5, 2012 9:32 AM by LaserSoft RSS

Error PENDING state, err(0) installing clusterware 10.2.0.4

user531958 Newbie
Currently Being Moderated
Hi,

I'm installing oracle clusterware 10.2.0.4 in Windows server 2008, when finish setting up proccess, It run some configuration, the first config failed(Oracle Clusterware Configuration Assistant).

the error is :

Step 5: Starting up CRS stack on all nodes
sdf01 service OracleCSService in improper PENDING state, err(0)
sdf02 service OracleCSService in improper PENDING state, err(0)


we can not use the 11g version because the ERP is not yet certified for this release.

the servers 2008 are virtual machine built on vmware.

the shared storage is external via SAN

My configuration is :

1. Clusterware 10.2.0.4

2. 2 nodes windows 2008R2 x 64

3. shared storage is raw partitions from SAN :
Volume ### Ltr Label Fs Type Size Status Info
Volume 0 D DVD-ROM 0 B No Media
Volume 1 C NTFS Partition 60 GB Healthy System
Volume 2 E New Volume NTFS Partition 180 GB Healthy Pagefile
Volume 3 RAW Partition 400 GB Healthy
Volume 4 RAW Partition 300 MB Healthy --- For ocr primary
Volume 5 RAW Partition 300 MB Healthy --- For voting disk ( copy 1 )
Volume 6 RAW Partition 300 MB Healthy --- For ocr mirror
Volume 7 RAW Partition 300 MB Healthy --- For voting disk ( copy 2 )
Volume 8 RAW Partition 300 MB Healthy --- For voting disk ( copy 3 )
Volume 9 RAW Partition 300 MB Healthy
Volume 10 RAW Partition 300 MB Healthy
Volume 11 RAW Partition 300 MB Healthy
Volume 12 RAW Partition 300 MB Healthy
Volume 13 RAW Partition 97 GB Healthy

4. firewall off in node1 and node2

5. My host files is
127.0.0.1 localhost
192.168.13.106 SDF01
192.168.13.107 SDF02
172.16.16.2 SDF01-PRIV
172.16.16.3 SDF02-PRIV
192.168.13.114 SDF01-VIRT
192.168.13.115 SDF02-VIRT

6. the execution runcluvfy.bat is ok for all items ( except for VIP address )

7. ocssd.log is :
[    CSSD]2012-11-30 16:14:25.247 [2648] >TRACE: clssnmReadNodeInfo: added node 1 (sdf01) to cluster
[    CSSD]2012-11-30 16:14:25.278 [2648] >TRACE: clssnmReadNodeInfo: added node 2 (sdf02) to cluster
[    CSSD]2012-11-30 16:14:25.294 [2776] >TRACE: clssnm_skgxninit: Compatible vendor clusterware not in use
[    CSSD]2012-11-30 16:14:25.294 [2776] >TRACE: clssnm_skgxnmon: skgxn init failed
[    CSSD]2012-11-30 16:14:25.294 [2648] >TRACE: clssnmNMInitialize: misscount set to (60)
[    CSSD]2012-11-30 16:14:25.310 [2648] >TRACE: clssnmNMInitialize: Network heartbeat thresholds are: impending reconfig 30000 ms, reconfig start (misscount) 60000 ms
[    CSSD]2012-11-30 16:14:25.325 [2648] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (0/\\.\votedsk1)
[    CSSD]2012-11-30 16:14:25.325 [2296] >TRACE: clssnmvDPT: spawned for disk 0 (\\.\votedsk1)
[    CSSD]2012-11-30 16:14:27.338 [2296] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (0/\\.\votedsk1)
[    CSSD]2012-11-30 16:14:27.338 [2648] >TRACE: clssscSclsFatal: read value of disable
[    CSSD]2012-11-30 16:14:27.338 [2672] >TRACE: clssnmvKillBlockThread: spawned for disk 0 (\\.\votedsk1) initial sleep interval (1000)ms
[    CSSD]2012-11-30 16:14:27.338 [2648] >TRACE: clssscSclsFatal: read value of disable
[    CSSD]2012-11-30 16:14:27.338 [2820] >TRACE: clssnmFatalThread: spawned
[    CSSD]2012-11-30 16:14:27.338 [3004] >TRACE: clssnmClusterListener: Listening on (ADDRESS=(PROTOCOL=tcp)(HOST=sdf01-priv)(PORT=49895))

[    CSSD]2012-11-30 16:14:27.338 [3004] >TRACE: clssnmClusterListener: Probing node sdf02 (2), probcon(00000000042CF410)
[    CSSD]2012-11-30 16:14:27.338 [2112] >TRACE: clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=tcp)(HOST=127.0.0.1)(PORT=61101))
[    CSSD]2012-11-30 16:14:27.338 [1052] >TRACE: clssgmPeerListener: Listening on (ADDRESS=(PROTOCOL=tcp)(DEV=1208)(HOST=172.16.16.2)(PORT=49641))
[    CSSD]2012-11-30 16:14:27.369 [2296] >TRACE: clssnmReadDskHeartbeat: node(2) is down. rcfg(-1) wrtcnt(-1) LATS(19921561) Disk lastSeqNo(-1)
[    CSSD]2012-11-30 16:14:28.336 [3004] >TRACE: clsc_send_msg: (00000000042CEDF0) NS err (12571, 12560), transport (533, 57, 0)

[    CSSD]2012-11-30 16:14:28.336 [3004] >TRACE: clssnmDiscHelper: sdf02, node(2) connection failed, con (00000000042CF410), probe(00000000042CF410)
[    CSSD]2012-11-30 16:14:34.436 [2676] >TRACE: clssnmRcfgMgrThread: Local Join
[    CSSD]2012-11-30 16:17:50.138 [2676] >WARNING: clssnmLocalJoinEvent: takeover succ
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmDoSyncUpdate: Initiating sync 1
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmDoSyncUpdate: diskTimeout set to (57000)ms
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmSetupAckWait: Ack message type (11)
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmSetupAckWait: node(1) is ALIVE
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmSendSync: syncSeqNo(1)
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmWaitForAcks: Ack message type(11), ackCount(1)
[    CSSD]2012-11-30 16:17:50.138 [3004] >TRACE: clssnmHandleSync: diskTimeout set to (57000)ms
[    CSSD]2012-11-30 16:17:50.138 [3004] >TRACE: clssnmHandleSync: Acknowledging sync: src[1] srcName[sdf01] seq[1] sync[1]
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmWaitForAcks: done, msg type(11)
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmDoSyncUpdate: node(1) is transitioning from joining state to active state
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmSetupAckWait: Ack message type (13)
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmSetupAckWait: node(1) is ACTIVE
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmWaitForAcks: Ack message type(13), ackCount(1)
[    CSSD]2012-11-30 16:17:50.138 [3004] >TRACE: clssnmSendVoteInfo: node(1) syncSeqNo(1)
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmWaitForAcks: done, msg type(13)
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmCheckDskInfo: Checking disk info...
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmCheckDskInfo: diskTimeout set to (200000)ms
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmEvict: Start
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmWaitOnEvictions: Start
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmSetupAckWait: Ack message type (15)
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmSetupAckWait: node(1) is ACTIVE
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmSendUpdate: syncSeqNo(1)
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmWaitForAcks: Ack message type(15), ackCount(1)
[    CSSD]2012-11-30 16:17:50.138 [3004] >TRACE: clssnmUpdateNodeState: node 0, state (0/0) unique (0/0) prevConuni(0) birth (0/0) (old/new)
[    CSSD]2012-11-30 16:17:50.138 [3004] >TRACE: clssnmUpdateNodeState: node 1, state (2/3) unique (1354310065/1354310065) prevConuni(0) birth (1/1) (old/new)
[    CSSD]2012-11-30 16:17:50.138 [3004] >TRACE: clssnmUpdateNodeState: node 2, state (0/0) unique (0/0) prevConuni(0) birth (0/0) (old/new)
[    CSSD]2012-11-30 16:17:50.138 [3004] >USER: clssnmHandleUpdate: SYNC(1) from node(1) completed
[    CSSD]2012-11-30 16:17:50.138 [3004] >USER: clssnmHandleUpdate: NODE 1 (sdf01) IS ACTIVE MEMBER OF CLUSTER
[    CSSD]2012-11-30 16:17:50.138 [3004] >TRACE: clssnmHandleUpdate: diskTimeout set to (200000)ms
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmWaitForAcks: done, msg type(15)
[    CSSD]2012-11-30 16:17:50.138 [2676] >TRACE: clssnmDoSyncUpdate: Sync 1 complete!
[    CSSD]2012-11-30 16:17:50.153 [2296] >ERROR: clssnmvReadFatal: voting device corrupt (0xffffffff/0xffffffff/0/\\.\votedsk1)
[    CSSD]2012-11-30 16:17:50.231 [2648] >USER: NMEVENT_SUSPEND [00][00][00][00]
[    CSSD]2012-11-30 16:17:50.231 [2876] >TRACE: clssgmReconfigThread: started for reconfig (1)
[    CSSD]2012-11-30 16:17:50.231 [2876] >USER: NMEVENT_RECONFIG [00][00][00][02]
[    CSSD]2012-11-30 16:17:50.231 [2876] >TRACE: clssgmEstablishConnections: 1 nodes in cluster incarn 1
[    CSSD]2012-11-30 16:17:50.231 [1052] >TRACE: clssgmPeerListener: connects done (1/1)
[    CSSD]2012-11-30 16:17:50.231 [2876] >TRACE: clssgmEstablishMasterNode: MASTER for 1 is node(1) birth(1)
[    CSSD]2012-11-30 16:17:50.231 [2876] >TRACE: clssgmChangeMasterNode: requeued 0 RPCs
[    CSSD]2012-11-30 16:17:50.231 [2876] >TRACE: clssgmMasterCMSync: Synchronizing group/lock status
[    CSSD]2012-11-30 16:17:50.231 [2876] >TRACE: clssgmMasterSendDBDone: group/lock status synchronization complete
[    CSSD]CLSS-3000: reconfiguration successful, incarnation 1 with 1 nodes

[    CSSD]CLSS-3001: local node number 1, master node number 1

[    CSSD]2012-11-30 16:17:50.231 [2876] >TRACE: clssgmReconfigThread: completed for reconfig(1), with status(1)
[    CSSD]2012-11-30 16:19:22.599 [2540] >TRACE: clsc_send_msg: (000000000451EA10) NS err (12571, 12560), transport (533, 57, 0)

[    CSSD]2012-11-30 16:19:22.755 [2112] >WARNING: clssgmShutDown: Received explicit shutdown request from client.
[    CSSD]2012-11-30 16:19:22.755 [2112] >WARNING: clssgmClientShutdown: graceful shutdown completed.


I appreciate any help

thanks

Edited by: user531958 on 3/12/2012 12:09 PM

Legend

  • Correct Answers - 10 points
  • Helpful Answers - 5 points