I am trying to upgrade the Grid Infrastructure of a two-node RAC from 11.2.0.4 to 12.2.0.1, installed on Oracle Linux 7.4 (test environment).
I read a lot of documentation and MOS (Metalink) notes about the prerequisites before the upgrade.
Some of the prerequisites I completed are the following:
1. I applied 6 patches (27475913, 24422155, 20348910, 20898997, 19855835, 23186035) before the Grid Infrastructure out-of-place rolling upgrade (a quick way to verify this is sketched right after this list).
2. I executed orachk; the Oracle RAC Upgrade Readiness Report completed successfully with a System Health Score of 98 out of 100, with only ignorable warnings.
and so on.
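Regarding item 1, a simple way to confirm the patches are registered in the old 11.2.0.4 grid home is to list the inventory with opatch and filter for the patch numbers (the paths are the ones from my environment; the egrep is just an illustration):
[oracle@testrac1 ~]$ /u01/app/11.2.0/grid/OPatch/opatch lsinventory | egrep '27475913|24422155|20348910|20898997|19855835|23186035'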
After taking all the prerequisites into consideration, I executed ./gridSetup.sh from Node 1.
Everything was normal until rootupgrade.sh was executed on Node 2.
On Node 1 the rootupgrade.sh script executed successfully, and the output looks like this:
............................
............................
CRS-2673: Attempting to stop 'ora.gipcd' on 'testrac1'
CRS-2677: Stop of 'ora.gipcd' on 'testrac1' succeeded
CRS-2793: Shutdown of Oracle High Availability Services-managed resources on 'testrac1' has completed
CRS-4133: Oracle High Availability Services has been stopped.
CRS-4123: Oracle High Availability Services has been started.
2018/05/11 16:48:15 CLSRSC-343: Successfully started Oracle Clusterware stack
2018/05/11 16:48:15 CLSRSC-595: Executing upgrade step 18 of 19: 'UpgradeNode'.
2018/05/11 16:48:17 CLSRSC-474: Initiating upgrade of resource types
2018/05/11 16:49:23 CLSRSC-482: Running command: 'srvctl upgrade model -s 11.2.0.4.0 -d 12.2.0.1.0 -p first'
2018/05/11 16:49:23 CLSRSC-475: Upgrade of resource types successfully initiated.
2018/05/11 16:49:32 CLSRSC-595: Executing upgrade step 19 of 19: 'PostUpgrade'.
2018/05/11 16:49:38 CLSRSC-325: Configure Oracle Grid Infrastructure for a Cluster ... succeeded
I ran some checks from the new Grid Home after the successful rootupgrade.sh execution on Node 1.
[oracle@testrac1 grid]$ cd /u02/app/12.2.0/grid/bin
[oracle@testrac1 bin]$ ./crsctl stat res -t
--------------------------------------------------------------------------------
Name Target State Server State details
--------------------------------------------------------------------------------
Local Resources
--------------------------------------------------------------------------------
ora.ASMNET1LSNR_ASM.lsnr
ONLINE ONLINE testrac1 STABLE
OFFLINE OFFLINE testrac2 STABLE
ora.DATA.dg
ONLINE ONLINE testrac1 STABLE
ONLINE ONLINE testrac2 STABLE
ora.GIMR.dg
ONLINE ONLINE testrac1 STABLE
ONLINE ONLINE testrac2 STABLE
ora.LISTENER.lsnr
ONLINE ONLINE testrac1 STABLE
ONLINE ONLINE testrac2 STABLE
ora.net1.network
ONLINE ONLINE testrac1 STABLE
ONLINE ONLINE testrac2 STABLE
ora.ons
ONLINE ONLINE testrac1 STABLE
ONLINE ONLINE testrac2 STABLE
ora.proxy_advm
OFFLINE OFFLINE testrac1 STABLE
OFFLINE OFFLINE testrac2 STABLE
--------------------------------------------------------------------------------
Cluster Resources
--------------------------------------------------------------------------------
ora.LISTENER_SCAN1.lsnr
1 ONLINE ONLINE testrac1 STABLE
ora.LISTENER_SCAN2.lsnr
1 ONLINE ONLINE testrac2 STABLE
ora.LISTENER_SCAN3.lsnr
1 ONLINE ONLINE testrac2 STABLE
ora.asm
1 ONLINE ONLINE testrac1 Started,STABLE
2 ONLINE ONLINE testrac2 Started,STABLE
ora.oc4j
1 OFFLINE OFFLINE STABLE
ora.qosmserver
1 OFFLINE OFFLINE STABLE
ora.scan1.vip
1 ONLINE ONLINE testrac1 STABLE
ora.scan2.vip
1 ONLINE ONLINE testrac2 STABLE
ora.scan3.vip
1 ONLINE ONLINE testrac2 STABLE
ora.test.db
1 ONLINE ONLINE testrac1 Open,HOME=/u01/app/oracle/product/11.2.0/dbhome_1,STABLE
2 ONLINE ONLINE testrac2 Open,STABLE
ora.test.test_avisapp.svc
1 ONLINE ONLINE testrac2 STABLE
ora.test.test_avisjob.svc
1 ONLINE ONLINE testrac2 STABLE
ora.testrac1.vip
1 ONLINE ONLINE testrac1 STABLE
ora.testrac2.vip
1 ONLINE ONLINE testrac2 STABLE
--------------------------------------------------------------------------------
[oracle@testrac1 grid]$ crsctl query crs activeversion
Oracle Clusterware active version on the cluster is [11.2.0.4.0]
[oracle@testrac1 grid]$ crsctl query crs releaseversion
Oracle High Availability Services release version on the local node is [11.2.0.4.0]
[oracle@testrac1 grid]$ crsctl query crs softwareversion -all
Oracle Clusterware version on node [testrac1] is [12.2.0.1.0]
Oracle Clusterware version on node [testrac2] is [11.2.0.4.0]
But when the rootupgrade.sh script was executed on Node 2, the following error occurred. (The same error occurred on two separate two-node RAC machines; I tested it twice.)
The output looks like this (here I executed the rootupgrade.sh script twice, with a 15-minute interval):
[root@testrac2 ~]# /u02/app/12.2.0/grid/rootupgrade.sh
Performing root user operation.
The following environment variables are set as:
ORACLE_OWNER= oracle
ORACLE_HOME= /u02/app/12.2.0/grid
Enter the full pathname of the local bin directory: [/usr/local/bin]:
The file "dbhome" already exists in /usr/local/bin. Overwrite it? (y/n)
[n]:
The file "oraenv" already exists in /usr/local/bin. Overwrite it? (y/n)
[n]:
The file "coraenv" already exists in /usr/local/bin. Overwrite it? (y/n)
[n]:
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root script.
Now product-specific root actions will be performed.
Relinking oracle with rac_on option
Using configuration parameter file: /u02/app/12.2.0/grid/crs/install/crsconfig_params
The log of current session can be found at:
/u02/app/grid/crsdata/testrac2/crsconfig/rootcrs_testrac2_2018-05-11_05-27-55PM.log
2018/05/11 17:27:58 CLSRSC-595: Executing upgrade step 1 of 19: 'UpgradeTFA'.
2018/05/11 17:27:58 CLSRSC-4015: Performing install or upgrade action for Oracle Trace File Analyzer (TFA) Collector.
2018/05/11 17:28:46 CLSRSC-4003: Successfully patched Oracle Trace File Analyzer (TFA) Collector.
2018/05/11 17:28:46 CLSRSC-595: Executing upgrade step 2 of 19: 'ValidateEnv'.
2018/05/11 17:28:48 CLSRSC-595: Executing upgrade step 3 of 19: 'GenSiteGUIDs'.
2018/05/11 17:28:48 CLSRSC-595: Executing upgrade step 4 of 19: 'GetOldConfig'.
2018/05/11 17:28:48 CLSRSC-464: Starting retrieval of the cluster configuration data
2018/05/11 17:28:53 CLSRSC-465: Retrieval of the cluster configuration data has successfully completed.
2018/05/11 17:28:53 CLSRSC-595: Executing upgrade step 5 of 19: 'UpgPrechecks'.
2018/05/11 17:28:54 CLSRSC-363: User ignored prerequisites during installation
2018/05/11 17:28:55 CLSRSC-595: Executing upgrade step 6 of 19: 'SaveParamFile'.
2018/05/11 17:28:57 CLSRSC-595: Executing upgrade step 7 of 19: 'SetupOSD'.
2018/05/11 17:28:58 CLSRSC-595: Executing upgrade step 8 of 19: 'PreUpgrade'.
ASM configuration upgraded in local node successfully.
2018/05/11 17:29:01 CLSRSC-466: Starting shutdown of the current Oracle Grid Infrastructure stack
2018/05/11 17:38:14 CLSRSC-191: Failed to stop Oracle Clusterware stack
2018/05/11 17:38:14 CLSRSC-349: The Oracle Clusterware stack failed to stop
Died at /u02/app/12.2.0/grid/crs/install/crsupgrade.pm line 2990.
The command '/u02/app/12.2.0/grid/perl/bin/perl -I/u02/app/12.2.0/grid/perl/lib -I/u02/app/12.2.0/grid/crs/install /u02/app/12.2.0/grid/crs/install/rootcrs.pl -upgrade' execution failed
[root@testrac2 ~]# /u02/app/12.2.0/grid/rootupgrade.sh
Performing root user operation.
The following environment variables are set as:
ORACLE_OWNER= oracle
ORACLE_HOME= /u02/app/12.2.0/grid
Enter the full pathname of the local bin directory: [/usr/local/bin]:
The file "dbhome" already exists in /usr/local/bin. Overwrite it? (y/n)
[n]:
The file "oraenv" already exists in /usr/local/bin. Overwrite it? (y/n)
[n]:
The file "coraenv" already exists in /usr/local/bin. Overwrite it? (y/n)
[n]:
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root script.
Now product-specific root actions will be performed.
Relinking oracle with rac_on option
Using configuration parameter file: /u02/app/12.2.0/grid/crs/install/crsconfig_params
The log of current session can be found at:
/u02/app/grid/crsdata/testrac2/crsconfig/rootcrs_testrac2_2018-05-11_05-54-41PM.log
2018/05/11 17:54:41 CLSRSC-595: Executing upgrade step 1 of 19: 'UpgradeTFA'.
2018/05/11 17:54:41 CLSRSC-4015: Performing install or upgrade action for Oracle Trace File Analyzer (TFA) Collector.
2018/05/11 17:54:42 CLSRSC-4012: Shutting down Oracle Trace File Analyzer (TFA) Collector.
2018/05/11 17:56:20 CLSRSC-4013: Successfully shut down Oracle Trace File Analyzer (TFA) Collector.
2018/05/11 17:56:31 CLSRSC-4003: Successfully patched Oracle Trace File Analyzer (TFA) Collector.
2018/05/11 17:56:32 CLSRSC-595: Executing upgrade step 2 of 19: 'ValidateEnv'.
2018/05/11 17:56:33 CLSRSC-595: Executing upgrade step 3 of 19: 'GenSiteGUIDs'.
2018/05/11 17:56:33 CLSRSC-595: Executing upgrade step 4 of 19: 'GetOldConfig'.
2018/05/11 17:56:33 CLSRSC-464: Starting retrieval of the cluster configuration data
2018/05/11 17:56:38 CLSRSC-465: Retrieval of the cluster configuration data has successfully completed.
2018/05/11 17:56:38 CLSRSC-595: Executing upgrade step 5 of 19: 'UpgPrechecks'.
2018/05/11 17:56:39 CLSRSC-595: Executing upgrade step 6 of 19: 'SaveParamFile'.
2018/05/11 17:56:40 CLSRSC-595: Executing upgrade step 7 of 19: 'SetupOSD'.
2018/05/11 17:56:40 CLSRSC-595: Executing upgrade step 8 of 19: 'PreUpgrade'.
2018/05/11 17:58:41 CLSRSC-191: Failed to stop Oracle Clusterware stack
2018/05/11 17:58:41 CLSRSC-349: The Oracle Clusterware stack failed to stop
Died at /u02/app/12.2.0/grid/crs/install/crsupgrade.pm line 2915.
The command '/u02/app/12.2.0/grid/perl/bin/perl -I/u02/app/12.2.0/grid/perl/lib -I/u02/app/12.2.0/grid/crs/install /u02/app/12.2.0/grid/crs/install/rootcrs.pl -upgrade' execution failed
Lines 2915 and 2990 in /u02/app/12.2.0/grid/crs/install/crsupgrade.pm look like this:
.......
2912 if (! $old_crs_running)
2913 {
2914 trace("Make sure the older stack is completely down");
2915 stopClusterware($oldcrshome, "crs") || die(dieformat(349));
2916 }
.......
2989 if (! stopClusterware($oldcrshome, "crs")) {
2990 die(dieformat(349));
2991 }
2992 print_info(467);
.......
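As far as I understand, the stopClusterware($oldcrshome, "crs") call at these lines just makes sure the old 11.2.0.4 stack is completely down on the node, roughly equivalent to running something like the following from the old home (my assumption, not the exact code path):
[root@testrac2 ~]# /u01/app/11.2.0/grid/bin/crsctl check crs
[root@testrac2 ~]# /u01/app/11.2.0/grid/bin/crsctl stop crs -f
So the die(dieformat(349)) / CLSRSC-349 means that this stop of the old stack did not complete.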
I investigated the failed script.
First I checked all resources on Node 2 (with the -init option):
[oracle@testrac2 bin]$ ./crsctl stat res -t -init
--------------------------------------------------------------------------------
NAME TARGET STATE SERVER STATE_DETAILS
--------------------------------------------------------------------------------
Cluster Resources
--------------------------------------------------------------------------------
ora.asm
1 ONLINE OFFLINE Instance Shutdown
ora.cluster_interconnect.haip
1 ONLINE OFFLINE
ora.crf
1 OFFLINE UNKNOWN testrac2
ora.crsd
1 ONLINE OFFLINE
ora.cssd
1 ONLINE OFFLINE
ora.cssdmonitor
1 OFFLINE OFFLINE
ora.ctssd
1 ONLINE OFFLINE
ora.diskmon
1 OFFLINE OFFLINE
ora.evmd
1 ONLINE OFFLINE
ora.gipcd
1 ONLINE OFFLINE
ora.gpnpd
1 ONLINE OFFLINE
ora.mdnsd
1 ONLINE OFFLINE
[oracle@testrac2 bin]$ ./crsctl stat res ora.crf -init
NAME=ora.crf
TYPE=ora.crf.type
TARGET=ONLINE
STATE=UNKNOWN on testrac2
I think the problem is related to the ora.crf resource, because I cannot stop or start it cleanly.
When I try to stop it, the following error occurs:
[root@testrac2 bin]# ./crsctl stop res ora.crf -init
CRS-2679: Attempting to clean 'ora.crf' on 'testrac2'
CRS-2680: Clean of 'ora.crf' on 'testrac2' failed
CRS-5804: Communication error with agent process
CRS-4000: Command Stop failed, or completed with errors.
[root@testrac2 bin]# ./crsctl start res ora.crf -init
CRS-2672: Attempting to start 'ora.mdnsd' on 'testrac2'
CRS-2676: Start of 'ora.mdnsd' on 'testrac2' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'testrac2'
CRS-2676: Start of 'ora.gpnpd' on 'testrac2' succeeded
CRS-2679: Attempting to clean 'ora.crf' on 'testrac2'
CRS-2680: Clean of 'ora.crf' on 'testrac2' failed
CRS-2673: Attempting to stop 'ora.gpnpd' on 'testrac2'
CRS-2677: Stop of 'ora.gpnpd' on 'testrac2' succeeded
CRS-2673: Attempting to stop 'ora.mdnsd' on 'testrac2'
CRS-2677: Stop of 'ora.mdnsd' on 'testrac2' succeeded
CRS-5804: Communication error with agent process
CRS-4000: Command Start failed, or completed with errors.
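For reference, ora.crf is the Cluster Health Monitor resource, which manages the osysmond and ologgerd daemons, so one way to see what is actually running behind it is to check the processes directly (just a diagnostic illustration, not part of the upgrade procedure):
[root@testrac2 ~]# ps -ef | egrep 'osysmond|ologgerd' | grep -v grep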
When I reboot the Node 2 server, the ora.crf resource starts successfully and everything seems to be OK:
[oracle@testrac2 bin]$ ./crsctl stat res -t -init
--------------------------------------------------------------------------------
Name Target State Server State details
--------------------------------------------------------------------------------
Cluster Resources
--------------------------------------------------------------------------------
ora.asm
1 ONLINE ONLINE testrac2 Started,STABLE
ora.cluster_interconnect.haip
1 ONLINE ONLINE testrac2 STABLE
ora.crf
1 ONLINE ONLINE testrac2 STABLE
ora.crsd
1 ONLINE ONLINE testrac2 STABLE
ora.cssd
1 ONLINE ONLINE testrac2 STABLE
ora.cssdmonitor
1 ONLINE ONLINE testrac2 STABLE
ora.ctssd
1 ONLINE ONLINE testrac2 OBSERVER,STABLE
ora.diskmon
1 OFFLINE OFFLINE STABLE
ora.evmd
1 ONLINE ONLINE testrac2 STABLE
ora.gipcd
1 ONLINE ONLINE testrac2 STABLE
ora.gpnpd
1 ONLINE ONLINE testrac2 STABLE
ora.mdnsd
1 ONLINE ONLINE testrac2 STABLE
--------------------------------------------------------------------------------
But when I try to stop the ora.crf resource, the same error occurs again.
With the 'root' user:
[root@testrac2 bin]# ./crsctl stop res ora.crf -init
CRS-2679: Attempting to clean 'ora.crf' on 'testrac2'
CRS-2680: Clean of 'ora.crf' on 'testrac2' failed
CRS-5804: Communication error with agent process
CRS-4000: Command Stop failed, or completed with errors.
With the 'oracle' user:
[oracle@testrac2 bin]$ ./crsctl stop res ora.crf -init
CRS-2673: Attempting to stop 'ora.crf' on 'testrac2'
CRS-5017: The resource action "ora.crf stop" encountered the following error:
action for daemon aborted. For details refer to "(:CLSN00108:)" in "/u01/app/11.2.0/grid/log/testrac2/agent/ohasd/orarootagent_root//orarootagent_root.log".
CRS-2675: Stop of 'ora.crf' on 'testrac2' failed
CRS-2679: Attempting to clean 'ora.crf' on 'testrac2'
CRS-2678: 'ora.crf' on 'testrac2' has experienced an unrecoverable failure
CRS-0267: Human intervention required to resume its availability.
CRS-5804: Communication error with agent process
CRS-4000: Command Stop failed, or completed with errors.
The output in "/u01/app/11.2.0/grid/log/testrac2/agent/ohasd/orarootagent_root/orarootagent_root.log" is:
............
............
2018-05-11 21:44:35.974: [ AGFW][1228691200]{0:0:754} ora.crf 1 1 state changed from: ONLINE to: STOPPING
2018-05-11 21:44:35.974: [ora.crf][1224681216]{0:0:754} [stop] (:CLSN00108:) clsn_agent::stop {
2018-05-11 21:44:35.974: [ora.crf][1224681216]{0:0:754} [stop] Utils::getOracleHomeAttrib getEnvVar oracle_home:/u01/app/11.2.0/grid
2018-05-11 21:44:35.974: [ora.crf][1224681216]{0:0:754} [stop] Utils::getOracleHomeAttrib oracle_home:/u01/app/11.2.0/grid
2018-05-11 21:44:35.975: [ora.crf][1224681216]{0:0:754} [stop] PID 2028 from /u01/app/11.2.0/grid/osysmond/init/testrac2.pid
2018-05-11 21:44:35.975: [ora.crf][1224681216]{0:0:754} [stop] CLSDM Based stop action
2018-05-11 21:44:35.975: [ora.crf][1224681216]{0:0:754} [stop] Using Timeout value of 18000 for stop message
2018-05-11 21:44:35.977: [ora.crf][1224681216]{0:0:754} [stop] clsdmc_respget return: status=0, ecode=0
2018-05-11 21:45:31.423: [ AGFW][1228691200]{0:0:2} Agent received the message: AGENT_HB[Engine] ID 12293:2353
2018-05-11 21:46:01.425: [ AGFW][1228691200]{0:0:2} Agent received the message: AGENT_HB[Engine] ID 12293:2363
2018-05-11 21:46:31.426: [ AGFW][1228691200]{0:0:2} Agent received the message: AGENT_HB[Engine] ID 12293:2373
2018-05-11 21:46:35.977: [ AGENT][333440768]{0:0:754} {0:0:754} Created alert : (:CRSAGF00113:) : Aborting the command: stop for resource: ora.crf 1 1
2018-05-11 21:46:35.977: [ora.crf][333440768]{0:0:754} [stop] (:CLSN00110:) clsn_agent::abort {
2018-05-11 21:46:35.977: [ora.crf][333440768]{0:0:754} [stop] abort {
2018-05-11 21:46:35.977: [ora.crf][333440768]{0:0:754} [stop] Agent::abort last call info: "Agent::Agent refreshAttr"
2018-05-11 21:46:35.977: [ora.crf][333440768]{0:0:754} [stop] abort command: stop
2018-05-11 21:46:35.977: [ora.crf][333440768]{0:0:754} [stop] tryActionLock {
2018-05-11 21:46:35.990: [ora.crf][1224681216]{0:0:754} [stop] stop action aborted
2018-05-11 21:46:35.990: [ora.crf][1224681216]{0:0:754} [stop] clsnUtils::error Exception type=2 string=
CRS-5017: The resource action "ora.crf stop" encountered the following error:
action for daemon aborted. For details refer to "(:CLSN00108:)" in "/u01/app/11.2.0/grid/log/testrac2/agent/ohasd/orarootagent_root//orarootagent_root.log".
2018-05-11 21:46:35.990: [ AGFW][1224681216]{0:0:754} sending status msg [CRS-5017: The resource action "ora.crf stop" encountered the following error:
action for daemon aborted. For details refer to "(:CLSN00108:)" in "/u01/app/11.2.0/grid/log/testrac2/agent/ohasd/orarootagent_root//orarootagent_root.log".
] for stop for resource: ora.crf 1 1
2018-05-11 21:46:35.990: [ora.crf][1224681216]{0:0:754} [stop] (:CLSN00108:) clsn_agent::stop }
2018-05-11 21:46:35.991: [ AGFW][1228691200]{0:0:754} Agent sending reply for: RESOURCE_STOP[ora.crf 1 1] ID 4099:2335
2018-05-11 21:46:39.977: [ora.crf][333440768]{0:0:754} [stop] got lock
2018-05-11 21:46:39.977: [ora.crf][333440768]{0:0:754} [stop] tryActionLock }
2018-05-11 21:46:39.977: [ora.crf][333440768]{0:0:754} [stop] abort }
2018-05-11 21:46:39.977: [ora.crf][333440768]{0:0:754} [stop] (:CLSN00110:) clsn_agent::abort }
2018-05-11 21:46:39.977: [ AGFW][333440768]{0:0:754} Command: stop for resource: ora.crf 1 1 completed with status: TIMEDOUT
2018-05-11 21:46:39.977: [ AGFW][1228691200]{0:0:754} Agent sending reply for: RESOURCE_STOP[ora.crf 1 1] ID 4099:2335
[ clsdmc][333440768]Timeout [30sec] to receive meta message from connection [(ADDRESS=(PROTOCOL=ipc)(KEY=testrac2DBG_MOND))]
2018-05-11 21:46:57.978: [ora.crf][333440768]{0:0:754} [check] Error = timed out when waiting for response from MOND
2018-05-11 21:46:57.979: [ora.crf][333440768]{0:0:754} [check] Calling PID check for daemon
2018-05-11 21:46:57.979: [ora.crf][333440768]{0:0:754} [check] Process id 2028 translated to
2018-05-11 21:47:01.423: [ AGFW][1228691200]{0:0:2} Agent received the message: AGENT_HB[Engine] ID 12293:2385
2018-05-11 21:47:31.424: [ AGFW][1228691200]{0:0:2} Agent received the message: AGENT_HB[Engine] ID 12293:2395
2018-05-11 21:47:39.975: [ AGENT][1232893696]{0:0:754} {0:0:754} Created alert : (:CRSAGF00113:) : Aborting the command: check for resource: ora.crf 1 1
2018-05-11 21:47:39.975: [ora.crf][1232893696]{0:0:754} [check] (:CLSN00110:) clsn_agent::abort {
2018-05-11 21:47:39.975: [ora.crf][1232893696]{0:0:754} [check] abort {
2018-05-11 21:47:39.975: [ora.crf][1232893696]{0:0:754} [check] Agent::abort last call info: "Agent::Agent refreshAttr"
2018-05-11 21:47:39.975: [ora.crf][1232893696]{0:0:754} [check] abort command: check
2018-05-11 21:47:39.975: [ora.crf][1232893696]{0:0:754} [check] tryActionLock {
2018-05-11 21:48:01.426: [ AGFW][1228691200]{0:0:2} Agent received the message: AGENT_HB[Engine] ID 12293:2405
2018-05-11 21:48:31.427: [ AGFW][1228691200]{0:0:2} Agent received the message: AGENT_HB[Engine] ID 12293:2415
2018-05-11 21:48:39.977: [ora.crf][1232893696]{0:0:754} [check] did not get lock
2018-05-11 21:48:39.977: [ora.crf][1232893696]{0:0:754} [check] tryActionLock }
2018-05-11 21:48:39.977: [ora.crf][1232893696]{0:0:754} [check] clsn_agent::abort: Exception LockAcquireTimeoutException
2018-05-11 21:48:39.977: [ora.crf][1232893696]{0:0:754} [check] clsn_agent::abort, agent exiting }
2018-05-11 21:48:39.977: [ AGFW][1232893696]{0:0:754} Agent is exiting with exit code: -1
2018-05-11 21:48:40.100: [ AGENT][1546811200] Logging level for Module: allcomp 0
2018-05-11 21:48:40.100: [ AGENT][1546811200] Logging level for Module: default 0
2018-05-11 21:48:40.100: [ AGENT][1546811200] Logging level for Module: AGENT 1
2018-05-11 21:48:40.100: [ AGENT][1546811200] Logging level for Module: AGFW 1
2018-05-11 21:48:40.100: [ AGENT][1546811200] Logging level for Module: CLSFRAME 0
2018-05-11 21:48:40.100: [ AGENT][1546811200] Logging level for Module: CRSCOMM 0
2018-05-11 21:48:40.100: [ AGENT][1546811200] Logging level for Module: CRSTIMER 0
2018-05-11 21:48:40.100: [ AGENT][1546811200] Logging level for Module: USRTHRD 1
2018-05-11 21:48:40.100: [ AGFW][1546811200] Starting the agent: /u01/app/11.2.0/grid/log/testrac2/agent/ohasd/orarootagent_root/
2018-05-11 21:48:40.100: [ AGENT][1546811200] Agent framework initialized, Process Id = 15711
2018-05-11 21:48:40.101: [ USRTHRD][1546811200] Process::convertPidToString pid = 15711
2018-05-11 21:48:40.102: [ AGFW][1546811200] SERVER IPC CONNECT STR: (ADDRESS=(PROTOCOL=IPC)(KEY=OHASD_IPC_SOCKET_11))
2018-05-11 21:48:40.102: [ AGFW][1546811200] Agent' version is: 2
2018-05-11 21:48:40.102: [CLSFRAME][1546811200] Inited lsf context 0x2c59500
2018-05-11 21:48:40.102: [CLSFRAME][1546811200] Initing CLS Framework messaging
2018-05-11 21:48:40.102: [CLSFRAME][1546811200] New Framework state: 2
2018-05-11 21:48:40.102: [CLSFRAME][1546811200] M2M is starting...
2018-05-11 21:48:40.102: [ CRSCOMM][1546811200] Ipc: Starting send thread
2018-05-11 21:48:40.102: [ CRSCOMM][1442486016] Ipc: sendWork thread started.
2018-05-11 21:48:40.103: [ CRSCOMM][1546811200] Connected to server running as user: root
2018-05-11 21:48:40.103: [ CRSCOMM][1440384768] IpcC: IPC Client thread started listening
2018-05-11 21:48:40.103: [ CRSCOMM][1440384768] IpcC: Received member number of 19
2018-05-11 21:48:40.103: [ CRSCOMM][1440384768] IpcC: Member data received
2018-05-11 21:48:40.103: [CLSFRAME][1440384768] New IPC Member:{Relative|Node:0|Process:0|Type:2}:OHASD:testrac2 username=root
2018-05-11 21:48:40.103: [CLSFRAME][1440384768] New process connected to us ID:{Relative|Node:0|Process:0|Type:2} Info:OHASD:testrac2
2018-05-11 21:48:40.104: [CLSFRAME][1546811200] Tints initialized with nodeId: 0 procId: 19
2018-05-11 21:48:40.104: [CLSFRAME][1546811200] Starting thread model named: MultiThread
2018-05-11 21:48:40.104: [CLSFRAME][1546811200] Starting thread model named: TimerSharedTM
2018-05-11 21:48:40.104: [CLSFRAME][1546811200] New Framework state: 3
2018-05-11 21:48:40.104: [ AGFW][1546811200] Agent Framework started successfully
2018-05-11 21:48:40.104: [ AGFW][1429878528]{0:19:2} Agfw engine module has enabled...
2018-05-11 21:48:40.104: [CLSFRAME][1429878528]{0:19:2} Module Enabling is complete
2018-05-11 21:48:40.104: [CLSFRAME][1429878528]{0:19:2} New Framework state: 6
2018-05-11 21:48:40.104: [CLSFRAME][1546811200] M2M is now powered by a doWork() thread.
2018-05-11 21:48:40.104: [ AGFW][1429878528]{0:19:2} Agent is started with userid: root , expected user: root
2018-05-11 21:48:40.104: [ CLSVER][1429878528]{0:19:2} Static Version 11.2.0.4.0
2018-05-11 21:48:40.105: [ AGFW][1429878528]{0:19:2} Agent sending message to PE: AGENT_HANDSHAKE[Proxy] ID 20484:11
2018-05-11 21:48:40.112: [ AGFW][1429878528]{0:19:2} Agent received the message: RESTYPE_ADD[ora.crf.type] ID 8196:2437
2018-05-11 21:48:40.113: [ AGFW][1429878528]{0:19:2} Added new restype: ora.crf.type
2018-05-11 21:48:40.113: [ AGFW][1429878528]{0:19:2} Agent sending last reply for: RESTYPE_ADD[ora.crf.type] ID 8196:2437
2018-05-11 21:48:40.113: [ AGFW][1429878528]{0:19:2} Agent received the message: RESTYPE_ADD[ora.crs.type] ID 8196:2438
2018-05-11 21:48:40.113: [ AGFW][1429878528]{0:19:2} Added new restype: ora.crs.type
2018-05-11 21:48:40.113: [ AGFW][1429878528]{0:19:2} Agent sending last reply for: RESTYPE_ADD[ora.crs.type] ID 8196:2438
2018-05-11 21:48:40.113: [ AGFW][1429878528]{0:19:2} Agent received the message: RESTYPE_ADD[ora.ctss.type] ID 8196:2439
2018-05-11 21:48:40.113: [ AGFW][1429878528]{0:19:2} Added new restype: ora.ctss.type
2018-05-11 21:48:40.113: [ AGFW][1429878528]{0:19:2} Agent sending last reply for: RESTYPE_ADD[ora.ctss.type] ID 8196:2439
2018-05-11 21:48:40.113: [ AGFW][1429878528]{0:19:2} Agent received the message: RESTYPE_ADD[ora.diskmon.type] ID 8196:2440
2018-05-11 21:48:40.114: [ AGFW][1429878528]{0:19:2} Added new restype: ora.diskmon.type
2018-05-11 21:48:40.114: [ AGFW][1429878528]{0:19:2} Agent sending last reply for: RESTYPE_ADD[ora.diskmon.type] ID 8196:2440
2018-05-11 21:48:40.114: [ AGFW][1429878528]{0:19:2} Agent received the message: RESTYPE_ADD[ora.drivers.acfs.type] ID 8196:2441
2018-05-11 21:48:40.114: [ AGFW][1429878528]{0:19:2} Added new restype: ora.drivers.acfs.type
2018-05-11 21:48:40.114: [ AGFW][1429878528]{0:19:2} Agent sending last reply for: RESTYPE_ADD[ora.drivers.acfs.type] ID 8196:2441
2018-05-11 21:48:40.114: [ AGFW][1429878528]{0:19:2} Agent received the message: RESTYPE_ADD[ora.haip.type] ID 8196:2442
2018-05-11 21:48:40.114: [ AGFW][1429878528]{0:19:2} Added new restype: ora.haip.type
2018-05-11 21:48:40.114: [ AGFW][1429878528]{0:19:2} Agent sending last reply for: RESTYPE_ADD[ora.haip.type] ID 8196:2442
2018-05-11 21:48:40.114: [ AGFW][1429878528]{0:19:2} Agent received the message: RESOURCE_ADD[ora.cluster_interconnect.haip 1 1] ID 4356:2443
2018-05-11 21:48:40.114: [ AGFW][1429878528]{0:19:2} Added new resource: ora.cluster_interconnect.haip 1 1 to the agfw
2018-05-11 21:48:40.115: [ AGFW][1429878528]{0:19:2} Agent sending last reply for: RESOURCE_ADD[ora.cluster_interconnect.haip 1 1] ID 4356:2443
2018-05-11 21:48:40.115: [ AGFW][1429878528]{0:19:2} Agent received the message: RESOURCE_ADD[ora.crf 1 1] ID 4356:2444
2018-05-11 21:48:40.115: [ AGFW][1429878528]{0:19:2} Added new resource: ora.crf 1 1 to the agfw
2018-05-11 21:48:40.115: [ AGFW][1429878528]{0:19:2} Agent sending last reply for: RESOURCE_ADD[ora.crf 1 1] ID 4356:2444
2018-05-11 21:48:40.115: [ AGFW][1429878528]{0:0:754} Agent received the message: RESOURCE_CLEAN[ora.crf 1 1] ID 4100:2445
2018-05-11 21:48:40.115: [ AGFW][1429878528]{0:0:754} Preparing CLEAN command for: ora.crf 1 1
2018-05-11 21:48:40.115: [ AGFW][1429878528]{0:0:754} ora.crf 1 1 state changed from: UNKNOWN to: CLEANING
2018-05-11 21:48:40.115: [ AGFW][1429878528]{0:0:754} Agent received the message: RESOURCE_ADD[ora.crsd 1 1] ID 4356:2446
2018-05-11 21:48:40.115: [ AGFW][1429878528]{0:0:754} Added new resource: ora.crsd 1 1 to the agfw
2018-05-11 21:48:40.115: [ AGFW][1429878528]{0:0:754} Agent sending last reply for: RESOURCE_ADD[ora.crsd 1 1] ID 4356:2446
2018-05-11 21:48:40.116: [ AGFW][1429878528]{0:0:754} Agent received the message: RESOURCE_ADD[ora.ctssd 1 1] ID 4356:2447
2018-05-11 21:48:40.116: [ AGFW][1429878528]{0:0:754} Added new resource: ora.ctssd 1 1 to the agfw
2018-05-11 21:48:40.116: [ AGFW][1429878528]{0:0:754} Agent sending last reply for: RESOURCE_ADD[ora.ctssd 1 1] ID 4356:2447
2018-05-11 21:48:40.116: [ AGFW][1429878528]{0:0:754} Agent received the message: RESOURCE_ADD[ora.diskmon 1 1] ID 4356:2448
2018-05-11 21:48:40.116: [ AGFW][1429878528]{0:0:754} Added new resource: ora.diskmon 1 1 to the agfw
2018-05-11 21:48:40.116: [ AGFW][1429878528]{0:0:754} Agent sending last reply for: RESOURCE_ADD[ora.diskmon 1 1] ID 4356:2448
2018-05-11 21:48:40.118: [ora.crf][1431979776]{0:0:754} [clean] (:CLSN00106:) clsn_agent::clean {
2018-05-11 21:48:40.118: [ora.crf][1431979776]{0:0:754} [clean] Agent::Agent pAgent:30006bd0 resName:30006c08 ora.crf
2018-05-11 21:48:40.118: [ora.crf][1431979776]{0:0:754} [clean] __IS_HASD_AGENT=TRUE
2018-05-11 21:48:40.118: [ora.crf][1431979776]{0:0:754} [clean] Utils::getOracleHomeAttrib getEnvVar oracle_home:/u01/app/11.2.0/grid
2018-05-11 21:48:40.118: [ora.crf][1431979776]{0:0:754} [clean] Utils::getOracleHomeAttrib oracle_home:/u01/app/11.2.0/grid
2018-05-11 21:48:40.119: [ora.crf][1431979776]{0:0:754} [clean] PID 2028 from /u01/app/11.2.0/grid/osysmond/init/testrac2.pid
2018-05-11 21:48:40.119: [ora.crf][1431979776]{0:0:754} [clean] Process id 2028 translated to
2018-05-11 21:48:40.119: [ora.crf][1431979776]{0:0:754} [clean] (:CLSN00106:) PID Name, PID does not match
2018-05-11 21:48:40.119: [ora.crf][1431979776]{0:0:754} [clean] (:CLSN00106:) clsn_agent::clean }
2018-05-11 21:48:40.119: [ AGFW][1431979776]{0:0:754} Command: clean for resource: ora.crf 1 1 completed with status: SUCCESS
2018-05-11 21:48:40.120: [ AGFW][1429878528]{0:0:754} Agent sending reply for: RESOURCE_CLEAN[ora.crf 1 1] ID 4100:2445
2018-05-11 21:48:40.120: [CLSFRAME][1546811200] TM [MultiThread] is changing desired thread # to 3. Current # is 2
2018-05-11 21:48:40.120: [CLSFRAME][1546811200] TM [MultiThread] is changing desired thread # to 4. Current # is 3
2018-05-11 21:49:40.087: [ AGFW][1429878528]{0:0:754} Agent received the message: AGENT_HB[Engine] ID 12293:2477
2018-05-11 21:49:40.127: [ AGENT][1434081024]{0:0:754} {0:0:754} Created alert : (:CRSAGF00113:) : Aborting the command: check for resource: ora.crf 1 1
2018-05-11 21:49:40.128: [ora.crf][1434081024]{0:0:754} [check] (:CLSN00110:) clsn_agent::abort {
2018-05-11 21:49:40.128: [ora.crf][1434081024]{0:0:754} [check] abort {
2018-05-11 21:49:40.128: [ora.crf][1434081024]{0:0:754} [check] Agent::abort last call info: "Agent::Agent refreshAttr"
2018-05-11 21:49:40.128: [ora.crf][1434081024]{0:0:754} [check] abort command: check
2018-05-11 21:49:40.128: [ora.crf][1434081024]{0:0:754} [check] tryActionLock {
2018-05-11 21:50:10.087: [ AGFW][1429878528]{0:0:754} Agent received the message: AGENT_HB[Engine] ID 12293:2489
2018-05-11 21:50:10.127: [ AGFW][1546811200] Recvd request to shed the threads
2018-05-11 21:50:10.127: [CLSFRAME][1546811200] TM [MultiThread] is changing desired thread # to 3. Current # is 4
2018-05-11 21:50:10.127: [CLSFRAME][1425676032]{0:19:4} Worker thread is exiting in TM [MultiThread] to meet the desired count of 3. New count is 3
2018-05-11 21:50:40.088: [ AGFW][1429878528]{0:0:754} Agent received the message: AGENT_HB[Engine] ID 12293:2499
2018-05-11 21:50:40.129: [ora.crf][1434081024]{0:0:754} [check] did not get lock
2018-05-11 21:50:40.129: [ora.crf][1434081024]{0:0:754} [check] tryActionLock }
2018-05-11 21:50:40.130: [ora.crf][1434081024]{0:0:754} [check] clsn_agent::abort: Exception LockAcquireTimeoutException
2018-05-11 21:50:40.130: [ora.crf][1434081024]{0:0:754} [check] clsn_agent::abort, agent exiting }
2018-05-11 21:50:40.130: [ AGFW][1434081024]{0:0:754} Agent is exiting with exit code: -1
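In the log above the agent reads PID 2028 from /u01/app/11.2.0/grid/osysmond/init/testrac2.pid and then logs "PID Name, PID does not match", so it seems to be comparing that PID file against the running process. A manual check one could do (just an illustration based on the paths shown in the log) is:
[root@testrac2 ~]# cat /u01/app/11.2.0/grid/osysmond/init/testrac2.pid
[root@testrac2 ~]# ps -p 2028 -o pid,ppid,cmd
[root@testrac2 ~]# ps -ef | grep osysmond.bin | grep -v grep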
Please help. How can I solve this problem and continue the upgrade process?