9 Replies Latest reply on Aug 29, 2008 3:42 AM by 26741

    Rman Backup Fail

    549698
      Hi Friends,

      I'm trying to configure RMAN backup on Oracle 9.2.0.6 64bit EE Database. Database is running on No-Archive mode, and RMAN Offline backup is scheduled once a week.My backup works sometimes and most of the time it Fails. I wonder what could be the reason. Bellow is RMAN configuration on the database,The script to run offline backup and the Error Msg. I'm looking for help to fix this.

      1) RMAN config

      RMAN> connect target

      connected to target database: RADPRDMH (DBID=3693458060)

      RMAN> show all;

      using target database controlfile instead of recovery catalog
      RMAN configuration parameters are:
      CONFIGURE RETENTION POLICY TO RECOVERY WINDOW OF 1 DAYS;
      CONFIGURE BACKUP OPTIMIZATION ON;
      CONFIGURE DEFAULT DEVICE TYPE TO 'SBT_TAPE';
      CONFIGURE CONTROLFILE AUTOBACKUP ON;
      CONFIGURE CONTROLFILE AUTOBACKUP FORMAT FOR DEVICE TYPE 'SBT_TAPE' TO '%F';
      CONFIGURE CONTROLFILE AUTOBACKUP FORMAT FOR DEVICE TYPE DISK TO '%F';
      CONFIGURE DEVICE TYPE DISK PARALLELISM 1;
      CONFIGURE DEVICE TYPE 'SBT_TAPE' PARALLELISM 1;
      CONFIGURE DATAFILE BACKUP COPIES FOR DEVICE TYPE 'SBT_TAPE' TO 1;
      CONFIGURE DATAFILE BACKUP COPIES FOR DEVICE TYPE DISK TO 1;
      CONFIGURE ARCHIVELOG BACKUP COPIES FOR DEVICE TYPE 'SBT_TAPE' TO 1;
      CONFIGURE ARCHIVELOG BACKUP COPIES FOR DEVICE TYPE DISK TO 1;
      CONFIGURE CHANNEL DEVICE TYPE DISK FORMAT '/u02/backup/SID_df_%t_%s_%p';
      CONFIGURE MAXSETSIZE TO UNLIMITED;
      CONFIGURE SNAPSHOT CONTROLFILE NAME TO '/u01/oracle/dbs/snapcf_SID.f'; # default


      2) RMAN Script
      run {

      SHUTDOWN IMMEDIATE;
      STARTUP MOUNT;
      allocate channel c1 type 'SBT_TAPE' parms
      'ENV=(TDPO_OPTFILE=/usr/tivoli/tsm/client/oracle/bin64/tdpo.opt)';
      BACKUP DATABASE;
      ALTER DATABASE OPEN;
      release channel c1;
      }

      3) Error Msg

      RMAN-00571: ===========================================================
      RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============
      RMAN-00571: ===========================================================
      RMAN-03009: failure of backup command on c1 channel at 08/22/2008 23:30:09
      ORA-19502: write error on file "6gjon78h_1_1", blockno 134996481
      (blocksize=512)
      ORA-27030: skgfwrt: sbtwrite2 returned error
      ORA-19511: Error received from media manager layer, error text:
      ANS1235E (RC-72) An unknown system error has occurred from which TSM
      cannot recover.
        • 1. Re: Rman Backup Fail
          26741
          Oracle was unable to write to the Tape Drive. The "unknown error" is from the Media Managment Layer (TSM -- Tivoli ??)
          "ANS1235E (RC-72) An unknown system error has occurred from which TSM cannot recover."

          You must look for errors in Tivoli.

          Edited by: Hemant K Chitale on Aug 26, 2008 12:11 PM
          • 2. Re: Rman Backup Fail
            247514
            also this part of Oracle doc might be useful,

            [Interpreting RMAN Message Output|http://download.oracle.com/docs/cd/B19306_01/backup.102/b14191/rcmtroub001.htm#BRADV174]
            • 3. Re: Rman Backup Fail
              Surachart Opun
              Try to check trace file on USER_DUMP_DEST (sbtio.log)


              From:

              ORA-27030: skgfwrt: sbtwrite2 returned error
              ORA-19511: Error received from media manager layer, error text:
              ANS1235E (RC-72) An unknown system error has occurred from which TSM
              cannot recover.


              Anyway,
              This message can not provide enough the problem, you should get more error from Media Management software.
              • 4. Re: Rman Backup Fail
                584650
                Check the TSM error log for more information. The problem is with the media manager not RMAN. Verify the correct parameters were passed in the allocate channel.
                • 5. Re: Rman Backup Fail
                  549698
                  Hi,

                  If I would to check the error log from dsm.sys, I get following error msg:-
                  Could it because of my RMAN configuration?

                  08/22/08 23:30:00 Error -50 sending request
                  08/22/08 23:30:01 ANS1235E An unknown system error has occurred from which TSM cannot recover.
                  08/22/08 23:30:01 ANS1235E An unknown system error has occurred from which TSM cannot recover.
                  08/22/08 23:30:01 sessSendVerb: Error sending Verb, rc: -71
                  08/22/08 23:30:11 ANS4994S TDP Oracle AIX ANU0599 TDP for Oracle: (46138): =>(XXX01_ora) ANU2602E The object /SID//6gjon78h_1_1 was not found on the TSM Server
                  • 6. Re: Rman Backup Fail
                    26741

                    TSM is trying to update a file which doesn't exist yet. Oracle didn't ask it to update a file -- Oracle is sending a full backup but TSM is trying to merge the backup with a supposedly existing backup. Talk to your TSM administrator or Support.


                    • 7. Re: Rman Backup Fail
                      549698
                      Hi,

                      I'm working with the TSM administrator, while googling for some help, I'm now tried testing the sbt.Following are the steps I tested,please help if you can understand the error msg.

                      1) $ORACLE_HOME/bin/sbttest

                      2) chmod 6751 $ORACLE_HOME/bin/sbttest

                      3) sbttest SID
                      The sbt function pointers are loaded from libobk.a(shr.o) library.
                      -- sbtinit succeeded
                      Return code -1 from sbtinit, bsercoer = 0, bsercerrno = 0
                      Message 0 not found; product=RDBMS; facility=SBT

                      4) $ more tdpoerror.log
                      08/22/2008 23:30:01 TID<46138> ==> ANU2539E sbtwrite2(): Error - buf pointer is NULL.
                      red from which TSM cannot recover.

                      02E Invalid command:
                      10/09/2006 15:55:08 ANU0102E Invalid command:
                      10/09/2006 15:57:42 ANU0102E Invalid command:
                      08/26/2008 15:48:54 TID<35060> ==> tdpoInit(): Error - Could not initialize InitOrcBlock()
                      08/26/2008 16:17:10 TID<37994> ==> tdpoInit(): Error - Could not initialize InitOrcBlock()
                      • 8. Re: Rman Backup Fail
                        549698
                        Hi Friends,

                        Can anyone help me out with this? I need to get this fixed. The problem is sometimes my backup goes trough and most of the time it does not. it's like trying at the same time for 5 times and the 6th time it goes trough. Could it be due to the time,or some pick time or what exactly could it be? any idea?

                        Appriciate your reply.
                        • 9. Re: Rman Backup Fail
                          26741
                          Two suggestions :

                          1. Run RMAN Backups to disk.


                          2. set OPTIMIZATION OFF. Do you have any datafiles that are read only / set to read only / not updated on that 6th day ?

                          See [http://download.oracle.com/docs/cd/B19306_01/backup.102/b14191/rcmconc1008.htm#sthref314]