zm_tree posted on 2014-10-17 15:19:48

ORA-00600: [1880], [0x01247D258], [0x01247CEE8], [1], [0], [], [], []

-----Environment information
$ uname -a
Linux gps02.yto.com 2.6.18-194.el5 #1 SMP Tue Mar 16 21:52:39 EDT 2010 x86_64 x86_64 x86_64 GNU/Linux

SQL> select * from v$version;
BANNER
--------------------------------------------------------------------------------
Oracle Database 10g Enterprise Edition Release 10.2.0.5.0 - 64bi
PL/SQL Release 10.2.0.5.0 - Production
CORE    10.2.0.5.0      Production
TNS for Linux: Version 10.2.0.5.0 - Production
NLSRTL Version 10.2.0.5.0 - Production

SQL> select * from registry$history;
no rows selected

Two-node RAC.

--Log information
Node 2 keeps reporting the following errors:
Errors in file /opt/app/oracle/admin/gpsdb/bdump/gpsdb2_arc0_19489.trc:
ORA-00600: internal error code, arguments: , , , , , [], [], []
Fri Oct 17 15:07:29 CST 2014
Errors in file /opt/app/oracle/admin/gpsdb/bdump/gpsdb2_arc0_19489.trc:
ORA-00600: internal error code, arguments: , , , , , [], [], []
Fri Oct 17 15:07:29 CST 2014
Errors in file /opt/app/oracle/admin/gpsdb/bdump/gpsdb2_arc0_19489.trc:
ORA-00600: internal error code, arguments: , , , , , [], [], []
Fri Oct 17 15:07:31 CST 2014
ARCH: Detected ARCH process failure
ARCH: STARTING ARCH PROCESSES
ARC0: Archival started
ARCH: STARTING ARCH PROCESSES COMPLETE
ARC0 started with pid=50, OS id=19533
ARC0: Becoming the 'no FAL' ARCH
ARC0: Becoming the 'no SRL' ARCH
Fri Oct 17 15:07:34 CST 2014
Errors in file /opt/app/oracle/admin/gpsdb/bdump/gpsdb2_arc0_19533.trc:
ORA-00600: internal error code, arguments: , , , , , [], [], []
Fri Oct 17 15:07:35 CST 2014

Node 1:
Fri Oct 17 15:08:27 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:08:31 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:08:34 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:08:37 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:10:26 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:10:32 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:10:38 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:10:44 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:10:50 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:12:27 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:12:33 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:12:39 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:12:45 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:12:51 CST 2014
Trace dumping is performing id=
Fri Oct 17 15:13:48 CST 2014
Thread 1 advanced to log sequence 109008 (LGWR switch)
  Current log# 4 seq# 109008 mem# 0: +DATADG/gpsdb/redo04.l

---Trace file (trc): see attachment



--Backups go to a Data Domain virtual tape library. The archived log backup script is below; it is called every 6 hours.
CONFIGURE RETENTION POLICY TO RECOVERY WINDOW OF 30 DAYS;
CONFIGURE CONTROLFILE AUTOBACKUP off;
run {   
   allocate channel t1 type 'SBT_TAPE';
   send channel t1 'NSR_ENV=(NSR_CLIENT=gps02,NSR_SERVER=bakserver,NSR_DATA_VOLUME_POOL=oraddboostb)';
   sql 'alter system archive log current';                                            
   backup filesperset 10
   format 'arch_%d_%I_%s_%p_%T'                                                               
   archivelog all not backed up 1 times skip inaccessible;
   DELETE NOPROMPT ARCHIVELOG UNTIL TIME "SYSDATE-7";
   release channel t1;   
}   

run                                                                                    
{                                                                                   
   allocate channel t1 type 'SBT_TAPE';
   send channel t1 'NSR_ENV=(NSR_CLIENT=gps02,NSR_SERVER=bakserver,NSR_DATA_VOLUME_POOL=oraddboostb)';
   backup format 'cntrl_%s_%p_%T' current controlfile;                                            
   backup spfile format 'spfile_%d_%I_%T_%t.ora';
   release channel t1;
}

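On MOS I found Bug 4541506 : RMAN: ORA-600 DOING "BACKUP AS COPY" W/ MAXPC, which looks very similar to my error, including the function call stack. Its patch is not available for 10.2.0.5.

Mr. Liu, could a different bug be causing this? If no patch can be applied to fix this ORA-600 error, is there some way to suppress or avoid it?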

zm_tree posted on 2014-10-17 15:39:48

I ran the archived log backup to DD (Data Domain) by hand. The failure log is as follows:

allocated channel: t1
channel t1: sid=485 instance=gpsdb2 devtype=SBT_TAPE
channel t1: NMDA Oracle v1.6.0
sent command to channel: t1

sent command to channel: t1

sql statement: alter system archive log current
released channel: t1
RMAN-00571: ===========================================================
RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============
RMAN-00571: ===========================================================
RMAN-03009: failure of sql command on default channel at 10/17/2014 14:54:10
RMAN-11003: failure during parse/execution of SQL statement: alter system archive log current
ORA-00600: internal error code, arguments: , , , , , [], [], []

Recovery Manager complete.

Maclean Liu (刘相兵) posted on 2014-10-17 15:45:36

Call stack:

*** SESSION ID:(494.573) 2014-10-17 15:07:28.399
Archivelog compression complete.
  Input: 18789888 bytes Output: 10505652 bytes
  Compression Performance: 44.09 percent or 4.47 bits per byte
*** 2014-10-17 15:07:28.402

Oracle Database 10g Enterprise Edition Release 10.2.0.5.0 - 64bit Production
kcrrcda=>kcrrulfh=>ksfqwat

Maclean Liu (刘相兵) posted on 2014-10-17 15:52:33

      SO: 0x535572c40, type: 6, owner: 0x55a3f2990, flag: INIT/-/-/0x00
      (FIB) flags=324 reference cnt=0 incno=177 seqno=1
       fname=+FLASHAREA_DG/gpsdb/archivelog/2014_10_17/thread_2_seq_50620.4435.861203245
       fno=0 lblksz=512 fsiz=36700
        (FOB) flags=522 fib=0x535572c40 incno=177 pending i/o cnt=0
         fname=+FLASHAREA_DG/gpsdb/archivelog/2014_10_17/thread_2_seq_50620.4435.861203245
         fno=0 lblksz=512 fsiz=36700
      ------------------------------


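It looks like a problem with the online redo logs is causing ARC0 to fail unexpectedly. Are you sure the system can currently switch logs normally?

zm_tree posted on 2014-10-17 15:59:07

alter system switch logfile just hangs on node 2.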


SQL> /

    GROUP#    THREAD#    MEMBERS STATUS
---------- ---------- ---------- --------------------------------
         1          1          1 INACTIVE
         2          1          1 INACTIVE
         3          1          1 ACTIVE
         4          1          1 CURRENT
         5          2          1 CURRENT
         6          2          1 INACTIVE
         7          2          1 INACTIVE
         8          2          1 INACTIVE

8 rows selected
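
The query behind the output above is not shown (only `/` was typed). A minimal sketch of a similar check follows, assuming it is run as a privileged user on either instance. The extra SEQUENCE# and ARCHIVED columns are an addition for illustration, not something the original poster reported; they would show whether any thread 2 group is still waiting for the archiver.

```sql
-- Sketch only: list online redo log groups per thread and whether
-- each one has been archived. V$LOG shows all threads in a RAC database.
SELECT thread#,
       group#,
       sequence#,
       members,
       archived,   -- 'NO' on a non-CURRENT group means ARCn has not finished it
       status
  FROM v$log
 ORDER BY thread#, group#;
```

If ARCn keeps failing, non-current groups of the affected thread would be expected to stay at ARCHIVED = 'NO', and log switches stall once no archived group is left to reuse.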

zm_tree posted on 2014-10-17 16:00:20

Node 1 can run alter system switch logfile normally.

Maclean Liu (刘相兵) posted on 2014-10-17 16:03:33

The online redo log may be corrupted, which would prevent node 2 from switching logs.

zm_tree posted on 2014-10-17 16:03:39

How should I handle this?

Maclean Liu (刘相兵) posted on 2014-10-17 16:06:23

http://www.askmaclean.com/archives/redo-curruption-%E3%80%90oracle%E6%95%B0%E6%8D%AE%E6%81%A2%E5%A4%8D%E3%80%91redo-log%E9%87%8D%E5%81%9A%E6%97%A5%E5%BF%97%E6%96%87%E4%BB%B6%E5%9D%8F%E5%9D%97corruption%E7%9A%84%E8%A7%A3%E5%86%B3-ora-16.html

zm_tree posted on 2014-10-17 16:23:37

SQL> alter system switch logfile;
The wait event this produces is 'switch logfile command'.
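
One way to confirm which sessions are stuck on that event is sketched below. It assumes access to the GV$ views from either node. The query is illustrative and was not part of the original post.

```sql
-- Sketch only: find sessions on any instance waiting on the
-- 'switch logfile command' event and how long they have waited.
SELECT inst_id,
       sid,
       event,
       state,
       seconds_in_wait
  FROM gv$session_wait
 WHERE event = 'switch logfile command'
 ORDER BY inst_id, sid;
```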

zm_tree posted on 2014-10-20 09:04:06

       
SQL> select archivelog_compression from v$database ;

ARCHIVEL
--------
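ENABLED ===> The error occurred because archived log compression had been enabled earlier. Release 10 does not support this feature. Once archived log compression was turned off, the problem was resolved. Thanks, Mr. Liu, for the valuable advice.
Summary: problems of this kind are usually related to the logs.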
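
The post does not show the command used to turn compression off. The sketch below assumes the undocumented 10g syntax `ALTER DATABASE ARCHIVELOG COMPRESS DISABLE`. Treat that statement as an assumption, confirm it with Oracle Support, and try it on a test system before running it in production.

```sql
-- Sketch only: check the setting, disable it, then check again.

-- 1. Current state (the same column queried in the post above).
SELECT archivelog_compression FROM v$database;

-- 2. ASSUMPTION: undocumented syntax, not confirmed by this thread.
ALTER DATABASE ARCHIVELOG COMPRESS DISABLE;

-- 3. Should now report DISABLED.
SELECT archivelog_compression FROM v$database;
```

After the change, a log switch on the affected instance is a quick way to see whether ARCn archives without raising ORA-00600 again.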