xia 发表于 2014-9-4 22:44:55

ORA-00600 [6749]的问题,附trc

本帖最后由 xia 于 2014-9-4 22:50 编辑

RAC 环境

Oracle Database 11g Enterprise Edition Release 11.2.0.3.0 - 64bit Production
PL/SQL Release 11.2.0.3.0 - Production
CORE    11.2.0.3.0      Production
TNS for IBM/AIX RISC System/6000: Version 11.2.0.3.0 - Production
NLSRTL Version 11.2.0.3.0 - Production


下午在告警日志中看到如下信息:

Thread 1 advanced to log sequence 16646 (LGWR switch)
  Current log# 102 seq# 16646 mem# 0: +ORADATA/portaldb/redo02.log
Thu Sep 04 16:48:52 2014
Errors in file /oracle/diag/rdbms/portaldb/portaldb1/trace/portaldb1_ora_40698472.trc  (incident=642694):
ORA-00600: internal error code, arguments: , , , , [], [], [], [], [], [], [], []
Incident details in: /oracle/diag/rdbms/portaldb/portaldb1/incident/incdir_642694/portaldb1_ora_40698472_i642694.trc
Thu Sep 04 16:48:56 2014
Dumping diagnostic data in directory=, requested by (instance=1, osid=40698472), summary=.
Use ADRCI or Support Workbench to package the incident.
See Note 411.1 at My Oracle Support for error and packaging details.
Thu Sep 04 16:48:58 2014
Sweep : completed
Sweep : completed
Thu Sep 04 16:53:14 2014
Thread 1 advanced to log sequence 16647 (LGWR switch)
  Current log# 103 seq# 16647 mem# 0: +ORADATA/portaldb/redo103.log


找到了portaldb1_ora_40698472.trc (大约1.9G) 和portaldb1_ora_40698472_i642694.trc(11M左右) 两个文件 ;

oracle@ptdb1:/home/oracle/awrrpt_pack$du -sm portaldb1_ora_40698472.trc
1830.01 portaldb1_ora_40698472.trc               

oracle@ptdb1:/home/oracle/awrrpt_pack$du -sm portaldb1_ora_40698472_i642694.trc
11.46   portaldb1_ora_40698472_i642694.trc


portaldb1_ora_40698472.trc这个文件是去不来了我把前面重要的几行复制下来;

UMP REDO
Opcodes 11.*
DBAs (file#, block#):
(9, 174929) .
SCNs: scn: 0x0000.00000000 thru scn: 0xffff.ffffffff
**NOTE: Only Dumping Redo less then 12 hours**
Times: 09/04/2014 04:41:29 thru eternity
Initial buffer sizes: read 1024K, overflow 832K, change 805K
Thread 1 low checkpoint scn: 0x0001.4c4c6be0
Thread 2 low checkpoint scn: 0x0001.4c4cb42f
SCN Start Scan Point: scn: 0x0001.4c4cb42f (5575062575)
Initial buffer sizes: read 1024K, overflow 832K, change 805K
Initial buffer sizes: read 1024K, overflow 832K, change 805K
INCARNATION:
  START: scn: 0x0000.00000001 (1) Timestamp:  06/01/2013 17:58:46
  END: scn: 0xffff.ffffffff
descrip:"Thread 0001, Seq# 0000016485, SCN 0x00014c4c6be0-0x00014c4f53f2"
descrip:"Thread 0002, Seq# 0000018304, SCN 0x00014c4cb42f-0x00014c4ea321"

*** 2014-09-04 16:41:30.442
*Error - Unable to open log for Thread 2 at SCN: scn: 0x0001.4c4ea321 (5575189281)
END OF DUMP REDO
Dumping current redo log in thread 1
Initial buffer sizes: read 1024K, overflow 832K, change 805K

DUMP OF REDO FROM FILE '+ORADATA/portaldb/redo02.log'
Opcodes 11.*
DBAs (file#, block#):
(9, 174929) .
RBAs: 0x000000.00000000.0000 thru 0xffffffff.ffffffff.ffff
SCNs: scn: 0x0001.4c4cb42f (5575062575) thru scn: 0xffff.ffffffff
Times: 09/04/2014 04:41:29 thru eternity
FILE HEADER:
        Compatibility Vsn = 186646528=0xb200000
        Db ID=663718102=0x278f88d6, Db Name='PORTALDB'
        Activation ID=663749334=0x279002d6
        Control Seq=1800459=0x1b790b, File size=1024000=0xfa000
        File Number=102, Blksiz=512, File Type=2 LOG
descrip:"Thread 0001, Seq# 0000016641, SCN 0x0001502f1901-0xffffffffffff"
thread: 1 nab: 0xffffffff seq: 0x00004101 hws: 0x1 eot: 1 dis: 0
resetlogs count: 0x30b21356 scn: 0x0000.00000001 (1)
prev resetlogs count: 0x0 scn: 0x0000.00000000
Low  scn: 0x0001.502f1901 (5640231169) 09/04/2014 16:39:02
Next scn: 0xffff.ffffffff 01/01/1988 00:00:00
Enabled scn: 0x0000.00000001 (1) 06/01/2013 17:58:50
Thread closed scn: 0x0001.502f1901 (5640231169) 09/04/2014 16:39:02


20:05:59 sys@PORTALDB> column SEGMENT_NAME format a60
20:07:10 sys@PORTALDB> column owner format a10
20:07:38 sys@PORTALDB> column TABLESPACE_NAME format a10
20:07:49 sys@PORTALDB> SELECT tablespace_name,segment_type,owner,segment_name FROM dba_extents  wHERE file_id =9   and 174929 between block_id AND block_id + blocks - 1;

TABLESPACE SEGMENT_TYPE       OWNER      SEGMENT_NAME
---------- ------------------ ---------- ------------------------------------------------------------
NEANDS     TABLE              NEANDS3    M_PRODUCT_ALIAS           //该表为条码表

Elapsed: 00:00:04.09


20:08:07 sys@PORTALDB>  select table_name, iot_name from all_tables where table_name='M_PRODUCT_ALIAS';

TABLE_NAME           IOT_NAME
-------------------- ------------------------------
M_PRODUCT_ALIAS

21:59:18 sys@PORTALDB> select do.owner,do.object_name, do.object_type,sysind.flags
22:10:33   2       from dba_objects do, sys.ind$ sysind
22:10:33   3       where do.object_id = sysind.obj#
22:10:33   4       and bitand(sysind.flags,4096)=4096;

no rows selected

其中portaldb1_ora_40698472_i642694.trc文件中的这个语句是不是造成这个报错的原因呢?已经把trc文件 和这个表给开发那边了,让开发那边也看看,

*** 2014-09-04 16:48:52.213
dbkedDefDump(): Starting incident default dumps (flags=0x2, level=3, mask=0x0)
----- Current SQL Statement for this session (sql_id=f40kjffsrvhvp) -----
UPDATE M_PRODUCT_ALIAS A SET A.IS_TOWMS = 'N' WHERE A.M_PRODUCT_ID = :B1

我还摸不着头绪,把故障现象发到这里,希望大家帮我看看,谢谢

wisdomone1 发表于 2014-9-7 13:41:05

请参考,http://blog.itpub.net/9240380/viewspace-1265450/

Maclean Liu(刘相兵 发表于 2014-9-7 23:32:59

7705591        Corruption with self-referenced row in MSSM tablespace. Wrong Results / OERI / ORA-8102
UPDATE M_PRODUCT_ALIAS A SET A.IS_TOWMS = 'N' WHERE A.M_PRODUCT_ID = :B1

Dump continued from file: /oracle/diag/rdbms/portaldb/portaldb1/trace/portaldb1_ora_40698472.trc
ORA-00600: internal error code, arguments: , , , , [], [], [], [], [], [], [], []


      pin: 'kduwh18: kdu_array_flush' waiters: 1
      addr: 0x7000001ff813088 obj: 161302 cls: DATA bscn: 0x1.50311147
          buffer tsn: 15 rdba: 0x0242ab51 (9/174929)
          scn: 0x0001.50311147 seq: 0x01 flg: 0x04 tail: 0x11470601
          frmt: 0x02 chkval: 0x2018 type: 0x06=trans data


ock header dump:  0x0242ab51
Object id on Block? Y
seg/obj: 0x27616  csc: 0x01.503110d9  itc: 8  flg: -  typ: 1 - DATA
     fsl: 0  fnx: 0x0 ver: 0x01

0x158:pri        offs=0x85c


tab 0, row 163, @0x85c
tl: 2 fb: --HDFL-- lb: 0x1

Maclean Liu(刘相兵 发表于 2014-9-7 23:37:21

odm finding :


  Format: ORA-600

VERSIONS:           
  versions 7.3 to 8.1

DESCRIPTION:        

  This internal occurs during the row change for update or delete.
  Some reported problems are consistent read issue, some bugs are related to  
  row corruption on the block.

FUNCTIONALITY:      
  ROW DELETE

IMPACT:            
  PROCESS FAILURE
  ROW CORRUPTION


, , , , [], [], [], [], [], [], [], []


3 代表 row no longer exists 行不再存在
37923665 代表dba
C 是 行的slot number

由于11.2.0.3 上似乎没有对应的Bug,考虑对该表做ANALYZE TABLE <table_name> VALIDATE STRUCTURE CASCADE; 并flush buffer_cache; 如果错误仍存在,考虑重建该表
页: [1]
查看完整版本: ORA-00600 [6749]的问题,附trc