- 最后登录
- 2015-3-6
- 在线时间
- 80 小时
- 威望
- 75
- 金钱
- 557
- 注册时间
- 2012-4-11
- 阅读权限
- 50
- 帖子
- 74
- 精华
- 0
- 积分
- 75
- UID
- 338
|
1#
发表于 2013-9-11 11:56:27
|
查看: 7026 |
回复: 5
环境介绍:
AIX 6100-06-05-1115
数据库:
Oracle 11.2.0.2 RAC+ASM 2 nodes
问题现象:(asm alert log)- ORA-27063: number of bytes read/written is incorrect
- IBM AIX RISC System/6000 Error: 110: Media surface error
- Additional information: -1
- Additional information: 1048576
- WARNING: Read Failed. group:2 disk:2 AU:23579 offset:0 size:1048576
- Errors in file /home/grid/asm/diag/asm/+asm/+ASM2/trace/+ASM2_ora_3867494.trc:
- ORA-27063: number of bytes read/written is incorrect
- IBM AIX RISC System/6000 Error: 110: Media surface error
- Additional information: -1
- Additional information: 1048576
- WARNING: Read Failed. group:2 disk:2 AU:23580 offset:0 size:1048576
- WARNING: failed to read mirror side 1 of virtual extent 2219 logical extent 0 of file 278 in group [2.813357336] from disk DATADG_0002 allocation unit 23580 reason error; if possible,will try another mirror side
- WARNING: failed to read mirror side 1 of virtual extent 2216 logical extent 0 of file 278 in group [2.813357336] from disk DATADG_0002 allocation unit 23579 reason error; if possible,will try another mirror side
复制代码 trace信息:- *** 2013-09-11 09:47:59.441
- *** SESSION ID:(123.53) 2013-09-11 09:47:59.441
- *** CLIENT ID:() 2013-09-11 09:47:59.441
- *** SERVICE NAME:() 2013-09-11 09:47:59.441
- *** MODULE NAME:(dbverify@zsydjy2 (TNS V1-V3)) 2013-09-11 09:47:59.441
- *** ACTION NAME:() 2013-09-11 09:47:59.441
-
- ORA-27063: number of bytes read/written is incorrect
- IBM AIX RISC System/6000 Error: 110: Media surface error
- Additional information: -1
- Additional information: 1048576
- WARNING: Read Failed. group:2 disk:2 AU:23579 offset:0 size:1048576
- [color=Red]path:/dev/rhdiskpower5[/color]
- incarnation:0xea9a29f3 asynchronous result:'I/O error'
- subsys:System iop:0x110c7d9a0 bufp:0x1108d9000 osderr:0x0 osderr1:0x0
- ORA-27063: number of bytes read/written is incorrect
- IBM AIX RISC System/6000 Error: 110: Media surface error
- Additional information: -1
- Additional information: 1048576
- WARNING: Read Failed. group:2 disk:2 AU:23580 offset:0 size:1048576
- [color=Red]path:/dev/rhdiskpower5[/color]
- incarnation:0xea9a29f3 asynchronous result:'I/O error'
- subsys:System iop:0x110c7d740 bufp:0x110cf9000 osderr:0x0 osderr1:0x0
- WARNING: failed to read mirror side 1 of virtual extent 2219 logical extent 0 of file 278 in group [2.813357336] from disk DATADG_0002 allocation unit 23
- 580 reason error; if possible,will try another mirror side
- WARNING: failed to read mirror side 1 of virtual extent 2216 logical extent 0 of file 278 in group [2.813357336] from disk DATADG_0002 allocation unit 23
- 579 reason error; if possible,will try another mirror side
复制代码 存储上已查看信息,并没有出现错误。
通过日志提示AU:23579和AU:23580对应及ASM磁盘分布的信息:- SQL> select group_kffxp,number_kffxp from x$kffxp where au_kffxp in (23579,23580);
- GROUP_KFFXP NUMBER_KFFXP
- ----------- ------------
- 2 278
- 2 278
- 2 278
- 2 278
- 2 278
- 2 278
- SQL> select name from v$asm_alias where FILE_NUMBER=278;
- NAME
- --------------------------------------------------------------------------------
- USERS.278.818266419
- QL> select GROUP_KFFXP, DISK_KFFXP, AU_KFFXP from x$kffxp
- 2 where number_kffxp =
- 3 (select file_number
- 4 from v$asm_alias
- 5 where name = 'USERS.278.818266419') and AU_KFFXP in (23579,23580);
- GROUP_KFFXP DISK_KFFXP AU_KFFXP
- ----------- ---------- ----------
- 2 0 23579
- 2 0 23580
- 2 1 23579
- 2 1 23580
- 2 2 23579
- 2 2 23580
复制代码 可以看出,这两个AU同时分布在磁盘组的三个磁盘上:- SQL> select path,DISK_NUMBER from v$asm_disk where GROUP_NUMBER=2;
- PATH DISK_NUMBER
- ---------------------------------------- -----------
- /dev/rhdiskpower3 0
- /dev/rhdiskpower4 1
- /dev/rhdiskpower5 2
复制代码 测试该数据文件中包涵数据对象的可用性:- SQL> select file# from v$datafile where name like '%users.278%';
- FILE#
- ----------
- 14
- SQL> col SEGMENT_NAME for a40
- SQL> col owner for a30
- SQL> set lines 200 pages 100
- SQL> select owner, segment_name, segment_type from dba_extents where file_id=14 and rownum < 20;
- OWNER SEGMENT_NAME SEGMENT_TYPE
- ------------------------------ ---------------------------------------- ------------------
- STCXLINK JY_LOG TABLE
- STCXLINK JY_LOG TABLE
- STCXLINK JY_LOG TABLE
- STCXLINK2 JY_LOG TABLE
- STCXLINK2 JY_LOG TABLE
- XLINK2 JY_LOG TABLE
- XLINK2 JY_LOG TABLE
- XLINK2 JY_LOG TABLE
- XLINK2 JY_LOG TABLE
- STCXLINK PK_JY_LOG INDEX
- XLINK2 PK_JY_LOG INDEX
- XLINK2 PK_JY_LOG INDEX
- XLINK2 IDX_JY_LOG_01 INDEX
- STCXLINK2 PK_JY_LOG INDEX
- STCXLINK2 IDX_JY_LOG_01 INDEX
- STCXLINK2 IDX_JY_LOG_01 INDEX
- STCXLINK2 IDX_JY_LOG_01 INDEX
- SQL> select count(*) from STCXLINK.JY_LOG;
- COUNT(*)
- ----------
- 4649782
复制代码 备注:1号节点系统、ASM alert、DB alert都没有上述错误信息,数据库可以正常使用
通过MOS上找到了一些比较有价值的信息(1289905.1)。
这种情况是否可以通过替换ASM磁盘组磁盘来临时解决,停库升级的时间比较难申请。
|
|