rocolex posted on 2013-12-16 16:10:21

RAC ocssd log: "has a disk HB, but no network HB"

OS: RHEL 6.4 x64
Oracle version: 11.2.0.4
Two-node RAC, 11.2.0.4
GI_HOME /u01/app/11.2.0/grid_1
ORACLE_HOME /u01/app/oracle/product/11.2.0/dbhome_1
Network interfaces
dbrac1 172.18.22.41 eth0
dbrac2 172.18.22.42 eth0
dbrac1-vip 172.18.22.43
dbrac2-vip 172.18.22.44
dbrac-scan 172.18.22.45
dbrac1-priv 10.10.10.10 eth1
dbrac2-priv 10.10.10.20 eth1

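Timeline
2013-11-20: Installed in the evening. Ran several database shutdown abort tests on the two nodes to see how clients, connections, and in-flight queries behaved.
2013-12-05: Found that GI on the second node, dbrac2, had been in trouble since shortly after 23:00 on the night of 11-20. Restarted node 2; afterwards GI, the DB, and crsctl stat res -t all showed a normal state.
2013-12-09: Re-cabled the private NICs (they were originally connected directly between the two machines). Stopped the cluster on node 2, connected the private NICs to a switch, then started it again.
2013-12-13: Resized the redo logs.

Question: what caused the problem on node 2 on 2013-11-20? That day's ocssd.log contains many "has a disk HB, but no network HB, DHB has rcfg" entries.
Since the restart on 12-05, "has a disk HB, but no network HB, DHB has rcfg" and "gipchaLowerProcessNode: no valid interfaces found to node" show up once during startup. They appear once every time crs is restarted, but the services all report a normal state.

Hoping someone can take a look.

The log is a bit large.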

rocolex posted on 2013-12-20 09:02:22

Could anyone take a look?

Liu Maclean (刘相兵) posted on 2013-12-20 12:10:14

$ grep -i network ocssd.log
2013-12-09 19:10:58.065: gipchaInternalReadGpnp: No network info configured in GPNP, using defaults, ret gipcretFail (1)
2013-12-09 19:11:13.699: [    CSSD]clssnmvDHBValidateNcopy: node 1, dbrac1, has a disk HB, but no network HB, DHB has rcfg 280446903, wrtcnt, 4914112, LATS 352755874, lastSeqNo 0, uniqueness 1384954638, timestamp 1386587473/1631722024
2013-12-09 19:11:13.699: [    CSSD]clssnmvDHBValidateNcopy: node 2, dbrac2, has a disk HB, but no network HB, DHB has rcfg 280446902, wrtcnt, 1093734, LATS 352755874, lastSeqNo 0, uniqueness 1386587451, timestamp 1386587405/352688084
2013-12-09 19:11:13.701: [    CSSD]clssnmvDHBValidateNcopy: node 1, dbrac1, has a disk HB, but no network HB, DHB has rcfg 280446903, wrtcnt, 4914113, LATS 352755874, lastSeqNo 0, uniqueness 1384954638, timestamp 1386587473/1631722344
2013-12-09 19:11:14.996: [    CSSD]clssnmvDHBValidateNcopy: node 1, dbrac1, has a disk HB, but no network HB, DHB has rcfg 280446903, wrtcnt, 4914115, LATS 352757164, lastSeqNo 4914112, uniqueness 1384954638, timestamp 1386587474/1631723024
2013-12-09 19:11:15.013: [    CSSD]clssnmvDHBValidateNcopy: node 1, dbrac1, has a disk HB, but no network HB, DHB has rcfg 280446903, wrtcnt, 4914116, LATS 352757184, lastSeqNo 4914113, uniqueness 1384954638, timestamp 1386587474/1631723344
2013-12-09 19:11:15.997: [    CSSD]clssnmvDHBValidateNcopy: node 1, dbrac1, has a disk HB, but no network HB, DHB has rcfg 280446903, wrtcnt, 4914118, LATS 352758174, lastSeqNo 4914115, uniqueness 1384954638, timestamp 1386587475/1631724024
2013-12-09 19:11:16.015: [    CSSD]clssnmvDHBValidateNcopy: node 1, dbrac1, has a disk HB, but no network HB, DHB has rcfg 280446903, wrtcnt, 4914119, LATS 352758184, lastSeqNo 4914116, uniqueness 1384954638, timestamp 1386587475/1631724344
2013-12-09 19:11:16.998: [    CSSD]clssnmvDHBValidateNcopy: node 1, dbrac1, has a disk HB, but no network HB, DHB has rcfg 280446903, wrtcnt, 4914121, LATS 352759174, lastSeqNo 4914118, uniqueness 1384954638, timestamp 1386587476/1631725024
2013-12-09 19:11:17.017: [    CSSD]clssnmvDHBValidateNcopy: node 1, dbrac1, has a disk HB, but no network HB, DHB has rcfg 280446903, wrtcnt, 4914122, LATS 352759184, lastSeqNo 4914119, uniqueness 1384954638, timestamp 1386587476/1631725344
2013-12-09 19:11:17.922: [ default]Could not use cellaffinity.ora (cannot find affinity map at '/etc/oracle/cell/network-config/cellaffinity.ora')
2013-12-09 19:11:17.922: [ default]Dumping configuration file '/etc/oracle/cell/network-config/cellaffinity.ora':
2013-12-09 19:11:17.922: [ default]Could not open '/etc/oracle/cell/network-config/cellaffinity.ora'
2013-12-09 19:11:17.922: [ default]    cellaffinity.ora status: cannot find affinity map at '/etc/oracle/cell/network-config/cellaffinity.ora' (see trace file for details)

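Liu Maclean (刘相兵) posted on 2013-12-20 12:27:16

See post #3: the oldest "has a disk HB, but no network HB" entry that can be found so far is from December 9.

It was also reported for both node 2 and node 1, so I strongly suspect a problem with the switch or the NICs.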
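
For anyone who wants to repeat this check on their own ocssd.log, here is a minimal Python sketch of the same idea: count the "has a disk HB, but no network HB" entries per node and note the earliest timestamp for each. The regular expression is only inferred from the clssnmvDHBValidateNcopy lines quoted in post #3, and the function name `count_missing_network_hb` is made up for this illustration, so treat it as an assumption rather than a documented log format.

```python
import re
from collections import Counter

# Pattern inferred from the clssnmvDHBValidateNcopy lines quoted in this
# thread; other Grid Infrastructure versions may format the line differently.
DHB_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3}): .*"
    r"node (?P<num>\d+), (?P<name>[^,]+), has a disk HB, but no network HB"
)


def count_missing_network_hb(lines):
    """Return (counts per node name, first timestamp seen per node name)."""
    counts = Counter()
    first_seen = {}
    for line in lines:
        match = DHB_RE.match(line)
        if match is None:
            continue  # not a "disk HB but no network HB" entry
        name = match.group("name")
        counts[name] += 1
        # Keep only the first timestamp encountered for each node.
        first_seen.setdefault(name, match.group("ts"))
    return counts, first_seen


# Sample lines copied from the grep output in post #3.
SAMPLE = [
    "2013-12-09 19:10:58.065: gipchaInternalReadGpnp: No network info configured in GPNP, using defaults, ret gipcretFail (1)",
    "2013-12-09 19:11:13.699: [    CSSD]clssnmvDHBValidateNcopy: node 1, dbrac1, has a disk HB, but no network HB, DHB has rcfg 280446903, wrtcnt, 4914112, LATS 352755874, lastSeqNo 0, uniqueness 1384954638, timestamp 1386587473/1631722024",
    "2013-12-09 19:11:13.699: [    CSSD]clssnmvDHBValidateNcopy: node 2, dbrac2, has a disk HB, but no network HB, DHB has rcfg 280446902, wrtcnt, 1093734, LATS 352755874, lastSeqNo 0, uniqueness 1386587451, timestamp 1386587405/352688084",
    "2013-12-09 19:11:13.701: [    CSSD]clssnmvDHBValidateNcopy: node 1, dbrac1, has a disk HB, but no network HB, DHB has rcfg 280446903, wrtcnt, 4914113, LATS 352755874, lastSeqNo 0, uniqueness 1384954638, timestamp 1386587473/1631722344",
]

if __name__ == "__main__":
    counts, first_seen = count_missing_network_hb(SAMPLE)
    for name in sorted(counts):
        print(f"{name}: {counts[name]} entries, first at {first_seen[name]}")
```

To run it against a real file, pass an open file object instead of `SAMPLE`, for example `count_missing_network_hb(open("ocssd.log"))`. A count that is nonzero for both nodes matches the observation above that the message is not limited to one side.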

rocolex posted on 2013-12-23 14:28:39

Liu Maclean (刘相兵) posted on 2013-12-20 12:27
See post #3: the oldest "has a disk HB, but no network HB" entry that can be found so far is from December 9.

It was also reported for both node 2 and node 1, strongly ...

Thanks, Liu!

cxh2014 posted on 2017-8-17 16:59:43

Liu Maclean (刘相兵) posted on 2013-12-20 12:10
$ grep -i network ocssd.log
2013-12-09 19:10:58.065: gipchaInte ...

ipfrag_high_thresh   ipfrag_low_thresh  
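
The reply above only names the two parameters. Assuming they refer to the Linux IPv4 fragment-reassembly thresholds exposed as `net.ipv4.ipfrag_high_thresh` and `net.ipv4.ipfrag_low_thresh` under `/proc/sys/net/ipv4`, a small Python sketch for reading the current values on a node could look like this. The helper name `read_ipfrag_thresholds` is hypothetical, and no target values are suggested here because the thread does not give any.

```python
from pathlib import Path

# The two parameter names mentioned in the reply above.
PARAMS = ("ipfrag_high_thresh", "ipfrag_low_thresh")


def read_ipfrag_thresholds(base_dir="/proc/sys/net/ipv4"):
    """Read the thresholds from base_dir; a value is None if unreadable."""
    values = {}
    for name in PARAMS:
        path = Path(base_dir) / name
        try:
            values[name] = int(path.read_text().strip())
        except (OSError, ValueError):
            # File missing, not readable, or not a plain integer.
            values[name] = None
    return values


if __name__ == "__main__":
    for name, value in read_ipfrag_thresholds().items():
        print(f"net.ipv4.{name} = {value}")
```

Running it on both dbrac1 and dbrac2 would show whether the two nodes use the same settings, which is a reasonable first thing to compare before changing anything.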