5#
Posted on 2012-8-11 21:39:15
- 2012-08-10 17:45:05.213: [ CSSD][1075632448]clssgmJoinGrock: global grock CRF- new client 0x2aaab004ada0 with con 0x8568, requested num -1, flags 0x4000e00
- 2012-08-10 17:45:05.213: [ CSSD][1075632448]clssgmJoinGrock: ignoring grock join for client not requiring fencing until group information has been received from the master; group name CRF-, member number -1, flags 0x4000e00
- 2012-08-10 17:45:05.213: [ CSSD][1075632448]clssgmDiscEndpcl: gipcDestroy 0x8568
- 2012-08-10 17:45:05.214: [ CSSD][1075632448]clssgmDeadProc: proc 0x2aaab002d570
- 2012-08-10 17:45:05.214: [ CSSD][1075632448]clssgmDestroyProc: cleaning up proc(0x2aaab002d570) con(0x8539) skgpid ospid 15244 with 0 clients, refcount 0
- 2012-08-10 17:45:05.214: [ CSSD][1075632448]clssgmDiscEndpcl: gipcDestroy 0x8539
- 2012-08-10 17:45:05.267: [ CSSD][1105119552]clssgmWaitOnEventValue: after CmInfo State val 3, eval 1 waited 0
- 2012-08-10 17:45:05.590: [ CSSD][1096227136]clssnmvDHBValidateNCopy: node 2, node-rac2, has a disk HB, but no network HB, DHB has rcfg 234386141, wrtcnt, 833242, LATS 6165464, lastSeqNo 833241, uniqueness 1344566603, timestamp 1344591904/24678294
- 2012-08-10 17:45:06.269: [ CSSD][1105119552]clssgmWaitOnEventValue: after CmInfo State val 3, eval 1 waited 0
- 2012-08-10 17:45:06.593: [ CSSD][1096227136]clssnmvDHBValidateNCopy: node 2, node-rac2, has a disk HB, but no network HB, DHB has rcfg 234386141, wrtcnt, 833243, LATS 6166464, lastSeqNo 833242, uniqueness 1344566603, timestamp 1344591905/24679304
- 2012-08-10 17:45:06.732: [ CSSD][1108273472]clssnmSendingThread: sending join msg to all nodes
- 2012-08-10 17:45:06.732: [ CSSD][1108273472]clssnmSendingThread: sent 4 join msgs to all nodes
- 2012-08-10 17:45:07.271: [ CSSD][1105119552]clssgmWaitOnEventValue: after CmInfo State val 3, eval 1 waited 0
- 2012-08-10 17:45:07.613: [ CSSD][1096227136]clssnmvDHBValidateNCopy: node 2, node-rac2, has a disk HB, but no network HB, DHB has rcfg 234386141, wrtcnt, 833244, LATS 6167484, lastSeqNo 833243, uniqueness 1344566603, timestamp 1344591906/24680304
- 2012-08-10 17:45:07.722: [ CSSD][1109850432]clssnmRcfgMgrThread: Local Join
- 2012-08-10 17:45:07.722: [ CSSD][1109850432]clssnmLocalJoinEvent: begin on node(1), waittime 193000
- 2012-08-10 17:45:07.722: [ CSSD][1109850432]clssnmLocalJoinEvent: set curtime (6167594) for my node
- 2012-08-10 17:45:07.722: [ CSSD][1109850432]clssnmLocalJoinEvent: scanning 32 nodes
- 2012-08-10 17:45:07.722: [ CSSD][1109850432]clssnmLocalJoinEvent: Node node-rac2, number 2, is in an existing cluster with disk state 3
- 2012-08-10 17:45:07.723: [ CSSD][1109850432]clssnmLocalJoinEvent: takeover aborted due to cluster member node found on disk
- 2012-08-10 17:45:08.124: [ CSSD][1075632448]clssscSelect: cookie accept request 0x2aaaac029ee0
- 2012-08-10 17:45:08.124: [ CSSD][1075632448]clssgmAllocProc: (0x2aaab0072100) allocated
- 2012-08-10 17:45:08.125: [ CSSD][1075632448]clssgmClientConnectMsg: properties of cmProc 0x2aaab0072100 - 1,2,3,4,5
- 2012-08-10 17:45:08.125: [ CSSD][1075632448]clssgmClientConnectMsg: Connect from con(0x85c4) proc(0x2aaab0072100) pid(15501) version 11:2:1:4, properties: 1,2,3,4,5
- 2012-08-10 17:45:08.125: [ CSSD][1075632448]clssgmClientConnectMsg: msg flags 0x0000
- 2012-08-10 17:45:08.127: [ CSSD][1075632448]clssscSelect: cookie accept request 0x2aaab0072100
- 2012-08-10 17:45:08.127: [ CSSD][1075632448]clssscevtypSHRCON: getting client with cmproc 0x2aaab0072100
- 2012-08-10 17:45:08.127: [ CSSD][1075632448]clssgmRegisterClient: proc(4/0x2aaab0072100), client(1/0x2aaab0074960)
- 2012-08-10 17:45:08.127: [ CSSD][1075632448]clssgmJoinGrock: global grock CRF- new client 0x2aaab0074960 with con 0x85f3, requested num -1, flags 0x4000e00
- 2012-08-10 17:45:08.128: [ CSSD][1075632448]clssgmJoinGrock: ignoring grock join for client not requiring fencing until group information has been received from the master; group name CRF-, member number -1, flags 0x4000e00
- 2012-08-10 17:45:08.128: [ CSSD][1075632448]clssgmDiscEndpcl: gipcDestroy 0x85f3
- 2012-08-10 17:45:08.129: [ CSSD][1075632448]clssgmDeadProc: proc 0x2aaab0072100
- 2012-08-10 17:45:08.129: [ CSSD][1075632448]clssgmDestroyProc: cleaning up proc(0x2aaab0072100) con(0x85c4) skgpid ospid 15501 with 0 clients, refcount 0
- 2012-08-10 17:45:08.129: [ CSSD][1075632448]clssgmDiscEndpcl: gipcDestroy 0x85c4
- 2012-08-10 17:45:08.273: [ CSSD][1105119552]clssgmWaitOnEventValue: after CmInfo State val 3, eval 1 waited 0
- 2012-08-10 17:45:08.616: [ CSSD][1096227136]clssnmvDHBValidateNCopy: node 2, node-rac2, has a disk HB, but no network HB, DHB has rcfg 234386141, wrtcnt, 833245, LATS 6168494, lastSeqNo 833244, uniqueness 1344566603, timestamp 1344591907/24681304
- 2012-08-10 17:45:09.276: [ CSSD][1105119552]clssgmWaitOnEventValue: after CmInfo State val 3, eval 1 waited 0
- 2012-08-10 17:45:09.618: [ CSSD][1096227136]clssnmvDHBValidateNCopy: node 2, node-rac2, has a disk HB, but no network HB, DHB has rcfg 234386141, wrtcnt, 833246, LATS 6169494, lastSeqNo 833245, uniqueness 1344566603, timestamp 1344591908/24682304
- 2012-08-10 17:45:09.655: [ CSSD][1075632448]clssgmExecuteClientRequest: MAINT recvd from proc 2 (0x18e0da70)
- 2012-08-10 17:45:09.655: [ CSSD][1075632448]clssgmShutDown: Received abortive shutdown request from client.
- 2012-08-10 17:45:09.655: [ CSSD][1075632448]###################################
- 2012-08-10 17:45:09.655: [ CSSD][1075632448]clssscExit: CSSD aborting from thread GMClientListener
- 2012-08-10 17:45:09.655: [ CSSD][1075632448]###################################
- 2012-08-10 17:45:09.655: [ CSSD][1075632448](:CSSSC00012:)clssscExit: A fatal error occurred and the CSS daemon is terminating abnormally
- 2012-08-10 17:45:09.656: [ CSSD][1075632448]clssgmUpdateEventValue: CmInfo State val 0, changes 1
- 2012-08-10 17:45:09.727: [ CSSD][1106696512]clssnmPollingThread: state(1) clusterState(0) exit
- 2012-08-10 17:45:09.727: [ CSSD][1106696512]clssscExit: abort already set 0
- 2012-08-10 17:45:10.278: [ CSSD][1105119552]clssgmWaitOnEventValue: after CmInfo State val 3, eval 0 waited 380
- 2012-08-10 17:45:10.278: [ CSSD][1105119552]clssgmPeerListener: terminating at incarn(0)
- 2012-08-10 17:45:10.620: [ CSSD][1096227136]clssnmvDHBValidateNCopy: node 2, node-rac2, has a disk HB, but no network HB, DHB has rcfg 234386141, wrtcnt, 833247, LATS 6170494, lastSeqNo 833246, uniqueness 1344566603, timestamp 1344591909/24683304
- 2012-08-10 17:45:10.742: [ CSSD][1108273472]clssnmSendingThread: sending join msg to all nodes
- 2012-08-10 17:45:10.742: [ CSSD][1108273472]clssnmSendingThread: sent 4 join msgs to all nodes
- 2012-08-10 17:45:11.622: [ CSSD][1096227136]clssnmvDHBValidateNCopy: node 2, node-rac2, has a disk HB, but no network HB, DHB has rcfg 234386141, wrtcnt, 833248, LATS 6171494, lastSeqNo 833247, uniqueness 1344566603, timestamp 1344591910/24684314
- 2012-08-10 17:45:12.625: [ CSSD][1096227136]clssnmvDHBValidateNCopy: node 2, node-rac2, has a disk HB, but no network HB, DHB has rcfg 234386141, wrtcnt, 833249, LATS 6172504, lastSeqNo 833248, uniqueness 1344566603, timestamp 1344591911/24685314
- 2012-08-10 17:45:13.629: [ CSSD][1096227136]clssnmvDHBValidateNCopy: node 2, node-rac2, has a disk HB, but no network HB, DHB has rcfg 234386141, wrtcnt, 833250, LATS 6173504, lastSeqNo 833249, uniqueness 1344566603, timestamp 1344591912/24686314
- 2012-08-10 17:45:14.632: [ CSSD][1096227136]clssnmvDHBValidateNCopy: node 2, node-rac2, has a disk HB, but no network HB, DHB has rcfg 234386141, wrtcnt, 833251, LATS 6174504, lastSeqNo 833250, uniqueness 1344566603, timestamp 1344591913/24687314
- 2012-08-10 17:45:14.750: [ CSSD][1108273472]clssnmSendingThread: sending join msg to all nodes
- 2012-08-10 17:45:14.750: [ CSSD][1108273472]clssnmSendingThread: sent 4 join msgs to all nodes
Have you checked the heartbeat network?
cat /etc/hosts
2012-08-09 17:55:08.455: [ CLSINET][1093237056] # 0 Interface 'eth1',ip='10.0.0.1',mac='00-e0-81-c8-da-da',mask='255.255.255.128',net='10.0.0.0',use='cluster_interconnect'
Try a ping first. If the ping succeeds, try restarting the problem node.
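If it helps to see how often CSSD reported the missing network heartbeat, here is a minimal Python sketch that counts the "has a disk HB, but no network HB" messages per node. It only matches the message text as it appears in the log pasted above; the function name and the command-line usage are my own, and the log path is whatever you pass in, so treat it as a rough aid rather than an Oracle tool.

```python
import re
import sys
from collections import Counter

# Matches the CSSD message exactly as it appears in the log excerpt above.
# Other Clusterware versions may word it differently (assumption).
NO_NET_HB = re.compile(
    r"clssnmvDHBValidateNCopy: node (\d+), ([^,]+), "
    r"has a disk HB, but no network HB"
)


def count_missing_network_hb(lines):
    """Count 'disk HB but no network HB' messages per (node number, node name)."""
    counts = Counter()
    for line in lines:
        match = NO_NET_HB.search(line)
        if match:
            node_number = int(match.group(1))
            node_name = match.group(2).strip()
            counts[(node_number, node_name)] += 1
    return counts


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python count_hb.py <path to ocssd.log>
    with open(sys.argv[1], errors="replace") as log_file:
        for (number, name), hits in count_missing_network_hb(log_file).items():
            print(f"node {number} ({name}): {hits} messages without network HB")
```

A steadily growing count for node-rac2 while its disk heartbeat keeps advancing would be consistent with a problem on the private interconnect rather than on the voting disk.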