在节点二上执行root.sh,两个节点都crash了。
虚拟机vmware server 2.0.2操作系统: oracle linux 2.6.32-300.10.1.el5uek x86_64
数据库版本:oracle 11.2.0.4 采用asm方式
问题描述:
(1)节点1上执行runcluvfy.sh stage -pre crsinst -n mynod1,mynode2 -fixup -verbose 执行成功,且节点1上执行root.sh是成功的
但是在节点2上执行root.sh时,执行到如下地方之后,节点1和节点2虚拟机都crash了。
Successfully created CRS resources for cluster daemon and ASM
Checking if initial configuration has been performed
Starting CSS in exclusive mode
CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node racnode1, number 1, and is terminating
An active cluster was found during exclusive startup, restarting to join the cluster
。
我看了一下节点2上最后的日志,如下所示:
2015-01-27 23:50:00: Starting CSS in exclusive mode
2015-01-27 23:50:00: Executing cmd: /u01/product/11.2/grid/bin/crsctl start resource ora.cssd -init -env CSSD_MODE=-X
2015-01-27 23:54:11: Command output:
> CRS-2672: Attempting to start 'ora.mdnsd' on 'mynode2'
> CRS-2676: Start of 'ora.mdnsd' on 'mynode2' succeeded
> CRS-2672: Attempting to start 'ora.gpnpd' on 'mynode2'
> CRS-2676: Start of 'ora.gpnpd' on 'mynode2' succeeded
> CRS-2672: Attempting to start 'ora.cssdmonitor' on 'mynode2'
> CRS-2672: Attempting to start 'ora.gipcd' on 'mynode2'
> CRS-2676: Start of 'ora.gipcd' on 'mynode2' succeeded
> CRS-2676: Start of 'ora.cssdmonitor' on 'mynode2' succeeded
> CRS-2672: Attempting to start 'ora.cssd' on 'mynode2'
> CRS-2672: Attempting to start 'ora.diskmon' on 'mynode2'
> CRS-2676: Start of 'ora.diskmon' on 'mynode2' succeeded
> CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node mynode1, number 1, and is terminati
ng
> CRS-2674: Start of 'ora.cssd' on 'mynode2' failed
> CRS-2679: Attempting to clean 'ora.cssd' on 'mynode2'
> CRS-2681: Clean of 'ora.cssd' on 'mynode2' succeeded
> CRS-2673: Attempting to stop 'ora.gipcd' on 'mynode2'
> CRS-2677: Stop of 'ora.gipcd' on 'mynode2' succeeded
> CRS-2673: Attempting to stop 'ora.cssdmonitor' on 'mynode2'
> CRS-2677: Stop of 'ora.cssdmonitor' on 'mynode2' succeeded
> CRS-2673: Attempting to stop 'ora.gpnpd' on 'mynode2'
> CRS-2677: Stop of 'ora.gpnpd' on 'mynode2' succeeded
> CRS-2673: Attempting to stop 'ora.mdnsd' on 'mynode2'
> CRS-2677: Stop of 'ora.mdnsd' on 'mynode2' succeeded
> CRS-4000: Command Start failed, or completed with errors.
>End Command output
2015-01-27 23:54:11: CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node mynode1, number 1
, and is terminating
2015-01-27 23:54:11: Executing cmd: /u01/product/11.2/grid/bin/crsctl stop resource ora.cssd -init
2015-01-27 23:54:11: Executing cmd: /u01/product/11.2/grid/bin/crsctl stop resource ora.cssdmonitor -init
2015-01-27 23:54:11: Executing cmd: /u01/product/11.2/grid/bin/crsctl stop resource ora.gipcd -f -init
2015-01-27 23:54:11: Executing cmd: /u01/product/11.2/grid/bin/crsctl stop resource ora.gpnpd -f -init
2015-01-27 23:54:11: Executing cmd: /u01/product/11.2/grid/bin/crsctl stop resource ora.mdnsd -init
2015-01-27 23:54:12: Executing cmd: /u01/product/11.2/grid/bin/crsctl check resource ora.diskmon -init
2015-01-27 23:54:12: Executing cmd: /u01/product/11.2/grid/bin/crsctl status resource ora.diskmon -init
2015-01-27 23:54:12: Checking the status of ora.diskmon
2015-01-27 23:54:17: Executing cmd: /u01/product/11.2/grid/bin/crsctl status resource ora.diskmon -init
2015-01-27 23:54:17: Checking the status of ora.diskmon
2015-01-27 23:54:22: An active cluster was found during exclusive startup, restarting to join the cluster
2015-01-27 23:54:22: Running as user grid: /u01/product/11.2/grid/bin/cluutil -ckpt -oraclebase /u01/product/grid -writeckpt -name R
OOTCRS_BOOTCFG -state SUCCESS
2015-01-27 23:54:22: s_run_as_user2: Running /bin/su grid -c ' /u01/product/11.2/grid/bin/cluutil -ckpt -oraclebase /u01/product/gri
d -writeckpt -name ROOTCRS_BOOTCFG -state SUCCESS '
2015-01-27 23:54:22: Removing file /tmp/file9WQ5xn
2015-01-27 23:54:22: Successfully removed file: /tmp/file9WQ5xn
2015-01-27 23:54:22: /bin/su successfully executed
(2)今天在节点1上想卸载掉昨天出问题的内容重新进行安装,执行./deinstall,等待了很长时间,输出如下日志后,节点1再次宕机:
$ ./deinstall
Checking for required files and bootstrapping ...
Please wait ...
Location of logs /tmp/deinstall2015-01-28_04-00-54PM/logs/
############ ORACLE DEINSTALL & DECONFIG TOOL START ############
######################### CHECK OPERATION START #########################
## Install check configuration ##
刚才执行了 ./deinstall之后,又重新安装了一遍,问题依旧,节点2上执行到如下地方之后,两个节点同时失去反应(两个虚拟机自己关闭了):
# /u01/product/11.2/grid/root.sh
Performing root user operation for Oracle 11g
The following environment variables are set as:
ORACLE_OWNER= grid
ORACLE_HOME= /u01/product/11.2/grid
Enter the full pathname of the local bin directory: :
The contents of "dbhome" have not changed. No need to overwrite.
The contents of "oraenv" have not changed. No need to overwrite.
The contents of "coraenv" have not changed. No need to overwrite.
Creating /etc/oratab file...
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root script.
Now product-specific root actions will be performed.
Using configuration parameter file: /u01/product/11.2/grid/crs/install/crsconfig_params
Creating trace directory
Installing Trace File Analyzer
OLR initialization - successful
Adding Clusterware entries to inittab
CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node mynode1, number 1, and is terminating
An active cluster was found during exclusive startup, restarting to join the cluster 节点2上的日志如下:
# tail -f rootcrs_mynode2.log
2015-01-29 16:17:32: Succeeded in writing the checkpoint:'ROOTCRS_BOOTCFG' with status:START
2015-01-29 16:17:32: CkptFile: /u01/product/grid/Clusterware/ckptGridHA_mynode2.xml
2015-01-29 16:17:32: Sync the checkpoint file '/u01/product/grid/Clusterware/ckptGridHA_mynode2.xml'
2015-01-29 16:17:32: Sync '/u01/product/grid/Clusterware/ckptGridHA_mynode2.xml' to the physical disk
2015-01-29 16:17:32: Executing cmd: /u01/product/11.2/grid/bin/crsctl check css
2015-01-29 16:17:33: Command output:
> CRS-4530: Communications failure contacting Cluster Synchronization Services daemon
>End Command output
2015-01-29 16:17:33: Starting CSS in exclusive mode
2015-01-29 16:17:33: Executing cmd: /u01/product/11.2/grid/bin/crsctl start resource ora.cssd -init -env CSSD_MODE=-X
2015-01-29 16:20:55: Command output:
> CRS-2672: Attempting to start 'ora.mdnsd' on 'mynode2'
> CRS-2676: Start of 'ora.mdnsd' on 'mynode2' succeeded
> CRS-2672: Attempting to start 'ora.gpnpd' on 'mynode2'
> CRS-2676: Start of 'ora.gpnpd' on 'mynode2' succeeded
> CRS-2672: Attempting to start 'ora.cssdmonitor' on 'mynode2'
> CRS-2672: Attempting to start 'ora.gipcd' on 'mynode2'
> CRS-2676: Start of 'ora.gipcd' on 'mynode2' succeeded
> CRS-2676: Start of 'ora.cssdmonitor' on 'mynode2' succeeded
> CRS-2672: Attempting to start 'ora.cssd' on 'mynode2'
> CRS-2672: Attempting to start 'ora.diskmon' on 'mynode2'
> CRS-2676: Start of 'ora.diskmon' on 'mynode2' succeeded
> CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node mynode1, number 1, and is terminating
> CRS-2674: Start of 'ora.cssd' on 'mynode2' failed
> CRS-2679: Attempting to clean 'ora.cssd' on 'mynode2'
> CRS-2681: Clean of 'ora.cssd' on 'mynode2' succeeded
> CRS-2673: Attempting to stop 'ora.gipcd' on 'mynode2'
> CRS-2677: Stop of 'ora.gipcd' on 'mynode2' succeeded
> CRS-4000: Command Start failed, or completed with errors.
>End Command output
2015-01-29 16:20:55: CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node mynode1, number 1, and is terminating
2015-01-29 16:20:55: Executing cmd: /u01/product/11.2/grid/bin/crsctl stop resource ora.cssd -init
2015-01-29 16:23:52: Command output:
> CRS-2673: Attempting to stop 'ora.cssd' on 'mynode2'
> CRS-2677: Stop of 'ora.cssd' on 'mynode2' succeeded
>End Command output
2015-01-29 16:23:52: Executing cmd: /u01/product/11.2/grid/bin/crsctl stop resource ora.cssdmonitor -init
2015-01-29 16:23:52: Resource 'ora.cssdmonitor -init' was not online but was successfully stopped
2015-01-29 16:23:52: Executing cmd: /u01/product/11.2/grid/bin/crsctl stop resource ora.gipcd -f -init
2015-01-29 16:23:54: Command output:
> CRS-2673: Attempting to stop 'ora.gipcd' on 'mynode2'
> CRS-2677: Stop of 'ora.gipcd' on 'mynode2' succeeded
>End Command output
2015-01-29 16:23:54: Executing cmd: /u01/product/11.2/grid/bin/crsctl stop resource ora.gpnpd -f -init
2015-01-29 16:23:56: Command output:
> CRS-2673: Attempting to stop 'ora.gpnpd' on 'mynode2'
> CRS-2677: Stop of 'ora.gpnpd' on 'mynode2' succeeded
>End Command output
2015-01-29 16:23:56: Executing cmd: /u01/product/11.2/grid/bin/crsctl stop resource ora.mdnsd -init
2015-01-29 16:24:01: Command output:
> CRS-2673: Attempting to stop 'ora.mdnsd' on 'mynode2'
> CRS-2677: Stop of 'ora.mdnsd' on 'mynode2' succeeded
>End Command output
2015-01-29 16:24:01: Executing cmd: /u01/product/11.2/grid/bin/crsctl check resource ora.diskmon -init
2015-01-29 16:24:01: Executing cmd: /u01/product/11.2/grid/bin/crsctl status resource ora.diskmon -init
2015-01-29 16:24:01: Checking the status of ora.diskmon
2015-01-29 16:24:06: Executing cmd: /u01/product/11.2/grid/bin/crsctl status resource ora.diskmon -init
2015-01-29 16:24:07: Checking the status of ora.diskmon
2015-01-29 16:24:12: An active cluster was found during exclusive startup, restarting to join the cluster
2015-01-29 16:24:12: Running as user grid: /u01/product/11.2/grid/bin/cluutil -ckpt -oraclebase /u01/product/grid -writeckpt -name ROOTCRS_BOOTCFG -state SUCCESS
2015-01-29 16:24:12: s_run_as_user2: Running /bin/su grid -c ' /u01/product/11.2/grid/bin/cluutil -ckpt -oraclebase /u01/product/grid -writeckpt -name ROOTCRS_BOOTCFG -state SUCCESS '
2015-01-29 16:24:12: Removing file /tmp/fileF9pAwT
2015-01-29 16:24:12: Successfully removed file: /tmp/fileF9pAwT
2015-01-29 16:24:12: /bin/su successfully executed
2015-01-29 16:24:12: Succeeded in writing the checkpoint:'ROOTCRS_BOOTCFG' with status:SUCCESS
2015-01-29 16:24:12: CkptFile: /u01/product/grid/Clusterware/ckptGridHA_mynode2.xml
2015-01-29 16:24:12: Sync the checkpoint file '/u01/product/grid/Clusterware/ckptGridHA_mynode2.xml'
2015-01-29 16:24:12: Sync '/u01/product/grid/Clusterware/ckptGridHA_mynode2.xml' to the physical disk
2015-01-29 16:24:12: Starting
2015-01-29 16:24:12: Executing cmd: /u01/product/11.2/grid/bin/crsctl start resource ora.mdnsd -init
2015-01-29 16:24:14: Command output:
> CRS-2672: Attempting to start 'ora.mdnsd' on 'mynode2'
> CRS-2676: Start of 'ora.mdnsd' on 'mynode2' succeeded
>End Command output
2015-01-29 16:24:14: Start of resource "ora.mdnsd" Succeeded
2015-01-29 16:24:14: Executing cmd: /u01/product/11.2/grid/bin/crsctl start resource ora.gpnpd -init
2015-01-29 16:24:15: Command output:
> CRS-2672: Attempting to start 'ora.gpnpd' on 'mynode2'
> CRS-2676: Start of 'ora.gpnpd' on 'mynode2' succeeded
>End Command output
2015-01-29 16:24:15: Start of resource "ora.gpnpd" Succeeded
2015-01-29 16:24:15: Executing cmd: /u01/product/11.2/grid/bin/gpnptool lfind
2015-01-29 16:24:15: Command output:
> Success. Local gpnpd found.
>End Command output
2015-01-29 16:24:15: Successfully started requested Oracle stack daemons
2015-01-29 16:24:15: Starting CSS in clustered mode
2015-01-29 16:24:15: Executing cmd: /u01/product/11.2/grid/bin/crsctl start resource ora.cssd -init
2015-01-29 16:25:39: Command output:
> CRS-2672: Attempting to start 'ora.cssdmonitor' on 'mynode2'
> CRS-2672: Attempting to start 'ora.gipcd' on 'mynode2'
> CRS-2676: Start of 'ora.cssdmonitor' on 'mynode2' succeeded
> CRS-2676: Start of 'ora.gipcd' on 'mynode2' succeeded
> CRS-2672: Attempting to start 'ora.cssd' on 'mynode2'
> CRS-2672: Attempting to start 'ora.diskmon' on 'mynode2'
> CRS-2676: Start of 'ora.diskmon' on 'mynode2' succeeded
> CRS-2676: Start of 'ora.cssd' on 'mynode2' succeeded
>End Command output
2015-01-29 16:25:39: Start of resource "ora.cssd" Succeeded
2015-01-29 16:25:39: Executing cmd: /u01/product/11.2/grid/bin/crsctl start resource ora.ctssd -init -env USR_ORA_ENV=CTSS_REBOOT=TRUE
2015-01-29 16:26:01: Command output:
> CRS-2672: Attempting to start 'ora.ctssd' on 'mynode2'
> CRS-2676: Start of 'ora.ctssd' on 'mynode2' succeeded
>End Command output
2015-01-29 16:26:01: Start of resource "ora.ctssd" Succeeded
2015-01-29 16:26:01: Executing cmd: /u01/product/11.2/grid/bin/crsctl start resource ora.crf -init
2015-01-29 16:26:32: Command output:
> CRS-2672: Attempting to start 'ora.crf' on 'mynode2'
> CRS-2676: Start of 'ora.crf' on 'mynode2' succeeded
>End Command output
2015-01-29 16:26:32: Start of resource "ora.crf" Succeeded
2015-01-29 16:26:32: Executing /u01/product/11.2/grid/bin/crsctl modify res ora.cluster_interconnect.haip -attr "ENABLED=1" -init
2015-01-29 16:26:32: Executing cmd: /u01/product/11.2/grid/bin/crsctl modify res ora.cluster_interconnect.haip -attr "ENABLED=1" -init
2015-01-29 16:26:41: Executing cmd: /u01/product/11.2/grid/bin/crsctl start resource ora.cluster_interconnect.haip -init
2015-01-29 16:27:10: Command output:
> CRS-2672: Attempting to start 'ora.cluster_interconnect.haip' on 'mynode2'
> CRS-2676: Start of 'ora.cluster_interconnect.haip' on 'mynode2' succeeded
>End Command output
2015-01-29 16:27:10: Start of resource "ora.cluster_interconnect.haip" Succeeded
2015-01-29 16:27:10: Executing cmd: /u01/product/11.2/grid/bin/crsctl start resource ora.asm -init 描述下你的物理机 物理机是thinkpad 笔记本;上面安装了vmware server ,创建了两个虚拟机(mynode1,mynode2)。
每个虚拟机分配的配置是:1个cpu ,3G内存,4G swap, 30G磁盘空间,两个虚拟网卡(eth0,eth1),其中eth0是用的桥接方式,eth1是NAT方式。虚拟了6个共享磁盘,每个5G。 我的意思是 物理机 什么配置, 我没有问 虚拟机的事情 型号:thinkpad x201
CPU:i5 m540 2.53GHZ
memory: 8G 虚拟机种类不符合要求, 机器硬件内存不符合要求
页:
[1]