Environment:
Operating system: AIX 5300-09
Cluster software: CRS 10.2.0.1
Database: Oracle 10.2.0.1
Symptom:
After root.sh completed, running vipca on the node produced the following error:
[root@aix209 bin]#./vipca
PRKR-1062 : Failed to find configuration for node aix209
PRKR-1062 : Failed to find configuration for node aix209
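Before changing anything, it can help to confirm that the clusterware knows the node at all. The following is a minimal sketch, assuming `olsnodes` from the CRS home is on the PATH; `node_is_registered` is a hypothetical helper name, not an Oracle tool. If the node is listed, the missing piece is more likely the node applications configuration than cluster membership, though this check alone does not prove that.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the node name given as $1 appears as the
# first field on any line of stdin (the format printed by `olsnodes -n`,
# for example "aix209 1").
node_is_registered() {
    awk -v node="$1" '$1 == node { found = 1 } END { exit found ? 0 : 1 }'
}

# Usage on a cluster node (assumes olsnodes is on PATH):
#   olsnodes -n | node_is_registered aix209 || echo "aix209 is not registered"
```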
Resolution:
1. Remove the CRS configuration for this node, then rerun root.sh
[root@aix209 install]#./rootdelete.sh
Shutting down Oracle Cluster Ready Services (CRS):
Stopping resources.
Error while stopping resources. Possible cause: CRSD is down.
Stopping CSSD.
Shutting down CSS daemon.
Shutdown request successfully issued.
Shutdown has begun. The daemons should exit soon.
Checking to see if Oracle CRS stack is down...
Oracle CRS stack is not running.
Oracle CRS stack is down now.
Removing script for Oracle Cluster Ready services
Updating ocr file for downgrade
Cleaning up SCR settings in '/etc/oracle/scls_scr'
[root@aix209 install]#/u01/crs_1/root.sh
WARNING: directory '/u01' is not owned by root
Checking to see if Oracle CRS stack is already configured
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/u01' is not owned by root
clscfg: EXISTING configuration version 3 detected.
clscfg: version 3 is 10G Release 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: aix209 aix209-priv aix209
node 2: aix210 aix210-priv aix210
clscfg: Arguments check out successfully.
NO KEYS WERE WRITTEN. Supply -force parameter to override.
-force is destructive and will destroy any previous cluster
configuration.
Oracle Cluster Registry for cluster has already been initialized
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
CSS is active on these nodes.
aix209
aix210
CSS is active on all nodes.
Waiting for the Oracle CRSD and EVMD to start
Oracle CRS stack installed and running under init(1M)
Running vipca(silent) for configuring nodeapps
IP address "aix209-vip" has already been used. Enter an unused IP address.
[root@aix209 install]#
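The last message of root.sh reads as though vipca found the address behind aix209-vip already in use. A quick sketch for checking what the name maps to and whether it answers on the network, assuming the names are kept in /etc/hosts (the note does not say how they are resolved); `hosts_lookup` is a hypothetical helper:

```shell
#!/bin/sh
# Hypothetical helper: print the first address mapped to host name $1 in a
# hosts-format file ($2, default /etc/hosts). Comments are ignored.
hosts_lookup() {
    awk -v name="$1" '
        { sub(/#.*/, "") }    # strip comments; awk re-splits the fields
        { for (i = 2; i <= NF; i++) if ($i == name) { print $1; exit } }
    ' "${2:-/etc/hosts}"
}

# Usage:
#   ip=$(hosts_lookup aix209-vip)
#   [ -n "$ip" ] && ping -c 2 "$ip"
```

If the address replies to ping while no VIP resource is online, something else on the network (or a leftover interface alias) may be holding it.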
2. Rerun vipca
vipca now starts.
3. However, it reported an error while running.
After ignoring that error, the following problem appeared:
[root@aix209 /]#crs_stat -t
Name Type Target State Host
------------------------------------------------------------
ora.aix209.gsd application ONLINE ONLINE aix209
ora.aix209.ons application ONLINE ONLINE aix209
ora.aix210.gsd application ONLINE ONLINE aix210
ora.aix210.ons application ONLINE ONLINE aix210
ora.aix210.vip application ONLINE ONLINE aix210
The ora.aix209.vip resource is missing from the CRS resources.
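Spotting the missing VIP by eye is easy with two nodes, less so with more. A minimal sketch that does the same comparison on `crs_stat -t` output, assuming resource names are printed in full as in the listing above; `nodes_missing_vip` is a hypothetical helper:

```shell
#!/bin/sh
# Hypothetical helper: read `crs_stat -t` output on stdin and print every node
# that has a gsd or ons resource but no vip resource.
nodes_missing_vip() {
    awk '
        /^ora\./ {
            n = split($1, part, ".")          # ora.<node>.<type>
            if (n != 3) next
            if (part[3] == "gsd" || part[3] == "ons") seen[part[2]] = 1
            if (part[3] == "vip") vip[part[2]] = 1
        }
        END { for (node in seen) if (!(node in vip)) print node }
    ' | sort
}

# Usage:
#   crs_stat -t | nodes_missing_vip
```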
4. Create the resource manually
[oracle@aix209 bin]$ srvctl add nodeapps -n aix209 -o /u01/app/oracle/product/10.2.0/db_1 -A 192.168.8.143/255.255.255.0
CRS-0210: Could not find resource 'ora.aix209.vip'.
[oracle@aix209 bin]$./crs_profile -create ora.aix209.vip -t application -a ora.aix209.vip.src
[oracle@aix209 bin]$ls -l /u01/crs_1/crs/script/
total 0
[oracle@aix209 bin]$ls -l /u01/crs_1/crs/public/
total 16
-rw-rw---- 1 oracle oinstall 3396 Aug 3 2004 action_scr.scr
-rw-r--r-- 1 oracle oinstall 752 Apr 8 16:26 ora.aix209.vip.cap
[oracle@aix209 bin]$cd /u01/crs_1/crs/public/
[oracle@aix209 public]$cp action_scr.scr ../script/ora.aix209.vip.src
The ora.aix209.vip resource can now be registered:
[root@aix209 /]#crs_register ora.aix209.vip
[root@aix209 /]#srvctl add nodeapps -n aix209 -o /u01/app/oracle/product/10.2.0/db_1 -A 192.168.8.143/255.255.255.0
[root@aix209 /]#crs_stat -t
Name Type Target State Host
------------------------------------------------------------
ora.aix209.gsd application ONLINE ONLINE aix209
ora.aix209.ons application ONLINE ONLINE aix209
ora.aix209.vip application OFFLINE OFFLINE
ora.aix210.gsd application ONLINE ONLINE aix210
ora.aix210.ons application ONLINE ONLINE aix210
ora.aix210.vip application ONLINE ONLINE aix210
But it fails to start:
[root@aix209 /]#crs_start -f ora.aix209.vip
Attempting to start `ora.aix209.vip` on member `aix209`
`ora.aix209.vip` on member `aix209` has experienced an unrecoverable failure.
Human intervention required to resume its availability.
CRS-0215: Could not start resource 'ora.aix209.vip'.
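The note does not say why the start failed. One thing worth ruling out is the action script itself: in step 4 it was copied from action_scr.scr, which the ls output shows with mode -rw-rw----, so the copy may not be executable, and it is a generic file rather than a VIP-specific script. A small sketch for that check; `check_action_script` is a hypothetical helper:

```shell
#!/bin/sh
# Hypothetical helper: report whether the file at $1 could be run as an action
# script at all, i.e. it exists, is non-empty and is executable.
check_action_script() {
    if [ ! -f "$1" ]; then
        echo "missing"
    elif [ ! -s "$1" ]; then
        echo "empty"
    elif [ ! -x "$1" ]; then
        echo "not-executable"
    else
        echo "ok"
    fi
}

# Usage with the path from step 4:
#   check_action_script /u01/crs_1/crs/script/ora.aix209.vip.src
```

An "ok" result only means the file can be executed; it says nothing about whether the script actually brings up a VIP.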
5. Final resolution:
Wipe the OCR and voting disks again, then rerun root.sh (on both nodes)
[root@aix210 /]#dd if=/dev/zero of=/dev/rrac_ocr bs=8192 count=2560
2560+0 records in
2560+0 records out
[root@aix210 /]#dd if=/dev/zero of=/dev/rrac_vote bs=8192 count=2560
2560+0 records in
2560+0 records out
[root@aix210 /]#/u01/crs_1/root.sh
[root@aix209 /]#crs_stat -t
Name Type Target State Host
------------------------------------------------------------
ora.aix209.vip application OFFLINE OFFLINE
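For reference, each dd command above writes 2560 blocks of 8192 bytes of zeros to the start of the raw device. The arithmetic, as a small sketch (`dd_bytes` and `dd_mib` are hypothetical helper names):

```shell
#!/bin/sh
# Bytes written by `dd bs=<block size> count=<blocks>`.
dd_bytes() {
    echo $(( $1 * $2 ))
}

# The same figure in MiB (integer division).
dd_mib() {
    echo $(( $1 * $2 / 1048576 ))
}

dd_bytes 8192 2560   # prints 20971520
dd_mib 8192 2560     # prints 20
```

Whether 20 MiB covers the whole OCR or voting device depends on the device size, which is not stated here.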
Rerun vipca:
[root@aix209 /]#crs_stat -t
Name Type Target State Host
------------------------------------------------------------
ora.aix209.gsd application ONLINE ONLINE aix209
ora.aix209.ons application ONLINE ONLINE aix209
ora.aix209.vip application ONLINE ONLINE aix209
ora.aix210.gsd application ONLINE ONLINE aix210
ora.aix210.ons application ONLINE ONLINE aix210
ora.aix210.vip application ONLINE ONLINE aix210
[root@aix209 /]#
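The final check above can also be scripted. A minimal sketch, assuming the five-column `crs_stat -t` layout shown in this note; `all_resources_online` is a hypothetical helper:

```shell
#!/bin/sh
# Hypothetical helper: read `crs_stat -t` output on stdin. Succeeds only if at
# least one ora.* resource is listed and every one has both Target and State
# ONLINE. Resources that are not fully ONLINE are printed.
all_resources_online() {
    awk '
        /^ora\./ {
            total++
            if ($3 != "ONLINE" || $4 != "ONLINE") { print $1; bad++ }
        }
        END { exit (total > 0 && bad == 0) ? 0 : 1 }
    '
}

# Usage:
#   crs_stat -t | all_resources_online && echo "all resources ONLINE"
```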
With that, the problem is resolved.