RAC one instance cann't startup CRS-0215 & ORA-29702: error occurred in Cluster Group Service operation

简介:
环境 : 
Oracle 10.2.0.4
CentOS 5.x 64bit
2 node RAC, raw 设备
通过光纤连HP存储.

存储出现故障并修复后, 其中一台(node2)的instance无法启动, 报错如下
1. $ORACLE_BASE/admin/$SID/bdump/alter_$sid.log
Error: KGXGN polling error (15)
Mon Oct 15 11:39:11 2012
Errors in file /opt/oracle/admin/skycac/bdump/skycac2_lmon_5763.trc:
ORA-29702: error occurred in Cluster Group Service operation
LMON: terminating instance due to error 29702
Mon Oct 15 11:39:12 2012
System state dump is made for local instance
Mon Oct 15 11:39:12 2012
Errors in file /opt/oracle/admin/skycac/bdump/skycac2_diag_5759.trc:
ORA-29702: error occurred in Cluster Group Service operation
Mon Oct 15 11:39:12 2012
Trace dumping is performing id=[cdmp_20121015113912]
Mon Oct 15 11:39:12 2012
Instance terminated by LMON, pid = 5763


2. $CRS_HOME/log/node2
2012-10-15 14:30:26.364
[crsd(14402)]CRS-1205:Auto-start failed for the CRS resource . Details in db-192-168-xxx-xxx.

3. crs_stat -t结果中除了instance_node2 OFFLINE 其他都正常.

解决办法 :
1. cd $ORACLE_HOME/rdbms/lib
2. rename the original library (if exists)
mv libskgxp10.so libskgxp10.so.old
3. Relink to configure UDP for IPC
make -f ins_rdbms.mk rac_on 
make -f ins_rdbms.mk ipc_udp 
make -f ins_rdbms.mk ioracle
4. Check whether the library exists
ls -l $ORACLE_HOME/lib/libskgxp10.so
5. start instance
srvctl start instance -d $dbname -i $instance_name

【参考】
目录
相关文章
|
7月前
rac Cluster Startup
https://docs.oracle.com/cd/E11882_01/rac.112/e41959/intro.htm#i1058057
39 0
|
负载均衡 Oracle 关系型数据库
Vmware 下Oracle RAC搬家引起CRS-1006/CRS-0215/CRS-0233
   最近虚拟机下的Oracle 10g RAC搬家,搬家完毕之后,Oracle 集群resource之VIP无法正常启动,收到了CRS-0233: Resource or relatives are currently involved with another operation 错误提示。
1184 0
|
4月前
|
运维 Oracle 前端开发
Oracle 11g RAC集群日常运维命令总结
Oracle 11g RAC集群日常运维命令总结
111 2
|
4月前
|
Oracle 关系型数据库
分布式锁设计问题之Oracle RAC保证多个节点写入内存Page的一致性如何解决
分布式锁设计问题之Oracle RAC保证多个节点写入内存Page的一致性如何解决
|
5月前
|
存储 负载均衡 Oracle
|
5月前
|
存储 Oracle 关系型数据库