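Host: HP-UX essaop1 B.11.31 U ia64 1945507590 unlimited-user license
Database: Oracle Database 11g Enterprise Edition Release 11.2.0.3.0 - 64bit Production
The database listener on this host keeps restarting automatically, at intervals of no more than 5 minutes, and the business is briefly affected each time. After the operating system's NIC APA (Auto Port Aggregation) service was restarted, the frequent restarts began again three months later.
On the xx.xx.89.3_aop database, the listener keeps restarting. A search on the Oracle Support site showed that the symptoms closely match a known bug.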
vi /oracle/grid/11.2.0/log/essaop2/agent/crsd/orarootagent_root/orarootagent_root.log
2015-08-13 09:42:09.500: [ AGFW][10] {2:56751:2} Agent received the message: AGENT_HB[Engine] ID 12293:2614441
2015-08-13 09:42:12.618: [ default][271023]ICMP Ping from xxx.xxx.89.3 to xx.xx.89.62
2015-08-13 09:42:12.638: [ default][271023]clsicmp_pingdecode recvd other process's packet
2015-08-13 09:42:12.638: [ora.net1.network][271023] {0:2:9230} [check] NetworkAgent::checkLink returned false
2015-08-13 09:42:12.640: [ AGFW][10] {0:2:9230} ora.net1.network essaop2 1 state changed from: ONLINE to: OFFLINE
2015-08-13 09:42:12.640: [ AGFW][10] {0:2:9230} Switching online monitor to offline one
2015-08-13 09:42:12.641: [ AGFW][10] {0:2:9230} Started implicit monitor for [ora.net1.network essaop2 1] interval=60000 delay=60000
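The key evidence in the excerpt above is the ora.net1.network resource going from ONLINE to OFFLINE right after the failed link check. To count how often this happens across a whole orarootagent_root.log, the lines can be filtered with a small script. The following is only a sketch: the regular expression is inferred from the single state-change line shown above, and the function name `offline_transitions` is a hypothetical helper, not part of any Oracle tool.

```python
import re
from datetime import datetime

# Pattern inferred from the one "state changed" line in the excerpt;
# other Grid Infrastructure versions may format this line differently.
STATE_CHANGE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3}):.*?"
    r"(?P<resource>\S+) (?P<node>\S+) (?P<inst>\d+) "
    r"state changed from: (?P<old>\w+) to: (?P<new>\w+)"
)


def offline_transitions(lines, resource="ora.net1.network"):
    """Return (timestamp, node) for every ONLINE -> OFFLINE change of `resource`."""
    hits = []
    for line in lines:
        m = STATE_CHANGE.search(line)
        if not m:
            continue
        if (m["resource"], m["old"], m["new"]) == (resource, "ONLINE", "OFFLINE"):
            ts = datetime.strptime(m["ts"], "%Y-%m-%d %H:%M:%S.%f")
            hits.append((ts, m["node"]))
    return hits


# Two lines copied from the log excerpt above.
sample = [
    "2015-08-13 09:42:12.640: [ AGFW][10] {0:2:9230} ora.net1.network essaop2 1 "
    "state changed from: ONLINE to: OFFLINE",
    "2015-08-13 09:42:12.640: [ AGFW][10] {0:2:9230} Switching online monitor to offline one",
]

for ts, node in offline_transitions(sample):
    print(ts, node)
```

To scan the real file, pass an open file object instead of `sample`, for example `offline_transitions(open(path, errors="replace"))`, where `path` points to the log opened with vi above.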
MetaLink Bug 16039587.
On HP-UX, an operating system patch is required:
Cause
The issue was investigated in Bug 16039587, the cause is HP-UX bug, basically the contention of address memory range lock on kernel memory causes poll(2) timeout and affects orarootagent process.
Solution
Apply OS kernel patch PHKL_42850.
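Before and after the change window it is worth confirming whether PHKL_42850 is actually listed on the host. The sketch below assumes the installed-patch listing has already been captured as text, for example from `swlist -l patch` on HP-UX. The helper name `patch_listed` and the sample lines are made up for illustration, and the real column layout of the listing may differ.

```python
import re


def patch_listed(swlist_output, patch_id="PHKL_42850"):
    """Return True if `patch_id` appears as a whole identifier in the listing text."""
    # Lookarounds keep PHKL_42850 from matching inside e.g. PHKL_428501.
    pattern = re.compile(
        r"(?<![A-Za-z0-9_])" + re.escape(patch_id) + r"(?![A-Za-z0-9_])"
    )
    return any(pattern.search(line) for line in swlist_output.splitlines())


# Hypothetical listing text, not real output from the host in this article.
before = "  PHKL_00001  1.0  some other kernel patch\n"
after = before + "  PHKL_42850  1.0  illustrative entry only\n"

print(patch_listed(before))
print(patch_listed(after))
```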
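The relevant patch was located, taken through change-ticket approval, and applied. The major risk of intermittent listener outages on the AOP database has been resolved.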
STARTED_AT UPTIME
--------------------------------------------------------------------------------
18-SEP-2015 23:35:29 2 day(s), 10 hour(s), 2 minute(s), 54 seconds   (database startup time)
SQL> !uptime
9:38am up 2 days, 11:15, 1 user, load average: 0.03, 0.03, 0.03   (host boot time)
SQL> !lsnrctl status   (listener start time)
LSNRCTL for HPUX: Version 11.2.0.3.0 - Production on 21-SEP-2015 09:50:45
Copyright (c) 1991, 2011, Oracle. All rights reserved.
Connecting to (ADDRESS=(PROTOCOL=tcp)(HOST=)(PORT=1521))
STATUS of the LISTENER
------------------------
Alias LISTENER
Version TNSLSNR for HPUX: Version 11.2.0.3.0 - Production
Start Date 18-SEP-2015 23:34:01
Uptime 2 days 10 hr. 16 min. 44 sec
Trace Level off
Security ON: Local OS Authentication
SNMP OFF
Listener Parameter File /oracle/grid/11.2.0/network/admin/listener.ora
Listener Log File /oracle/grid/base/diag/tnslsnr/essaop2/listener/alert/log.xml
Listening Endpoints Summary...
(DESCRIPTION=(ADDRESS=(PROTOCOL=ipc)(KEY=LISTENER)))
(DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=132.35.89.3)(PORT=1521)))
(DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=132.35.89.4)(PORT=1521)))
Services Summary...
Service "+ASM" has 1 instance(s).
Instance "+ASM2", status READY, has 1 handler(s) for this service...
Service "essaop" has 1 instance(s).
Instance "essaop2", status READY, has 1 handler(s) for this service...
Service "essaopXDB" has 1 instance(s).
Instance "essaop2", status READY, has 1 handler(s) for this service...
The command completed successfully
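The point of the output above is that the listener has stayed up since the patch was applied instead of restarting every few minutes. As a cross-check, the uptime can be recomputed from the timestamps printed in the listing. This is a minimal sketch: `parse_oracle_ts` and `format_uptime` are hypothetical helper names, and the month table assumes English month abbreviations as shown in the output.

```python
from datetime import datetime

MONTHS = {
    "JAN": 1, "FEB": 2, "MAR": 3, "APR": 4, "MAY": 5, "JUN": 6,
    "JUL": 7, "AUG": 8, "SEP": 9, "OCT": 10, "NOV": 11, "DEC": 12,
}


def parse_oracle_ts(text):
    """Parse a timestamp such as '18-SEP-2015 23:34:01'."""
    date_part, time_part = text.split()
    day, mon, year = date_part.split("-")
    hour, minute, second = (int(x) for x in time_part.split(":"))
    return datetime(int(year), MONTHS[mon.upper()], int(day), hour, minute, second)


def format_uptime(delta):
    """Render a timedelta in the same style as the lsnrctl Uptime line."""
    hours, rem = divmod(delta.seconds, 3600)
    minutes, seconds = divmod(rem, 60)
    return f"{delta.days} days {hours} hr. {minutes} min. {seconds} sec"


# Values copied from the output above.
status_time = parse_oracle_ts("21-SEP-2015 09:50:45")     # lsnrctl banner time
listener_start = parse_oracle_ts("18-SEP-2015 23:34:01")  # Start Date
db_start = parse_oracle_ts("18-SEP-2015 23:35:29")        # STARTED_AT

print(format_uptime(status_time - listener_start))  # 2 days 10 hr. 16 min. 44 sec
print((db_start - listener_start).total_seconds())  # 88.0
```

The recomputed value agrees with the Uptime line reported by lsnrctl, and the database instance started 88 seconds after the listener, so both have been running continuously since 18-SEP-2015.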