private 网络不稳定引起的Evicting instance 2 from cluster


环境:双节点RAC, oracle 11.2.3
客户电话RAC实例2异常,现场查看日志:实例2:Fri Aug 25 09:45:16 2017Received an instance abort message from instance 1Received an instance abort message from instance 1
Please check instance 1 alert and LMON trace files for detail.Please check instance 1 alert and LMON trace files for detail.
LMS0 (ospid: 24510820): terminating the instance due to error 481Fri Aug 25 09:45:16 2017System state dump requested by (instance=2, osid=24510820 (LMS0)), summary=[abnormal instance termination].System State dumped to trace file /oracle/11.2.0/diag/rdbms/ins/ins2/trace/ins2_diag_21561818.trcInstance terminated by LMS0, pid = 24510820实例1Fri Aug 25 09:44:25 2017IPC Send timeout detected. Sender: ospid 35783054 [oracle@db1 (LMS1)]Receiver: inst 2 binc 2073329022 ospid 24183072IPC Send timeout to 2.2 inc 28 for msg type 65518 from opid 14Fri Aug 25 09:44:27 2017Communications reconfiguration: instance_number 2Fri Aug 25 09:45:16 2017Detected an inconsistent instance membership by instance 1Evicting instance 2 from clusterWaiting for instances to leave: 2Fri Aug 25 09:45:16 2017Dumping diagnostic data in directory=[cdmp_20170825094516], requested by (instance=2, osid=24510820 (LMS0)), summary=[abnormal instance termination].Reconfiguration started (old inc 28, new inc 32)List of instances:1 (myinst: 1)查看/oracle/11.2.0/diag/rdbms/gjj/ins2/trace/ins2_diag_21561818.trc*** 2017-08-25 14:24:35.900I’m the voting nodeGroup reconfiguration cleanupconfirm->incar_num 22, rcfgctx->prop_incar 0Send my bitmap to master 0kjzgmappropose : incar 0, newmap -30000000000000000000000000000000000000000000000000000免费云主机域名00000000000kjzgmappropose : rc from psnd : 30kjzdattdlm: Can not attach to DLM (LMON up=[TRUE], DB mounted=[FALSE]).kjzdattdlm: Can not attach to DLM (LMON up=[TRUE], DB mounted=[FALSE]).怀疑心跳网络存在问题(这套RAC之前就出现过几次实例被驱逐的问题,但实例自动都启动了,这次实例被驱逐后实例2不能启动,针对之前实例被驱逐的问题进行过参数修改,通过这次的情况来看,实该不是参数设置的问题)。测试心跳网络,连通性和传输速率都没有问题,后续打算通过haip进一步提升心跳网络可用性,在添加haip过程中发现当服和服务器和交换机新添加网络后出来数据包丢失的情况,丢包率50%,判断心跳网络稳定性存在问题,基于此撤掉新添加的心跳线,更换原来的心跳线,重启被驱逐的实例2,实例正常。最后判断是原心跳线RJ45头存在某两芯短路的问题造成此次故障。

相关推荐: linux7系统怎么安装oracle12C R2

这篇文章主要介绍“linux7系统怎么安装oracle12C R2”,在日常操作中,相信很多人在linux7系统怎么安装oracle12C R2问题上存在疑惑,小编查阅了各式资料,整理出简单好用的操作方法,希望对大家解答”linux7系统怎么安装oracle1…

免责声明:本站发布的图片视频文字,以转载和分享为主,文章观点不代表本站立场,本站不承担相关法律责任;如果涉及侵权请联系邮箱:360163164@qq.com举报,并提供相关证据,经查实将立刻删除涉嫌侵权内容。

(0)
打赏 微信扫一扫 微信扫一扫
上一篇 01/17 11:22
下一篇 01/17 11:22