• 博客访问: 2178739
  • 博文数量: 1587
  • 用 户 组: 普通用户
  • 注册时间: 2008-04-17 14:52
个人简介

暂无介绍

文章分类

全部博文(1587)

文章存档

2012年(27)

2011年(37)

2009年(6)

2008年(194)

2007年(448)

2006年(640)

2005年(235)

我的朋友

分类: Linux

2012-03-29 11:07:30

同事测试的 rac vip failback 的问题,记录下:
 

测试从交换机上或者数据库主机端关闭eth0public nic 240/242

拔网线的效果也一样,任一一个网卡,vip漂移正常。

重启原来网卡后,vip漂移回来,但交换机收到arp包,将vipmac地址设置为全0

从而导致网外无法正常连接到failback 回来的vip地址。

Mac 缓冲更新后,恢复正常。

目前交换机设置的mac地址失效时间为4小时。

 

在某些情况下,例如这个casemac地址刷新造成vip failback后在一段时间内不可用。

10.2.0.4 and 11.1版本中,vip默认不做failback,在11gr2中,默认是failback的。

 

解决方法:

1)取消自动failback,修改 $ORACLE_HOME/bin/racgwrap 脚本,启用:

ORA_RACG_VIP_FAILBACK=1 && export ORA_RACG_VIP_FAILBACK

 

2)缩短交换机mac地址缓存时间,由于可能影响整个vlan,所以昨天没有测试。

 

3)手工重定位或者调用脚本,vip漂移后,重新广播新的地址。  arping  -c 5 -U -I eth0 $vipaddress

 

文档和相关bug

VIP does not relocate back to the original node starting from 10.2.0.4 and 11.1 even after the public network problem is resolved. [ID 805969.1]

 

 

Bug 11069846 - Gratuitous ARP packets must be targeted to router to avoid connectivity problems - superceded [ID 11069846.8]


 

修改时间 01-FEB-2012     类型 PATCH     状态 PUBLISHED

 

 

Bug 11069846  Gratuitous ARP packets must be targeted to router to avoid connectivity problems - superceded

 This note gives a brief overview of bug 11069846.
 The content was last updated on: 01-FEB-2012
 Click here for details of each of the sections below.

Affects:

Product (Component)

Oracle Server (PCW)

Range of versions believed to be affected

Versions BELOW 12.1

Versions confirmed as being affected

Platforms affected

Generic (all / most platforms affected)


 Note that this fix can cause / expose the problem described in Bug:13440962

 Note that this fix has been superceded by the fix in Bug:13440962

Fixed:

This issue is fixed in

 

Symptoms:

Related To:

 

  • (None Specified)

 

Description

Note: 
 This fix can cause the problem described in bug 13440962. 
 Please use that fix instead of this one.
 
After failover, the SCAN/VIP addresses are sometimes not reachable from 
outside the network after normal startup.  When this connectivity problem
exists, the addresses are reachable from within the network (subnet), 
but not from outside.  
 
Rediscovery Notes:
 Router ARP cache is not refreshing (router still thinks the IP belongs 
 to the old MAC address) when the SCAN/VIP addresses move from one node 
 to the other
 
Workaround
 None
 

HOOKS CAUSES:13440962 SUPERCEDED:13440962 LIKELYAFFECTS XAFFECTS_11.2.0.1 XAFFECTS_V11020001 AFFECTS=11.2.0.1 XAFFECTS_11.2.0.2 XAFFECTS_V11020002 AFFECTS=11.2.0.2 XPRODID_5 PRODUCT_ID=5 PRODID-5 PCW XCOMP_PCW COMPONENT=PCW TAG_OPSM OPSM FIXED_11.2.0.3 FIXED_12.1.0.0

Please note: The above is a summary description only. Actual symptoms can vary. Matching to any symptoms here does not confirm that you are encountering this problem. For questions about this bug please consult Oracle Support.

References

Bug:11069846 (This link will only work for PUBLISHED bugs)
Note:245840.1 Information on the sections in this article

 

 

 


 

Bug 13440962 - Different subnet failed to connect to vip after restart vip [ID 13440962.8]


 

修改时间 01-FEB-2012     类型 PATCH     状态 PUBLISHED

 

 

Bug 13440962  Different subnet failed to connect to vip after restart vip

 This note gives a brief overview of bug 13440962. 
 The content was last updated on: 01-FEB-2012
 Click here for details of each of the sections below.

Affects:

Product (Component)

Oracle Server (PCW)

Range of versions believed to be affected

Versions >= 11.2.0.3 but BELOW 12.1

Versions confirmed as being affected

Platforms affected

Generic (all / most platforms affected)


 It is believed to be a regression in default behaviour thus:
   Regression introduced in 11.2.0.3

Fixed:

This issue is fixed in

 

Symptoms:

Related To:

 

  • (None Specified)

 

Description

This is a regression fix for problem introduced by patch 11069846. 
The change in this patch (patch 13440962) fixes a problem with 4 extra 
bytes in the GARP message and removes an extra unicast GARP packet to 
the router.
 
Rediscovery Notes:
 After upgrading to 11.2.0.3, after vip failover, the ip address is 
 not pingable from a different subnet on Linux. 
 (This problem is seen only on Linux)
 
Workaround
 After vip failover, run command 
   /sbin/arping -U -c 3 -I   
 to update the ARP table of router. 
 

Please note: The above is a summary description only. Actual symptoms can vary. Matching to any symptoms here does not confirm that you are encountering this problem. For questions about this bug please consult Oracle Support.

References

Bug:13440962 (This link will only work for PUBLISHED bugs)
Note:245840.1 Information on the sections in this article

阅读(2580) | 评论(1) | 转发(0) |
给主人留下些什么吧!~~

zhouwf07262014-04-14 09:42:47

文档号给漏掉了,How to disable Automatic VIP failback (Doc ID 1280218.1)

zhouwf07262012-04-05 16:40:29

打补丁过程: (1) 安装最新opatch。 (2) 安装补丁p12539000_112024_Linux-x86-64。 (3) 升级grid crs和oracle software到11.2.0.3.0 (p10404530_112030_Linux-x86-64),之前必须先安装补丁p12539000_112024_Linux-x86-64,否则升级过程报错。 (4) 升级数据库。 (5) 安装补丁p13440962_112030_Linux-x86-64。 升级过程每个步骤,按Readme文件逐步执行,步骤较多,遇到问题,检查相应的log文件即可。 测试: (1) ifdown eth0/ifup eth0,过程期间,vip漂移、返回正常,可以ping通。 (2) 重启一台机器,过程中,vip漂移返回正常,可以ping通。 结论:以上方式可修正vip failback时,arp 缓存没正确flush的问题。 主要参考文档:Bug 13440962 - Different subnet failed to connect to vip after restart vip [ID 13440962.8] 如果考虑打补丁动作太大,临时解决方法:经过测试11.2想控制vip不做failback,racgwrap貌似不管用,控制方法可以参考文档

评论热议
请登录后评论。

登录 注册