ITPub博客

首页 > Linux操作系统 > Linux操作系统 > HP-unix下安装11G RAC出现的问题(四)

HP-unix下安装11G RAC出现的问题(四)

原创 Linux操作系统 作者:anycall2010 时间:2009-06-21 08:18:42 0 删除 编辑

安装CRS的时候,比较顺利,但是在运行脚本: ”/home/oracle/crs/root.sh“的时候,始终启动不了CRS。错误现象如下:

Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Failure at final check of Oracle CRS stack.
10

察看相关错误:

# cat css382.log
Oracle Database 11g CRS Release 11.1.0.6.0 - Production Copyright 1996, 2007 Oracle. All rights reserved.
2009-06-19 16:21:26.807: [ CSSCLNT][1]clsssInitNative: failed to connect to (ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_bl870_1_)), rc 9

我一直怀疑是网卡或者某个通信有问题。所以,一直对刀片小型机是否能安装ORACLE存在怀疑。并且刀片小型机的网卡结构也是比较复杂的。

求助METALINK,METALINK给我一个解决办法:跟踪执行过程,察看ORACLE究竟是在等待什么?或者那个地方出问题。

进入DEBUG模式:

1.对需要调试文件进行备份:

cp $ORA_CRS_HOME/install/rootinstall $ORA_CRS_HOME/install/rootinstall.bak
cp $ORA_CRS_HOME/install/rootconfig $ORA_CRS_HOME/install/rootconfig.bak

2.修改2个配置文件:

配置文件rootinstall 和rootconfig 脚本中,添加-X:

比如: 
#!/bin/sh -x
#
# rootinstall.sbs for CRS installs

3.运行脚本:

script. /tmp/rootsh.log
./root.sh

4.察看系统日志:

tail -f /var/adm/syslog/syslog.log
# cd /home/oracle/oraInventory/logs
# tail -f installActions2009-06-20_03-23-03PM.log

具体做法如下调试记录:

# tail -f syslog.log
Jun 20 15:54:03 bl870_1  above message repeats 47 times
Jun 20 15:54:05 bl870_1 syslog: Oracle Cluster Ready Services waiting for HP-UX Service Guard to start.
Jun 20 15:54:51 bl870_1 vmunix: Dead gateway detection can't ping the last remaining default gateway at 0x23015afe .See ndd -h ip_ire_gw_probe for more info
Jun 20 15:56:25 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 15:57:03 bl870_1 SQLAnywhere(veritas_dbms3_bl870_1): Starting checkpoint of "vxdbms" (vxdbms.db) at Sat Jun 20 2009 15:57
Jun 20 15:57:04 bl870_1 SQLAnywhere(veritas_dbms3_bl870_1): Finished checkpoint of "vxdbms" (vxdbms.db) at Sat Jun 20 2009 15:57
Jun 20 15:56:25 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 15:57:51 bl870_1 vmunix: Dead gateway detection can't ping the last remaining default gateway at 0x23015afe .See ndd -h ip_ire_gw_probe for more info
Jun 20 16:01:25 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 16:07:34 bl870_1 syslog: libtt[23096]: ttdt_Xt_input_handler(): tttk_message_receive(): TT_ERR_NOMP      No ttsession process is running, probably because tt_open() has not been called yet. If this code is returned from tt_open() it means ttsession could not be started, which generally means ToolTalk is not installed on this system
.

根据系统日志,我发现有2点比较可疑,进行分析:

1、Jun 20 15:54:05 bl870_1 syslog: Oracle Cluster Ready Services waiting for HP-UX Service Guard to start.

oracle究竟在等待什么?HP的SG集群软件?我的环境是用VERTIAS做的并发卷组,没有用到HP的SG?经过求助有经验的工程师,后来才知道,VERTIAS做集群的时候,需要安装补丁。

cp  /opt/VRTSvcs/rac/patch/init.cssd-11gR1.patch /home/oracle/crs/css/admin/
patch init.cssd < init.cssd-11gR1.patch

结果问题解决。

2、Jun 20 15:56:25 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network

我曾经一直对这个地方怀疑,一直觉得HP的刀片有问题,结果竟然这个地方是正常的。OH!OH!

正确安装情况记录:

节点1执行情况:

# tail -f /var/adm/syslog/syslog.log
Jun 20 17:13:41 bl870_1 vmunix: GAB INFO V-15-1-20036 Port f gen   64ba0d membership 01
Jun 20 17:13:41 bl870_1 vmunix: GLM recovery : gen 64ba0d mbr 3 0 0 0
Jun 20 17:13:41 bl870_1 vxfsckd: vxfs: vxfsckd started
Jun 20 17:14:45 bl870_1 vxfsckd: /dev/vx/rdsk/oradg/oravol2: log replay in progress
Jun 20 17:14:45 bl870_1 vxfsckd: /dev/vx/rdsk/oradg/oravol2: replay complete - marking super-block as CLEAN
Jun 20 17:14:45 bl870_1 vxfsckd: /dev/vx/rdsk/oradg/oravol3: log replay in progress
Jun 20 17:14:45 bl870_1 vxfsckd: /dev/vx/rdsk/oradg/oravol3: replay complete - marking super-block as CLEAN
Jun 20 17:14:45 bl870_1 vxfsckd: /dev/vx/rdsk/oradg/oravol1: log replay in progress
Jun 20 17:14:45 bl870_1 vxfsckd: /dev/vx/rdsk/oradg/oravol1: replay complete - marking super-block as CLEAN
Jun 20 17:17:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 17:29:17 bl870_1 sshd[8478]: SSH: Server;Ltype: Version;Remote: 35.1.90.21-50545;Protocol: 2.0;Client: OpenSSH_5.1p1+sftpfilecontrol-v1.2-hpn13v5
Jun 20 17:27:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 17:30:11 bl870_1  above message repeats 5 times
Jun 20 17:32:06 bl870_1 sshd[9015]: SSH: Server;Ltype: Version;Remote: 35.1.90.88-1383;Protocol: 2.0;Client: SecureCRT_5.1.2 (build 274) SecureCRT
Jun 20 17:32:08 bl870_1 sshd[9015]: Accepted password for root from 35.1.90.88 port 1383 ssh2
Jun 20 17:32:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 17:32:47 bl870_1 SQLAnywhere(veritas_dbms3_bl870_1): Starting checkpoint of "vxdbms" (vxdbms.db) at Sat Jun 20 2009 17:32
Jun 20 17:32:47 bl870_1 SQLAnywhere(veritas_dbms3_bl870_1): Finished checkpoint of "vxdbms" (vxdbms.db) at Sat Jun 20 2009 17:32
Jun 20 17:45:29 bl870_1 su: + 2 root-oracle
Jun 20 17:45:29 bl870_1 root: Oracle Cluster Ready Services starting by user request.
Jun 20 17:45:40 bl870_1 su: + tty?? root-oracle
Jun 20 17:42:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 17:45:41 bl870_1  above message repeats 5 times
Jun 20 17:45:41 bl870_1 syslog: Cluster Ready Services completed waiting on dependencies.
Jun 20 17:45:41 bl870_1 syslog: Running CRSD with TZ =
Jun 20 17:45:41 bl870_1 syslog: Oracle CSS Family monitor starting.
Jun 20 17:45:41 bl870_1 syslog: Cluster Ready Services completed waiting on dependencies.
Jun 20 17:45:42 bl870_1  above message repeats 2 times
Jun 20 17:45:42 bl870_1 syslog: Oracle CSS restart. 0, 2
Jun 20 17:47:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 17:45:42 bl870_1 su: + tty?? root-oracle
Jun 20 17:50:11 bl870_1  above message repeats 5 times
Jun 20 17:47:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 17:52:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 17:52:49 bl870_1 SQLAnywhere(veritas_dbms3_bl870_1): Starting checkpoint of "vxdbms" (vxdbms.db) at Sat Jun 20 2009 17:52
Jun 20 17:52:49 bl870_1 SQLAnywhere(veritas_dbms3_bl870_1): Finished checkpoint of "vxdbms" (vxdbms.db) at Sat Jun 20 2009 17:52
Jun 20 17:57:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 18:01:09 bl870_1  above message repeats 3 times
Jun 20 18:02:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 18:03:15 bl870_1 sshd[22125]: SSH: Server;Ltype: Version;Remote: 35.1.90.21-52661;Protocol: 2.0;Client: OpenSSH_5.1p1+sftpfilecontrol-v1.2-hpn13v5
Jun 20 18:04:09 bl870_1 su: + 2 root-oracle
Jun 20 18:07:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 18:10:12 bl870_1  above message repeats 3 times
Jun 20 18:12:33 bl870_1 vmunix: LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 1 on the same network
Jun 20 18:12:51 bl870_1 SQLAnywhere(veritas_dbms3_bl870_1): Starting checkpoint of "vxdbms" (vxdbms.db) at Sat Jun 20 2009 18:12
Jun 20 18:12:51 bl870_1 SQLAnywhere(veritas_dbms3_bl870_1): Finished checkpoint of "vxdbms" (vxdbms.db) at Sat Jun 20 2009 18:1

节点2情况:

# /home/oracle/crs/root.sh
WARNING: directory '/home/oracle' is not owned by root
Checking to see if Oracle CRS stack is already configured
Checking to see if any 9i GSD is up

Setting the permissions on OCR backup directory
Setting up Network socket directories
Oracle Cluster Registry configuration upgraded successfully
The directory '/home/oracle' is not owned by root. Changing owner to root
clscfg: EXISTING configuration version 4 detected.
clscfg: version 4 is 11 Release 1.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node :
node 0: bl870_1 dbp-priv bl870_1
node 1: bl870_2 dbs-priv bl870_2
clscfg: Arguments check out successfully.

NO KEYS WERE WRITTEN. Supply -force parameter to override.
-force is destructive and will destroy any previous cluster
configuration.
Oracle Cluster Registry for cluster has already been initialized
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Cluster Synchronization Services is active on these nodes.
        bl870_1
        bl870_2
Cluster Synchronization Services is active on all the nodes.
Waiting for the Oracle CRSD and EVMD to start
Oracle CRS stack installed and running under init(1M)
Running vipca(silent) for configuring nodeapps

Creating VIP application resource on (2) nodes...
Creating GSD application resource on (2) nodes...
Creating ONS application resource on (2) nodes...
Starting VIP application resource on (2) nodes...
Starting GSD application resource on (2) nodes1:CRS-0215: Could not start resource 'ora.bl870_1.gsd'.
Check the log file "/home/oracle/crs/log/bl870_1/racg/ora.bl870_1.gsd.log" for more details
.1:CRS-0215: Could not start resource 'ora.bl870_2.gsd'.
Check the log file "/home/oracle/crs/log/bl870_2/racg/ora.bl870_2.gsd.log" for more details
..
Starting ONS application resource on (2) nodes...


Done.
这个好像11G基本都起不来,不用管它好了,就此CRS安装结束。

 

 



 

来自 “ ITPUB博客 ” ,链接:http://blog.itpub.net/8334342/viewspace-607062/,如需转载,请注明出处,否则将追究法律责任。

请登录后发表评论 登录
全部评论

注册时间:2008-06-24

  • 博文量
    182
  • 访问量
    559395