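Setting up an RHCS HA cluster on RHEL 5: the CMAN service will not start!
Hi everyone, this is my first time setting up a cluster, and I followed material I found online. After finishing the setup, the CMAN service will not start. Could you please take a look? Many thanks!
Here is the configuration in my "cluster.conf":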
<?xml version="1.0" ?>
<cluster alias="sztc_cluster" config_version="19" name="sztc_cluster">
<fence_daemon post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
<clusternode name="dfs1.sztc" nodeid="1" votes="1">
<fence>
<method name="1">
<device modulename="" name="drac1"/>
</method>
</fence>
</clusternode>
<clusternode name="dfs3.sztc" nodeid="2" votes="1">
<fence>
<method name="1">
<device modulename="" name="drac3"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman expected_votes="1" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_drac" ipaddr="192.168.2.194" login="root" name="drac1" passwd="654321"/>
<fencedevice agent="fence_drac" ipaddr="192.168.2.195" login="root" name="drac3" passwd="654321"/>
</fencedevices>
<rm>
<failoverdomains>
<failoverdomain name="nfs_fd" ordered="0" restricted="1">
<failoverdomainnode name="dfs1.sztc" priority="1"/>
<failoverdomainnode name="dfs3.sztc" priority="1"/>
</failoverdomain>
</failoverdomains>
<resources>
<ip address="192.168.2.190" monitor_link="1"/>
<fs device="/dev/mapper/project_vg-project_lv" force_fsck="0" force_unmount="1" fsid="62521" fstype="ext3" mountpoint="/storage/sztc" name="project_ext3" options="acl" self_fence="1"/>
<fs device="/dev/mapper/home_vg-home_lv" force_fsck="0" force_unmount="1" fsid="36334" fstype="ext3" mountpoint="/storage/home" name="home_ext3" options="acl,usrquota,grpquota" self_fence="1"/>
<nfsexport name="p_export"/>
<nfsexport name="h_export"/>
<nfsclient name="p_client" options="rw,sync" path="/storage/sztc" target="*"/>
<nfsclient name="h_client" options="rw,sync" path="/storage/home" target="*"/>
<nfsclient name="dell3_h_c" options="no_root_squash,rw,sync" path="/storage/home" target="192.168.2.149"/>
</resources>
<service domain="nfs_fd" name="nfs" recovery="restart">
<ip ref="192.168.2.190"/>
<fs ref="project_ext3"/>
<nfsexport ref="p_export"/>
<fs ref="home_ext3"/>
<nfsexport ref="h_export"/>
<nfsclient ref="dell3_h_c"/>
<nfsclient ref="h_client"/>
</service>
</rm>
</cluster>
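A configuration like the one above can be sanity-checked before starting the services. The following is only a minimal sketch, not part of the original post: the function name `check_fence_refs` and the trimmed `SAMPLE` string are hypothetical, and the check only confirms that every `<device name=...>` under a cluster node refers to a defined `<fencedevice>`.

```python
import xml.etree.ElementTree as ET


def check_fence_refs(conf_xml):
    """Return (node_names, missing).

    node_names: the name attribute of every <clusternode>.
    missing:    (node, device) pairs whose <device name=...> has no
                matching <fencedevice name=...> definition.
    """
    root = ET.fromstring(conf_xml)
    defined = {fd.get("name") for fd in root.iter("fencedevice")}
    node_names = []
    missing = []
    for node in root.iter("clusternode"):
        node_names.append(node.get("name"))
        for dev in node.iter("device"):
            if dev.get("name") not in defined:
                missing.append((node.get("name"), dev.get("name")))
    return node_names, missing


# Hypothetical, trimmed-down copy of the posted configuration.
SAMPLE = """<cluster name="sztc_cluster" config_version="19">
  <clusternodes>
    <clusternode name="dfs1.sztc" nodeid="1" votes="1">
      <fence><method name="1"><device name="drac1"/></method></fence>
    </clusternode>
    <clusternode name="dfs3.sztc" nodeid="2" votes="1">
      <fence><method name="1"><device name="drac3"/></method></fence>
    </clusternode>
  </clusternodes>
  <fencedevices>
    <fencedevice agent="fence_drac" name="drac1"/>
    <fencedevice agent="fence_drac" name="drac3"/>
  </fencedevices>
</cluster>"""
```

Calling `check_fence_refs(SAMPLE)` returns the two node names and an empty list of missing references. The node names it returns are the ones that each node must be able to resolve.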
Then I started the two cluster services used on RHEL 5. One of them, rgmanager, starts fine,
but starting the CMAN service reports the following error:
[email protected] ~]# /etc/init.d/cman restart
Stopping cluster:
Stopping fencing... done
Stopping cman... done
Stopping ccsd... done
Unmounting configfs... done
[ OK ]
Starting cluster:
Loading modules... done
Mounting configfs... done
Starting ccsd... done
Starting cman... failed
cman not started: Can't determine address family of nodename /usr/sbin/cman_tool: aisexec daemon didn't start
#
Any advice would be appreciated.
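The message "Can't determine address family of nodename" suggests that the node's name could not be resolved to an IPv4 or IPv6 address. As a rough way to test that outside of cman, the sketch below uses Python's standard `socket.getaddrinfo`. The helper name `address_families` is hypothetical, and this is only an approximation of whatever lookup cman itself performs.

```python
import socket


def address_families(nodename):
    """Return the sorted address family names that nodename resolves to.

    An empty list means the name could not be resolved at all.
    """
    try:
        infos = socket.getaddrinfo(nodename, None)
    except socket.gaierror:
        return []
    # Each result tuple starts with the address family (AF_INET, AF_INET6, ...).
    return sorted({socket.AddressFamily(info[0]).name for info in infos})
```

On a cluster node one would call it with the node name from cluster.conf, for example `address_families("dfs3.sztc")`. An empty result points at /etc/hosts or DNS rather than at the cluster configuration.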
《Solution》
Reply to 1# liujixuan
Please post your /var/log/messages.
《Solution》
Reply to 2# nagaregawa
Here are the last 30 lines I pulled from /var/log/messages:
Feb 24 08:32:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77250 seconds.
Feb 24 08:33:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77280 seconds.
Feb 24 08:33:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77310 seconds.
Feb 24 08:34:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77340 seconds.
Feb 24 08:34:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77370 seconds.
Feb 24 08:35:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77400 seconds.
Feb 24 08:35:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77430 seconds.
Feb 24 08:36:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77460 seconds.
Feb 24 08:36:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77490 seconds.
Feb 24 08:37:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77520 seconds.
Feb 24 08:37:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77550 seconds.
Feb 24 08:38:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77580 seconds.
Feb 24 08:38:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77610 seconds.
Feb 24 08:39:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77640 seconds.
Feb 24 08:39:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77670 seconds.
Feb 24 08:40:11 dfs3 ccsd: Stopping ccsd, SIGTERM received.
Feb 24 08:40:12 dfs3 ccsd: Starting ccsd 2.0.84:
Feb 24 08:40:12 dfs3 ccsd: Built: Apr 15 2008 16:19:15
Feb 24 08:40:12 dfs3 ccsd: Copyright (C) Red Hat, Inc. 2004 All rights reserved.
Feb 24 08:40:12 dfs3 ccsd: cluster.conf (cluster name = sztc_cluster, version = 17) found.
Feb 24 08:40:15 dfs3 openais: AIS Executive Service RELEASE 'subrev 1358 version 0.80.3'
Feb 24 08:40:15 dfs3 openais: Copyright (C) 2002-2006 MontaVista Software, Inc and contributors.
Feb 24 08:40:15 dfs3 openais: Copyright (C) 2006 Red Hat, Inc.
Feb 24 08:40:15 dfs3 openais: AIS Executive Service: started and ready to provide service.
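《Solution》
Reply to 2# nagaregawa
Thanks for your help. I finally found out why the CMAN service would not start.
In /etc/hosts, the line that should have read: 192.168.2.191 dfs3.sztc dfs3 had been written
as the other node's entry instead: 192.168.2.190 dfs1.sztc dfs1. No wonder starting the CMAN service reported this error. It now starts normally.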
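This kind of mistake can be caught by checking that every node name from cluster.conf has an entry in /etc/hosts. The following is a minimal sketch under stated assumptions: the helper names `parse_hosts` and `missing_nodes` are hypothetical, and the `BROKEN` and `FIXED` strings are reconstructions based on the description above, not the actual file contents.

```python
def parse_hosts(text):
    """Map every hostname and alias in /etc/hosts-style text to its IP."""
    mapping = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        ip, *names = line.split()
        for name in names:
            # Keep the first entry seen for a name (an assumption of this sketch).
            mapping.setdefault(name, ip)
    return mapping


def missing_nodes(hosts_text, node_names):
    """Return the cluster node names that have no entry in the hosts text."""
    mapping = parse_hosts(hosts_text)
    return [name for name in node_names if name not in mapping]


NODES = ["dfs1.sztc", "dfs3.sztc"]

# Reconstructed: the dfs3 line was mistakenly written as the dfs1 entry.
BROKEN = """127.0.0.1 localhost
192.168.2.190 dfs1.sztc dfs1
192.168.2.190 dfs1.sztc dfs1
"""

# Reconstructed: the corrected file as described in the post.
FIXED = """127.0.0.1 localhost
192.168.2.190 dfs1.sztc dfs1
192.168.2.191 dfs3.sztc dfs3
"""
```

With these samples, `missing_nodes(BROKEN, NODES)` reports `dfs3.sztc` as unresolved, while `missing_nodes(FIXED, NODES)` returns an empty list.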