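RHCS service fails to start, please help
The version is Red Hat 6.4.
There is only one service group, and it contains a single IP resource.
But the IP resource never comes up.
The status is as follows: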
# clustat
Cluster Status for cluster_1 @ Fri Jun 14 09:48:06 2013
Member Status: Quorate
Member Name                  ID   Status
------ ----                  ---- ------
HP-1                            1 Online, Local, rgmanager
HP-2                            2 Online, rgmanager
Service Name                 Owner (Last)                 State
------- ----                 ----- ------                 -----
service:server-1             none                         recovering
#
The configuration is as follows:
# cat /etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster config_version="13" name="cluster_1">
  <clusternodes>
    <clusternode name="HP-1" nodeid="1">
      <fence>
        <method name="Method-1">
          <device name="fence-1"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="HP-2" nodeid="2">
      <fence>
        <method name="Method-2">
          <device name="fence-2"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>
  <cman expected_votes="1" two_node="1"/>
  <fencedevices>
    <fencedevice agent="fence_ipmilan" ipaddr="199.5.211.156" login="mals" name="fence-1" passwd="malslbs!@#"/>
    <fencedevice agent="fence_ipmilan" ipaddr="199.5.211.157" login="mals" name="fence-2" passwd="malslbs!@#"/>
  </fencedevices>
  <rm>
    <failoverdomains>
      <failoverdomain name="domain-1" ordered="1" restricted="1">
        <failoverdomainnode name="HP-1" priority="1"/>
        <failoverdomainnode name="HP-2" priority="10"/>
      </failoverdomain>
    </failoverdomains>
    <resources>
      <ip address="199.5.211.158/255.255.255.0" sleeptime="10"/>
    </resources>
    <service domain="domain-1" exclusive="1" name="server-1" recovery="relocate">
      <ip ref="199.5.211.158/255.255.255.0"/>
    </service>
  </rm>
</cluster>
《Solution》
A follow-up:
When the resource is started, /var/log/messages shows the following:
Jun 14 10:10:13 HP-1 ricci: Executing '/usr/bin/virsh nodeinfo'
Jun 14 10:10:13 HP-1 ricci: Executing '/usr/libexec/ricci/ricci-worker -f /var/lib/ricci/queue/923986086'
Jun 14 10:10:13 HP-1 ricci: Executing '/usr/bin/virsh nodeinfo'
Jun 14 10:10:13 HP-1 ricci: Executing '/usr/libexec/ricci/ricci-worker -f /var/lib/ricci/queue/1444572861'
Jun 14 10:10:13 HP-1 modcluster: Starting service: server-1 on node
Jun 14 10:10:13 HP-1 ricci: Executing '/usr/bin/virsh nodeinfo'
Jun 14 10:10:14 HP-1 ricci: Executing '/usr/libexec/ricci/ricci-worker -f /var/lib/ricci/queue/79635758'
Jun 14 10:10:14 HP-1 ricci: Executing '/usr/libexec/ricci/ricci-worker -f /var/lib/ricci/queue/485173658'
Jun 14 10:10:17 HP-1 ricci: Executing '/usr/bin/virsh nodeinfo'
The web page then shows:
Starting cluster "cluster_1" service "server-1" from node "HP-1" failed: server-1 is in unknown state 118
《Solution》
When starting the service:
《Solution》
# tail -f /var/log/messages
Jun 14 10:33:03 HP-1 rgmanager: Executing /etc/init.d/httpd stop
Jun 14 10:33:04 HP-1 rgmanager: Service service:server-1 is stopped
Jun 14 10:33:54 HP-1 rgmanager: Starting stopped service service:server-1
Jun 14 10:33:54 HP-1 rgmanager: start on ip "199.5.211.158/255.255.255.0" returned 1 (generic error)
Jun 14 10:33:54 HP-1 rgmanager: #68: Failed to start service:server-1; return value: 1
Jun 14 10:34:03 HP-1 rgmanager: 199.5.211.158/255.255.255.0 is not configured
Jun 14 10:34:03 HP-1 rgmanager: Stopping service service:server-1
Jun 14 10:34:03 HP-1 rgmanager: Executing /etc/init.d/httpd stop
Jun 14 10:34:03 HP-1 rgmanager: Service service:server-1 is recovering
Jun 14 10:34:04 HP-1 rgmanager: Service service:server-1 is stopped
《Solution》
Reply to 4# hejia0105
Jun 14 10:33:54 HP-1 rgmanager: start on ip "199.5.211.158/255.255.255.0" returned 1 (generic error)
This is a bug: the netmask must be written as a prefix length, i.e. "199.5.211.158/24", and then the service starts.
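For reference, here is a sketch of how the relevant part of the <rm> section from the original post might look with the address written in prefix form (/24 is the prefix-length equivalent of 255.255.255.0). Only the two address strings change. The ref in the service has to match the address in <resources>, so both are updated together. This is an illustration based on the configuration posted above, not a tested configuration. config_version in the <cluster> tag would also need to be raised above 13 for the change to be picked up.

```xml
<!-- Sketch only: same <rm> section as in the original post, with the
     dotted netmask replaced by a /24 prefix length. -->
<rm>
  <!-- <failoverdomains> block unchanged from the original post, omitted here -->
  <resources>
    <!-- was: address="199.5.211.158/255.255.255.0" -->
    <ip address="199.5.211.158/24" sleeptime="10"/>
  </resources>
  <service domain="domain-1" exclusive="1" name="server-1" recovery="relocate">
    <!-- ref must match the address attribute of the resource above -->
    <ip ref="199.5.211.158/24"/>
  </service>
</rm>
```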