歡迎您光臨本站 註冊首頁

ipmi fence問題請教大家

←手機掃碼閱讀     火星人 @ 2014-03-04 , reply:0

ipmi fence問題請教大家

內核:2.6.18-194.el5
OS:Red Hat Enterprise Linux Server release 5.5 (Tikanga)
2台IBM x3650 m2做RHCS,用x3650 m2伺服器的ipmi做fence設備
網路設置:
節點1:
public IP 10.72.86.121
private IP 10.1.1.1
ipmi IP 10.72.86.126


節點2:
public IP 10.72.86.122
private IP 10.1.1.2
ipmi IP 10.72.86.127

心跳private IP網口用直連網線連接起來,public IP網口和ipmiIP網口接到同一個交換機

現象1:在節點1上用fence_ipmilan 10.72.86.127命令可以fence節點2,讓節點2重啟了:
# fence_ipmilan -a 10.72.86.127
Rebooting machine @ IPMI:10.72.86.127...Done
但是messages日誌會報如下錯誤信息:
Jun 28 19:47:17 elndb1 fenced: agent "fence_ipmilan" reports: Rebooting machine @ IPMI:10.72.86.127...Failed
Jun 28 19:47:17 elndb1 fenced: fence "elndb2.eln.com" failed


現象2:在節點1上用fence_node elndb2.eln.com命令fence節點2失敗:
# fence_node elndb2.eln.com
agent "fence_ipmilan" reports: Rebooting machine @ IPMI:10.72.86.127...Failed
messages日誌報如下錯誤信息:
Jun 28 20:43:11 elndb1 fence_node: agent "fence_ipmilan" reports: Rebooting machine @ IPMI:10.72.86.127...Failed
Jun 28 20:43:11 elndb1 fence_node: Fence of "elndb2.eln.com" was unsuccessful



cluster.conf文件配置如下:
<?xml version="1.0"?>
<cluster alias="elndb_cluster" config_version="9" name="elndb_cluster">
<fence_daemon post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
<clusternode name="elndb1.eln.com" nodeid="1" votes="1">
<fence>
<method name="1">
<device name="ipmi1"/>
</method>
</fence>
</clusternode>
<clusternode name="elndb2.eln.com" nodeid="2" votes="1">
<fence>
<method name="1">
<device name="ipmi2"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman expected_votes="1" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_ipmilan" auth="none" ipaddr="10.72.86.126" login="USERID" name="ipmi1" passwd="PASSW0RD"/>
<fencedevice agent="fence_ipmilan" auth="none" ipaddr="10.72.86.127" login="USERID" name="ipmi2" passwd="PASSW0RD"/>
</fencedevices>
<rm>
<failoverdomains>
<failoverdomain name="elndbdomain" restricted="1">
<failoverdomainnode name="elndb1.eln.com" priority="1"/>
<failoverdomainnode name="elndb2.eln.com" priority="1"/>
</failoverdomain>
</failoverdomains>
<resources>
<ip address="10.72.86.130" monitor_link="1"/>
</resources>
<service autostart="0" domain="elndbdomain" name="elndb_svc" recovery="relocate">
<ip ref="10.72.86.130"/>
</service>
</rm>
</cluster>


從messages日誌看到的信息非常有限,請問一下大家該如何處理?
《解決方案》

該問題屬於你的USERID的許可權不夠,所以fence失敗。
《解決方案》

還有一個可能是:
<fencedevice agent="fence_ipmilan" auth="none" ipaddr="10.72.86.127" login="USERID" name="ipmi2" passwd="PASSW0RD"/>

中的auth="none"應該刪掉,存在這個會有問題。另外:你的用戶和密碼是系統默認的特權用戶應該是可以的

[火星人 ] ipmi fence問題請教大家已經有640次圍觀

http://coctec.com/docs/service/show-post-5561.html