解决SAP PI Cluster系统故障
文檔已經(jīng)交付給用戶了,這里總結(jié)一下:
SAP PI的PI服務(wù)當(dāng)在MSCS群集的node1和node2中都啟動(dòng)的時(shí)候,MSCS故障,所有PI資源組會(huì)在node1和node2中來回切換,導(dǎo)致Oracle OFS資源和MSCS資源也切換,由于PI占用內(nèi)存很大,有30GB內(nèi)存,這樣的自動(dòng)來回切換約8次后,pubilc網(wǎng)卡down,up多次崩潰。
由于MSCS切換和OFS資源切換都沒有問題,檢查MSCS的集群配置參數(shù),無誤。
檢查操作系統(tǒng),看是否有不利于MSCS的補(bǔ)丁,無誤
檢查網(wǎng)絡(luò)設(shè)置和網(wǎng)卡屬性中的BOE offload,RSS,speed,無誤
檢查針對(duì)WINDOWS 2003 R2 SP2中的伸縮端縮放,補(bǔ)丁已達(dá),無誤
檢查public和private的網(wǎng)絡(luò)千兆交換機(jī)環(huán)境,無誤
最后發(fā)現(xiàn):
node1和node2的網(wǎng)卡 HP NC357i驅(qū)動(dòng)都是最新的556版本,而node1 的網(wǎng)卡固件是 527版本,node2的網(wǎng)卡固件是534,經(jīng)查確認(rèn),527固件和556驅(qū)動(dòng)不匹配。找到問題了
解決,驅(qū)動(dòng)由于是最新,不必重裝驅(qū)動(dòng),刷固件
C:\SWSetup\SP50817>nxflash_x64.exe -i private --all
0/8 - Init
*** Currently in flash ***
Board Type?????? : HP NC375i Integrated Quad Port Multifunction Gigabit Server Adapter
Firmware Version : 4.0.534
MAC Address 0??? : 68:B5:99:C4:B2:B8
MAC Address 1??? : 68:B5:99:C4:B2:B9
MAC Address 2??? : 68:B5:99:C4:B2:BA
MAC Address 3??? : 68:B5:99:C4:B2:BB
Serial Number??? : 牋牋牋牋牋牋牋牋牋牋牋牋牋牋牋牋??
NIC binary romimage found in C:\SWSetup\SP50817
Rom Image??????? : C:\SWSetup\SP50817\phantom_romimage
1/8 - Extracting Romimage
Firmware version From Board: 4.0.534
Firmware version From Romimage: 4.0.539
WARNING: This operation will take the NIC offline.
Do you wish to upgrade? (Y/N) y
Disabling devices
Disabling devices
Disabling devices
Disabling devices
Driver Loaded in Quiesce mode
2/8 - Restoring License
?100%? - DONE
?100%? - DONE
No vNIC property area in romimage
No VPD area in romimage
3/8 - Calculating MD5
?100%? - DONE
4/8 - Backing up current flash
?100%? - DONE
Backup file : "flashbackup__v4.0.534_Sat-Oct-13-22-06-50-2012" - completed successfully.
5/8 - Updating flash
WARNING: This is a very sensitive operation.
Do not interrupt until operation is complete.
setting up the flash_write
?100%? - DONE
6/8 - Verifying Flash MD5
Flashing completed successfully.
Reboot system for firmware to take effect
Enabling devices
Enabling devices
Enabling devices
Enabling devices
Driver Loaded in Normal mode
7/8 - Performing cleanup
8/8 - Finished
C:\SWSetup\SP50817>
在2號(hào)機(jī)node2上
C:\SWSetup\SP50817>nxflash_x64.exe -i private --all
0/8 - Init
*** Currently in flash ***
Board Type?????? : HP NC375i Integrated Quad Port Multifunction Gigabit Server A
dapter
Firmware Version : 4.0.527
MAC Address 0??? : 68:B5:99:B3:3C:58
MAC Address 1??? : 68:B5:99:B3:3C:59
MAC Address 2??? : 68:B5:99:B3:3C:5A
MAC Address 3??? : 68:B5:99:B3:3C:5B
Serial Number??? : 牋牋牋牋牋牋牋牋牋牋牋牋牋牋牋牋??
NIC binary romimage found in C:\SWSetup\SP50817
Rom Image??????? : C:\SWSetup\SP50817\phantom_romimage
1/8 - Extracting Romimage
Firmware version From Board: 4.0.527
Firmware version From Romimage: 4.0.539
WARNING: This operation will take the NIC offline.
Do you wish to upgrade? (Y/N) y
Disabling devices
Disabling devices
Disabling devices
Disabling devices
Driver Loaded in Quiesce mode
2/8 - Restoring License
?100%? - DONE
?100%? - DONE
No vNIC property area in romimage
No VPD area in romimage
3/8 - Calculating MD5
?100%? - DONE
4/8 - Backing up current flash
?100%? - DONE
Backup file : "flashbackup__v4.0.527_Sat-Oct-13-21-06-28-2012" - completed succe
ssfully.
5/8 - Updating flash
WARNING: This is a very sensitive operation.
Do not interrupt until operation is complete.
setting up the flash_write
?100%? - DONE
6/8 - Verifying Flash MD5
Flashing completed successfully.
Reboot system for firmware to take effect
Enabling devices
Enabling devices
Enabling devices
Enabling devices
Driver Loaded in Normal mode
7/8 - Performing cleanup
8/8 - Finished
C:\SWSetup\SP50817>
問題解決!
后來和采購確認(rèn),兩臺(tái)機(jī)器來源采購相差半年,不是同一批次。2號(hào)機(jī)是開發(fā)機(jī),半年后才新購1號(hào)機(jī)生產(chǎn)機(jī),然后實(shí)施的時(shí)候開發(fā)機(jī)和生產(chǎn)機(jī)做MSCS PI。
看來實(shí)施MSCS的人技術(shù)很毛躁,不靠譜。Windows企業(yè)環(huán)境要更加精細(xì)化,對(duì)技術(shù)素養(yǎng)要更高,因?yàn)楹芏噱e(cuò)誤你無法深入內(nèi)核解決,我不可能遇到問題就看dump崩潰核心轉(zhuǎn)儲(chǔ)文件,或者拿出windbg就開工。——當(dāng)然這是最后的辦法
總結(jié)
以上是生活随笔為你收集整理的解决SAP PI Cluster系统故障的全部?jī)?nèi)容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: mysql 查询超过60分钟的_mysq
- 下一篇: OA系统