Resolving ORA-29702: error occurred in Cluster Group Service operation
Posted: 2011.05.10 15:43
Category: Oracle
Source: http://djb1008.itpub.net/post/42280/517654
---------------------------------------------------------------
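1. Problem description
A backup was taken of an existing AIX 6.1 + HACMP + Oracle RAC 10.2.0.5 environment and used for a full operating-system restore onto a new POWER 5 machine. After the restore the Oracle software was already in place, and the next task was to create a single-instance database in that environment.
I tried to create the single-instance database manually, but starting the instance in nomount state failed with: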
SQL> startup nomount;
ORA-29702: error occurred in Cluster Group Service operation
As a result, the database creation could not proceed.

2. Analysis
First, check the database alert log, which contains the following:

$ more alert_test1.log
Tue May 10 07:28:58 GMT+08:00 2011
Starting ORACLE instance (normal)
sskgpgetexecname failed to get name
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
Tue May 10 07:28:59 GMT+08:00 2011
Errors in file /oracle/admin/test1/udump/test1_ora_2425116.trc:
ORA-27504: IPC error creating OSD context
ORA-27300: OS system dependent operation:skgxnqtsz failed with status: 0
ORA-27301: OS failure message: Error 0
ORA-27302: failure occurred at: SKGXN not av
clsssinit ret = 21
interconnect information is not available from OCR
WARNING: No cluster interconnect has been specified. Depending on
the communication driver configured Oracle cluster traffic
may be directed to the public interface of this machine.
Oracle recommends that RAC clustered databases be co# more
: A file or directory in the path name does not exist.

The alert log points to /oracle/admin/test1/udump/test1_ora_2425116.trc. That trace file contains:
$ more /oracle/admin/test1/udump/test1_ora_2425116.trc
/oracle/admin/test1/udump/test1_ora_2425116.trc
Oracle Database 10g Enterprise Edition Release 10.2.0.5.0 - 64bit Production
With the Partitioning, Real Application Clusters, OLAP, Data Mining
and Real Application Testing options
Oracle process number: 0
Unix process pid: 2425116, image: oracle@localhost
*** 2011-05-10 07:28:58.792
Number of resource hash buckets is 16
Parsing user specified table space list to be ignored
2011-05-10 07:28:59.810: [ COMMCRS]clsc_set_clsd_NS_trace: called before init completed
2011-05-10 07:28:59.906: [ CSSCLNT]clsssinit: error(32 PROC-32: Cluster Ready Services on the local node is not running Messaging error [9]) in OCR initialization
ORA-27504: IPC error creating OSD context
ORA-27300: OS system dependent operation:skgxnqtsz failed with status: 0
ORA-27301: OS failure message: Error 0
ORA-27302: failure occurred at: SKGXN not av
Number of resource hash buckets is 16
* kjfcnfy: kjinumbuckets = 8
Dynamic strand is set to TRUE
Running with 2 shared and 48 private strand(s). Zero-copy redo is FALSE
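
The messages that matter in the alert log and trace above are PROC-32 (Cluster Ready Services not running), "SKGXN not av", and ORA-29702 itself. As a rough sketch, a small filter can report whether a given log shows this signature. The function name `has_css_signature` and the choice of marker strings are assumptions made for illustration; they are not part of any Oracle tool.

```shell
#!/bin/sh
# Report whether log text on stdin contains one of the marker strings
# seen in this incident (a RAC-linked binary with no cluster services).
# The marker list is an illustrative assumption, not an exhaustive rule.
has_css_signature() {
    if grep -E -q 'PROC-32|ORA-29702|SKGXN not av'; then
        echo "cluster-services signature found"
    else
        echo "no signature"
    fi
}

# Example: two lines copied from the trace file above.
has_css_signature <<'EOF'
ORA-27504: IPC error creating OSD context
ORA-27302: failure occurred at: SKGXN not av
EOF
```

On the real host the same function could be fed the trace file directly, for example `has_css_signature < /oracle/admin/test1/udump/test1_ora_2425116.trc`.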

A search for ORA-29702 on Metalink turned up a very useful note:
Starting Instance Fails with ORA-29702 [ID 216030.1]
Modified 16-SEP-2010  Type PROBLEM  Status PUBLISHED
fact: Oracle Server Enterprise Edition 8
fact: Oracle Parallel Server (OPS)
fact: AIX-Based Systems
symptom: Starting database fails
symptom: ORA-29702: error occurred in Cluster Group Service operation
symptom: Mounting database in Non-OPS (exclusive) mode
symptom: Environment not configured for OPS
cause: Oracle in IBM RS6000 SP installs the Parallel Server Option as the
default option. As a result, Oracle tries to communicate with GMS or Cluster
Manager during startup, but the environment is not configured to work in
Parallel Server mode.
fix:
Relink Oracle to disable Parallel Server Option:
$ cd $ORACLE_HOME/rdbms/lib
$ make -f ins_rdbms.mk no_parropt
$ make -f ins_rdbms.mk install
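
Although the note was written for Oracle 8 with Oracle Parallel Server, it matches the situation here: the binary is linked with the cluster option, but the goal is a single-instance database rather than a RAC database, so the parallel (cluster) option has to be disabled.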
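
Before relinking, it can help to confirm which variant of the option is currently linked in. The make output later in this post shows that `no_parropt` deletes `kcsm.o` from `libknlopt.a` and adds `ksnkcs.o`, and that the link line itself tests for `kcsm.o` with `ar -X64 t`. The sketch below applies the same test; the function name `rac_option_state` is an assumption for illustration, and the `ar -X64` call only makes sense on the AIX host with `ORACLE_HOME` set.

```shell
#!/bin/sh
# Classify the cluster option from a list of archive member names on stdin.
#   kcsm.o   -> cluster (Parallel Server / RAC) option linked in
#   ksnkcs.o -> stub object, option disabled
rac_option_state() {
    members=$(cat)
    if printf '%s\n' "$members" | grep -q '^kcsm\.o$'; then
        echo "enabled"
    elif printf '%s\n' "$members" | grep -q '^ksnkcs\.o$'; then
        echo "disabled"
    else
        echo "unknown"
    fi
}

# On the AIX host (assumption: ORACLE_HOME is set); skipped anywhere else.
lib="${ORACLE_HOME:-}/rdbms/lib/libknlopt.a"
if [ -f "$lib" ]; then
    ar -X64 t "$lib" | rac_option_state
fi
```

If this reports "enabled" on a machine with no cluster services, it is consistent with the startup failure described above.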

3. Solution
Run the commands from the fix section of the Metalink note to disable Parallel Server mode.

$ cd $ORACLE_HOME/rdbms/lib
$ make -f ins_rdbms.mk no_parropt
rm -f /oracle/product/10.2.0/db/lib/libskgxp10.a
cp /oracle/product/10.2.0/db/lib//libskgxpd.a /oracle/product/10.2.0/db/lib/libskgxp10.a
rm -f /oracle/product/10.2.0/db/lib/libskgxn2.a
cp /oracle/product/10.2.0/db/lib//libskgxns.a /oracle/product/10.2.0/db/lib/libskgxn2.a
/bin/ar -X64 d /oracle/product/10.2.0/db/rdbms/lib/libknlopt.a kcsm.o
/bin/ar -X64 cr /oracle/product/10.2.0/db/rdbms/lib/libknlopt.a /oracle/product/10.2.0/db/rdbms/lib/ksnkcs.o
Target "no_parropt" is up to date.
$ make -f ins_rdbms.mk install
chmod 755 /oracle/product/10.2.0/db/bin
rm -f oracle dbv tstshm maxmem orapwd dbfsize cursize genoci extproc extproc32 hsalloci hsots hsdepxa dgmgrl dumpsga mapsga osh sbttest expdp impdp imp exp sqlldr rman hsodbc tg4sybs nid extjob extjobo genezi ikfod grdcscan /oracle/product/10.2.0/db/rdbms/lib/ksms.s /oracle/product/10.2.0/db/rdbms/lib/ksms.o
- Linking DB*Verify utility (dbv)
……
/bin/ar -X64 t /oracle/product/10.2.0/db/rdbms/lib/libknlopt.a | grep '^'kcsm.o > /dev/null 2>&1 ; then echo "-lha_gs_r -lha_em_r -lpthreads"; fi` -locijdbcst10 -lwwg -bpT:0x100000000 -bpD:0x110000000 -bforceimprw
mv -f /oracle/product/10.2.0/db/bin/oracle /oracle/product/10.2.0/db/bin/oracleO
mv /oracle/product/10.2.0/db/rdbms/lib/oracle /oracle/product/10.2.0/db/bin/oracle
chmod 6751 /oracle/product/10.2.0/db/bin/oracle
Target "install" is up to date.
SQL> startup nomount;
ORACLE instance started.
Total System Global Area 1073741824 bytes
Fixed Size 2101912 bytes
Variable Size 545262952 bytes
Database Buffers 524288000 bytes
Redo Buffers 2088960 bytes

With that, the problem is solved: the instance starts in nomount state, and the manual database creation can continue.
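
For repeatability, the two relink commands can be wrapped in a small script. This is only a sketch under stated assumptions: the helper names `run` and `relink_without_rac` and the `DRY_RUN` variable are hypothetical, and the extra copy of the `oracle` binary is a precaution added here, not a step from the Metalink note (the `install` target already renames the old binary to `oracleO`, as the output above shows). By default the script only prints the commands; any instance using this Oracle home should be shut down before running it for real.

```shell
#!/bin/sh
# Print each command (DRY_RUN=1, the default) or execute it (DRY_RUN=0).
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Relink the oracle binary without the Parallel Server / RAC option.
# $1 is the Oracle home directory.
relink_without_rac() {
    home="$1"
    # Hypothetical safety step, not in the original procedure.
    run cp -p "$home/bin/oracle" "$home/bin/oracle.pre_no_parropt" || return 1
    run cd "$home/rdbms/lib" || return 1
    # The two commands from the Metalink note.
    run make -f ins_rdbms.mk no_parropt || return 1
    run make -f ins_rdbms.mk install
}

# Falls back to the Oracle home path seen in the make output above.
relink_without_rac "${ORACLE_HOME:-/oracle/product/10.2.0/db}"
```

Running it with `DRY_RUN=0` as the Oracle software owner would execute the steps instead of printing them.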
Reposted from: https://blog.51cto.com/itbull/1131184