Thanks for the response, Frank. I have retried this a few times now,
and rebooted the machine. Still no luck. I have turned on tracing, and
here is the error I see:
nsglbgetRSPidx: returning ecode=0
sntpcall: only 0 bytes read
sntpcall: Can't read from pipe; err[1] = 32
nserror: nsres: id=6, op=72, ns=12547, ns2=12560; nt[0]=517, nt[1]=32,
nt[2]=0; ora[0]=0, ora[1]=0, ora[2]=0
Does that shed some more light?
> Thanks for the response, Frank. I have retried this a few times now,
> and rebooted the machine. Still no luck, I have turned on tracing, and
[quoted text clipped - 7 lines]
>
> Does that shed some more light?
12547, 00000, "TNS:lost contact"
// *Cause: Partner has unexpectedly gone away, usually during process
// startup.
// *Action: Investigate partner application for abnormal termination. On an
// Interchange, this can happen if the machine is overloaded.
The 12560 is the generic error reported back. You can look these messages
up yourself, too, btw:
oerr tns 12547
or
oerr ora 1401
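For anyone reading along, here is a minimal sketch of pulling the codes out of the nserror trace line so they can be fed to oerr. The function name parse_nserror is made up for illustration, and reading nt[1] as an operating system errno is an assumption based on the "Can't read from pipe; err[1] = 32" line in the trace, not something the trace itself states.

```python
import errno
import re

# The nserror line from the trace above, joined back onto one line.
TRACE = (
    "nserror: nsres: id=6, op=72, ns=12547, ns2=12560; nt[0]=517, nt[1]=32, "
    "nt[2]=0; ora[0]=0, ora[1]=0, ora[2]=0"
)


def parse_nserror(line):
    """Map every name=value field in the trace line to an integer."""
    pairs = re.findall(r"([a-z]\w*(?:\[\d+\])?)=(\d+)", line)
    return {name: int(value) for name, value in pairs}


fields = parse_nserror(TRACE)

# ns and ns2 are the TNS codes that oerr can look up.
for key in ("ns", "ns2"):
    print("oerr tns", fields[key])

# Assumption: nt[1] is an OS errno; on common Unix systems 32 is EPIPE.
os_code = fields["nt[1]"]
print(os_code, errno.errorcode.get(os_code, "unknown"))
```

If that reading is right, the broken pipe matches the "lost contact" text: the process at the other end of the pipe went away before anything could be read.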
As you state all system (and kernel?) parameters are correct, I don't
quite know where to take it from here.
Did you try starting the services manually, one-by-one?
iirc, opmnctl verbose status will show you what (or s/status/getstate/g)
See if one of the others fails with a more meaningful error message

Regards,
Frank van Bortel
Top-posting is one way to shut me up...
Nick - 24 Feb 2006 21:20 GMT
I have found some processes that appear to be hung... the following is
output from ps -fu oracle:
UID PID PPID C STIME TTY TIME CMD
oracle 3845 1 0 15:29:18 ? 0:00 oraclempris10g (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle 3859 3855 0 15:33:08 pts/2 0:00 -ksh
oracle 418 1 0 14:02:12 ? 0:00 ora_pmon_mpris10g
oracle 420 1 0 14:02:12 ? 0:00 ora_mman_mpris10g
oracle 422 1 0 14:02:12 ? 0:01 ora_dbw0_mpris10g
oracle 424 1 0 14:02:12 ? 0:01 ora_lgwr_mpris10g
oracle 426 1 0 14:02:13 ? 0:04 ora_ckpt_mpris10g
oracle 428 1 0 14:02:13 ? 0:05 ora_smon_mpris10g
oracle 430 1 0 14:02:13 ? 0:00 ora_reco_mpris10g
The ? leads me to believe these processes are hung. The main issue is
that I cannot kill them because their parent process is PID 1 (init).
Any advice on how to proceed?
Jim Smith - 25 Feb 2006 06:51 GMT
>I have found some processes that appear to be hung... the following is
>output from ps -fu oracle:
[quoted text clipped - 15 lines]
>
>Any advice on how to proceed?
These are mostly oracle database background processes and are almost
certainly not hung. The ? just means they are not attached to a
terminal. Under no circumstances should they be killed.
The first one (PID 3845) is an oracle client shadow process and its
parent ought to be a sqlplus session or something similar and might be
hung.
These are probably not related to your problem.
If you want to get rid of the hung process, kill -9 3845 as root ought
to get rid of it and you can then bounce the database if you want.
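To make the distinction concrete, here is a rough sketch that sorts the ps listing into database background processes and the shadow process. It assumes the usual naming pattern (ora_<name>_<SID> for background processes, oracle<SID> for a server/shadow process); the helper names parse_ps and classify are invented for this example.

```python
# The ps -fu oracle listing from the post above, one process per line.
PS_OUTPUT = """\
UID PID PPID C STIME TTY TIME CMD
oracle 3845 1 0 15:29:18 ? 0:00 oraclempris10g (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle 3859 3855 0 15:33:08 pts/2 0:00 -ksh
oracle 418 1 0 14:02:12 ? 0:00 ora_pmon_mpris10g
oracle 420 1 0 14:02:12 ? 0:00 ora_mman_mpris10g
oracle 422 1 0 14:02:12 ? 0:01 ora_dbw0_mpris10g
oracle 424 1 0 14:02:12 ? 0:01 ora_lgwr_mpris10g
oracle 426 1 0 14:02:13 ? 0:04 ora_ckpt_mpris10g
oracle 428 1 0 14:02:13 ? 0:05 ora_smon_mpris10g
oracle 430 1 0 14:02:13 ? 0:00 ora_reco_mpris10g
"""


def parse_ps(text):
    """Turn the listing into dicts, skipping the header row."""
    rows = []
    for line in text.splitlines()[1:]:
        parts = line.split(None, 7)  # CMD may itself contain spaces
        if len(parts) < 8:
            continue
        _uid, pid, ppid, _c, _stime, tty, _time, cmd = parts
        rows.append({"pid": int(pid), "ppid": int(ppid), "tty": tty, "cmd": cmd})
    return rows


def classify(row):
    """Assumed naming: ora_* is a background process, oracle<SID> a shadow."""
    if row["cmd"].startswith("ora_"):
        return "background"
    if row["cmd"].startswith("oracle"):
        return "shadow"
    return "other"


rows = parse_ps(PS_OUTPUT)
for row in rows:
    # A TTY of "?" only means there is no controlling terminal.
    print(row["pid"], row["tty"], classify(row), row["cmd"])
```

Every row with a TTY of ? also has PPID 1, which is normal for daemon-style processes and says nothing about whether they are hung.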

Jim Smith
I'm afraid you've mistaken me for someone who gives a damn.
Nick - 27 Feb 2006 14:46 GMT
Thanks all for your reply. I've still found no resolution for this
issue. I've examined every potential resource shortfall I can think
of, and everything appears to be fine. I have noticed, however, that
an effective group ID has been assigned to my oracle user. I do not
recall seeing this in the past.
Below...
$ id
uid=101(oracle) gid=100(dba) egid=2(bin)
Could this be causing the problems I am having?
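As a side note for readers, id normally prints an egid= field only when the effective group differs from the real one, so its presence alone is worth noticing. A small sketch for checking this from captured id output follows; parse_id is a hypothetical helper, not a standard tool.

```python
import re


def parse_id(output):
    """Parse 'name=number(label)' fields from id output into a dict."""
    fields = re.findall(r"(\w+)=(\d+)\((\w+)\)", output)
    return {name: (int(number), label) for name, number, label in fields}


ids = parse_id("uid=101(oracle) gid=100(dba) egid=2(bin)")

# An egid entry that differs from gid means the shell is running with an
# effective group other than the user's real group.
mismatch = "egid" in ids and ids["egid"] != ids["gid"]
print(ids, "effective group differs:", mismatch)
```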
Frank van Bortel - 27 Feb 2006 18:15 GMT
> Thanks all for your reply. I've still found no resolution for this
> issue. I've examined every potential resource shortfall I can think
[quoted text clipped - 8 lines]
>
> Could this be causing the problems I am having?
Yes

Regards,
Frank van Bortel
Top-posting is one way to shut me up...
Nick - 28 Feb 2006 15:26 GMT
Thank you all for your help and insight. Turns out that the
permissions have gotten out of whack on this box. The oracle user was
using /bin/ksh, which somehow had a setgid bit in its permissions,
and the group owner was bin, hence my egid of 2 (bin). I switched the
oracle user over to /bin/sh and was able to bring AS10g up with no
issues.
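For anyone who lands here with the same symptom, this is a minimal sketch of checking a shell binary for the setgid bit. The paths come from this thread and may not exist on other systems; has_setgid and describe are names made up for the example.

```python
import os
import stat


def has_setgid(mode):
    """True if the setgid bit is set in a st_mode value."""
    return bool(mode & stat.S_ISGID)


def describe(path):
    """Summarise mode string, group owner, and setgid state for a file."""
    st = os.stat(path)
    return {
        "path": path,
        "mode": stat.filemode(st.st_mode),
        "gid": st.st_gid,
        "setgid": has_setgid(st.st_mode),
    }


if __name__ == "__main__":
    # Shells mentioned in this thread; skipped if not present.
    for shell in ("/bin/ksh", "/bin/sh"):
        if os.path.exists(shell):
            print(describe(shell))
```

A setgid executable shows an s in the group execute position of the mode string, for example -rwxr-sr-x, and a process started from it runs with the file's group as its effective group.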
I was able to circumvent the issue - but still can't figure out why the
permissions went bad.
Thanks again....
//NC
Nick - 24 Feb 2006 21:23 GMT
I have found some processes that appear to be hung... the following is
output from ps -fu oracle:
UID PID PPID C STIME TTY TIME CMD
oracle 3845 1 0 15:29:18 ? 0:00 oraclempris10g (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
oracle 3859 3855 0 15:33:08 pts/2 0:00 -ksh
oracle 418 1 0 14:02:12 ? 0:00 ora_pmon_XXXX
oracle 420 1 0 14:02:12 ? 0:00 ora_mman_XXXX
oracle 422 1 0 14:02:12 ? 0:01 ora_dbw0_XXXX
oracle 424 1 0 14:02:12 ? 0:01 ora_lgwr_XXXX
oracle 426 1 0 14:02:13 ? 0:04 ora_ckpt_XXXX
oracle 428 1 0 14:02:13 ? 0:05 ora_smon_XXXX
oracle 430 1 0 14:02:13 ? 0:00 ora_reco_XXXX
The ? leads me to believe these processes are hung. The main issue is
that I cannot kill them because their parent process is PID 1 (init).
Any advice on how to proceed?