|RHEL4 Spontaneous Reboot - We have an Oracle server running RHEL4 AS Update 5. It is running on a Dell 2970 and has 32 GB of RAM. There are two Oracle9i instalces and one 10g instance, all running just fine. (Except for the pesky reboots for no reason). More pertinent info:|
[root@servername spool]# uname -a
Linux servername 2.6.9-55.ELsmp #1 SMP Fri Apr 20 16:36:54 EDT 2007 x86_64 x86_64 x86_64 GNU/Linux
[root@servername spool]# cat /etc/redhat-release
Red Hat Enterprise Linux AS release 4 (Nahant Update 5)
[root@servername spool]# free
total used free shared buffers cached
Mem: 32889764 32698256 191508 0 297168 31152780
-/+ buffers/cache: 1248308 31641456
Swap: 67108856 1520 67107336
About a year ago, we moved this from an IBM Blade with 8GB RAM tot he current hardware with more RAM and 64 Bit OS. There were no unexplained reboots while on the Blade Center platforms.
The server has a DRAC which shows nothing wrong with the hardware. Also, there are dual power supplies and a couple of PDU's which show nothing wrong in the logs. (Anyway, if there were a problem with the PDU's, it would have to be both and other servers would be affected, which they are not. ) Using the DRAC, you can see the last three boot sequences, (ten second snapshots), and the last three POSTs (Power on self test). One thing about the reboot, it shows the message that the system was shut down uncleanly, which is a common problem you see when the power is pulled. Only nothing anywhere indicates we have lost power.
And finally, the log files, (/var/log/messages, and such), do not show anything out of the ordinary until entries begin to appear indicating the system has rebooted.
It happens pretty rarely, but it does continue to happen.
Any help with this would be appreciated