Server crash - possible corrupt RAID? - can anyone help?
At some point over the weekend our MIS server which is running Server 2003 R2 crashed with a BSOD reporting a dump of physical memory. It did have the BSOD when I last shut it down but on power up last time it came back up fine so I didn't worry about it.
This time, it started by getting stuck at the screen showing the RAID arrangement. I could see all the drives looked ok but it would go no further. Powered down, reset all cables, memory, etc and on next boot it got to the Windows Server animated splash screen with the moving bar but after 10 minutes, no further. Tried last known good config and safe mode with no success. Get stuck at loading acpitabl.dat. Researched, could be various things but some say RAID or CMOS battery. Tried changing battery.
I take a backup of the entire server every night using NTBackup and do a weekly ASR. Used Recovery Console to try to restore the ASR but it seems to freeze at the 'examining XXXXMB disk space' part. It does the same if you go through the process of trying to re-install Windows or Recovery Console. Ideally I wanted to use RC to run chkdsk.
I'm suspecting either a corrupt RAID or possible faulty hard drive(s). I've noticed that only the first LED flashes lots out of the 4 drives (running RAID5 + spare) The rest are mostly solid.
I tried to test the memory to eliminate but test would have taken 6 hours so I’ve gone home now leaving it on the 'examining XXXXMB disk space' part.
If it's still like that in the morning I would guess I’m going to have to rebuild the RAID and try and recover from backup.
Has anyone got any advise/thoughts/suggestions?
If I select the option to rebuild the RAID, what does this actually do by default? Does it format the drives or does it try and recover any bad stripes, etc.
Thanks very much in advance for any help that can be offered.