I found an instance of an ECC DIMM error in the logs just after the reboot:
Wed Jun 1 05:17:02 GMT [cecc_log.entry:warning]: 1 Correctable ECC error on DIMM J41 at bit D55
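
In case it is useful for checking whether the same DIMM keeps reporting errors, here is a rough Python sketch that tallies correctable ECC entries per DIMM slot from a saved copy of the messages log. The regular expression is modeled only on the single line quoted above, and the name tally_cecc is hypothetical (it is not anything ONTAP provides), so other releases may need a different pattern.

```python
import re
from collections import Counter

# Pattern modeled on the one log line quoted above (assumption: other
# ONTAP releases may word or format this message differently).
CECC_RE = re.compile(
    r"\[cecc_log\.entry:warning\]:\s+(\d+) Correctable ECC errors? "
    r"on DIMM (\S+) at bit (\S+)"
)

def tally_cecc(lines):
    """Return a Counter mapping DIMM slot -> total correctable ECC errors."""
    counts = Counter()
    for line in lines:
        match = CECC_RE.search(line)
        if match:
            # group(1) is the error count, group(2) is the DIMM slot
            counts[match.group(2)] += int(match.group(1))
    return counts

sample = [
    "Wed Jun 1 05:17:02 GMT [cecc_log.entry:warning]: "
    "1 Correctable ECC error on DIMM J41 at bit D55",
]
print(tally_cecc(sample))
```

A slot whose count keeps climbing across reboots would be the first one I would ask support about.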
On Wed, Jun 01, 2005 at 11:23:35AM -0700, Tavis Gustafson wrote:
Using ONTAP 6.4.5 with one DS14 shelf over copper FC.
Last night one of our F840s rebooted itself twice via "watchdog reset".
Either before or after the second reboot notification mail was sent, the filer froze up with the front LCD panel stuck at some NFS ops/sec number. After a hard reboot it came up fine. I disabled the watchdog timer and the machine stayed up.
Has anyone experienced multiple watchdog timer resets, or does anyone know what type of hardware failure the watchdog watches for? Also, how bad is it to keep the watchdog turned off?
Thanks, Tavis