[fm-discuss] Re: fault management error
Michael Shapiro
mws at zion.eng.sun.com
Fri Dec 1 09:37:25 PST 2006
> I ran the following commands (after noticing that /var
> is full)
>
> ######################################################
> bash$ fmadm faulty
> STATE RESOURCE / UUID
> --------
> ----------------------------------------------------------------------
> degraded mem:///component=Slot,A:J3001
> 2f14680f-5aa7-cee7-b760-ccd72f1595f5
> --------
> ----------------------------------------------------------------------
> #########################################################
>
> $ fmdump -v -u 2f14680f-5aa7-cee7-b760-ccd72f1595f5
> TIME UUID
> SUNW-MSG-ID
> Nov 23 14:07:39.0443
> 2f14680f-5aa7-cee7-b760-ccd72f1595f5 SUN4U-8000-2S
> 95% fault.memory.dimm
> FRU: mem:///component=Slot,A:J3001
> rsrc: mem:///component=Slot,A:J3001
>
>
> ###########################################################
>
> I understand from this output that a memory slot is
> 'degraded'.
>
> Please note that I do have a support contract with Sun
> but I need to know before opening a case if the
> problem is minor or can be fixed by a simple command.
> Also, will a simple reboot fix the problem??
>
> Many thanks and regards,
The article at sun.com/msg/SUN4U-8000-2S will tell you what to do.
In this case you have a DIMM which has gone bad, identified by its
slot and J number location shown above. You should shut down the
system when you can and replace that DIMM. Run "fmadm faulty" on
a subsequent reboot to confirm that we detected the DIMM replacements;
on some legacy platforms we can't, so you'll need to run
"fmadm repair 2f14680f-5aa7-cee7-b760-ccd72f1595f5" afterward.
-Mike
--
Mike Shapiro, Solaris Kernel Development. blogs.sun.com/mws/
More information about the fm-discuss
mailing list