[fm-discuss] Re: fault management error

Michael Shapiro mws at zion.eng.sun.com
Fri Dec 1 09:37:25 PST 2006


 
> I ran the following commands (after noticing that /var
> is full)
> 
> ######################################################
> bash$ fmadm faulty
>    STATE RESOURCE / UUID
> --------
> ----------------------------------------------------------------------
> degraded mem:///component=Slot,A:J3001
>          2f14680f-5aa7-cee7-b760-ccd72f1595f5
> --------
> ----------------------------------------------------------------------
> #########################################################
> 
> $ fmdump -v -u 2f14680f-5aa7-cee7-b760-ccd72f1595f5
> TIME                 UUID                             
>    SUNW-MSG-ID
> Nov 23 14:07:39.0443
> 2f14680f-5aa7-cee7-b760-ccd72f1595f5 SUN4U-8000-2S
>    95%  fault.memory.dimm
>          FRU: mem:///component=Slot,A:J3001
>         rsrc: mem:///component=Slot,A:J3001
> 
> 
> ###########################################################
> 
> I understand from this output that a memory slot is
> 'degraded'.
> 
> Please note that I do have a support contract with Sun
> but I need to know before opening a case if the
> problem is minor or can be fixed by a simple command.
> Also, will a simple reboot fix the problem??
> 
> Many thanks and regards,

The article at sun.com/msg/SUN4U-8000-2S will tell you what to do.
In this case you have a DIMM which has gone bad, identified by its
slot and J number location shown above.  You should shut down the
system when you can and replace that DIMM.  Run "fmadm faulty" on
a subsequent reboot to confirm that we detected the DIMM replacements;
on some legacy platforms we can't, so you'll need to run
"fmadm repair 2f14680f-5aa7-cee7-b760-ccd72f1595f5" afterward.

-Mike

-- 
Mike Shapiro, Solaris Kernel Development. blogs.sun.com/mws/



More information about the fm-discuss mailing list