[fm-discuss] many fmd errors on T2000

Robert Milkowski rmilkowski at task.gda.pl
Tue Jun 20 12:27:22 PDT 2006


Hello Gavin,

Tuesday, June 20, 2006, 6:05:07 PM, you wrote:

GM> Hi Robert,

GM> On 06/19/06 10:57, Robert Milkowski wrote:
>> Hi.
>> 
>> snv_39 sun4v (T2000)
>> 
>> What are below errors about?
GM> [cut]
>> bash-3.00# fmdump -e| tail -10
>> Jun 19 08:52:08.9100 ereport.io.fire.pec.re
>> Jun 19 08:52:08.9116 ereport.io.fire.pec.btp
>> Jun 19 10:32:06.4630 ereport.io.fire.pec.re
>> Jun 19 10:32:06.4638 ereport.io.fire.pec.btp
>> Jun 19 10:32:37.5821 ereport.io.fire.pec.re
>> Jun 19 10:32:37.5828 ereport.io.fire.pec.btp
>> Jun 19 11:04:33.9641 ereport.io.fire.pec.re
>> Jun 19 11:04:33.9648 ereport.io.fire.pec.btp
>> Jun 19 11:07:04.6300 ereport.io.fire.pec.re
>> Jun 19 11:07:04.6316 ereport.io.fire.pec.btp
>> bash-3.00#

GM> My pcie-enabled colleague says:

GM>         These are correctable errors on the PCIe link (receiver errors and Bad TLPs).
GM>         Sounds like a bad link. The transactions are being retried successfully (for now).

GM>         Actually the spec specifically says that if the Physical Layer detects a
GM>         receiver error, then the Link Layer must not also report the error as a bad TLP,
GM>         so there looks to be a chip error handling bug too.

GM> All those ereports should have resulted in some diagnosis.  What does
GM> 'fmadm faulty' report, and what does 'fmdump -v' show?


bash-3.00# fmadm faulty
   STATE RESOURCE / UUID
-------- ----------------------------------------------------------------------
bash-3.00# fmdump -v
TIME                 UUID                                 SUNW-MSG-ID
fmdump: /var/fm/fmd/fltlog is empty
bash-3.00#


bash-3.00# fcinfo hba-port -l
HBA Port WWN: 210000e08b825c30
        OS Device Name: /dev/cfg/c2
        Manufacturer: QLogic Corp.
        Model: 375-3108-xx
        Firmware Version: 3.3.18
        FCode/BIOS Version: 1
        Type: L-port
        State: online
        Supported Speeds: 1Gb 2Gb
        Current Speed: 2Gb
        Node WWN: 200000e08b825c30
        Link Error Statistics:
                Link Failure Count: 0
                Loss of Sync Count: 0
                Loss of Signal Count: 0
                Primitive Seq Protocol Error Count: 0
                Invalid Tx Word Count: 0
                Invalid CRC Count: 0
HBA Port WWN: 210100e08ba25c30
        OS Device Name: /dev/cfg/c3
        Manufacturer: QLogic Corp.
        Model: 375-3108-xx
        Firmware Version: 3.3.18
        FCode/BIOS Version: 1
        Type: L-port
        State: online
        Supported Speeds: 1Gb 2Gb
        Current Speed: 2Gb
        Node WWN: 200100e08ba25c30
        Link Error Statistics:
                Link Failure Count: 0
                Loss of Sync Count: 0
                Loss of Signal Count: 0
                Primitive Seq Protocol Error Count: 0
                Invalid Tx Word Count: 0
                Invalid CRC Count: 0
bash-3.00#


Also nothing in logs.


-- 
Best regards,
 Robert                            mailto:rmilkowski at task.gda.pl
                                       http://milek.blogspot.com




More information about the fm-discuss mailing list