[fm-discuss] many fmd errors on T2000
Robert Milkowski
rmilkowski at task.gda.pl
Tue Jun 20 12:27:22 PDT 2006
Hello Gavin,
Tuesday, June 20, 2006, 6:05:07 PM, you wrote:
GM> Hi Robert,
GM> On 06/19/06 10:57, Robert Milkowski wrote:
>> Hi.
>>
>> snv_39 sun4v (T2000)
>>
>> What are below errors about?
GM> [cut]
>> bash-3.00# fmdump -e| tail -10
>> Jun 19 08:52:08.9100 ereport.io.fire.pec.re
>> Jun 19 08:52:08.9116 ereport.io.fire.pec.btp
>> Jun 19 10:32:06.4630 ereport.io.fire.pec.re
>> Jun 19 10:32:06.4638 ereport.io.fire.pec.btp
>> Jun 19 10:32:37.5821 ereport.io.fire.pec.re
>> Jun 19 10:32:37.5828 ereport.io.fire.pec.btp
>> Jun 19 11:04:33.9641 ereport.io.fire.pec.re
>> Jun 19 11:04:33.9648 ereport.io.fire.pec.btp
>> Jun 19 11:07:04.6300 ereport.io.fire.pec.re
>> Jun 19 11:07:04.6316 ereport.io.fire.pec.btp
>> bash-3.00#
GM> My pcie-enabled colleague says:
GM> These are correctable errors on the PCIe link (receiver errors and Bad TLPs).
GM> Sounds like a bad link. The transactions are being retried successfully (for now).
GM> Actually the spec specifically says that if the Physical Layer detects a
GM> receiver error, then the Link Layer must not also report the error as a bad TLP,
GM> so there looks to be a chip error handling bug too.
GM> All those ereports should have resulted in some diagnosis. What does
GM> 'fmadm faulty' report, and what does 'fmdump -v' show?
bash-3.00# fmadm faulty
STATE RESOURCE / UUID
-------- ----------------------------------------------------------------------
bash-3.00# fmdump -v
TIME UUID SUNW-MSG-ID
fmdump: /var/fm/fmd/fltlog is empty
bash-3.00#
bash-3.00# fcinfo hba-port -l
HBA Port WWN: 210000e08b825c30
OS Device Name: /dev/cfg/c2
Manufacturer: QLogic Corp.
Model: 375-3108-xx
Firmware Version: 3.3.18
FCode/BIOS Version: 1
Type: L-port
State: online
Supported Speeds: 1Gb 2Gb
Current Speed: 2Gb
Node WWN: 200000e08b825c30
Link Error Statistics:
Link Failure Count: 0
Loss of Sync Count: 0
Loss of Signal Count: 0
Primitive Seq Protocol Error Count: 0
Invalid Tx Word Count: 0
Invalid CRC Count: 0
HBA Port WWN: 210100e08ba25c30
OS Device Name: /dev/cfg/c3
Manufacturer: QLogic Corp.
Model: 375-3108-xx
Firmware Version: 3.3.18
FCode/BIOS Version: 1
Type: L-port
State: online
Supported Speeds: 1Gb 2Gb
Current Speed: 2Gb
Node WWN: 200100e08ba25c30
Link Error Statistics:
Link Failure Count: 0
Loss of Sync Count: 0
Loss of Signal Count: 0
Primitive Seq Protocol Error Count: 0
Invalid Tx Word Count: 0
Invalid CRC Count: 0
bash-3.00#
Also nothing in logs.
--
Best regards,
Robert mailto:rmilkowski at task.gda.pl
http://milek.blogspot.com
More information about the fm-discuss
mailing list