广告位招租
广告位招租 联系qq:165345322

freeBSD6.2下ahd老报错的解决方法

时间:2008-05-08 19:36:44   来源:freebsdunix  作者:  点击:次  出处:技术无忧
关键字:freebsd adh hd1 linux

+ahd1: Recovery Initiated - Card was not paused
+>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
+ahd1: Dumping Card State at program address 0x6 Mode 0x33
+INTSTAT[0x0] SELOID[0x0] SELID[0x0] HS_MAILBOX[0x0]
+INTCTL[0x80]:(SWTMINTMASK) SEQINTSTAT[0x0] SAVED_MODE[0x11]
+DFFSTAT[0x33]:(CURRFIFO_NONE|FIFO0FREE|FIFO1FREE)
+SCSISIGI[0x0]:(P_DATAOUT) SCSIPHASE[0x0] SCSIBUS[0x0]
+LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE) SCSISEQ0[0x0]
+SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI) SEQCTL0[0x0] SEQINTCTL[0x0]
+SEQ_FLAGS[0x0] SEQ_FLAGS2[0x0] QFREEZE_COUNT[0x4]
+KERNEL_QFREEZE_COUNT[0x4] MK_MESSAGE_SCB[0xff00] MK_MESSAGE_SCSIID[0xff]
+SSTAT0[0x0] SSTAT1[0x8]:(BUSFREE) SSTAT2[0x0] SSTAT3[0x0]
+PERRDIAG[0x0] SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
+LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0]
+LQOSTAT1[0x0] LQOSTAT2[0xe1]:(LQOSTOP0|LQOPKT)
+
+SCB Count = 16 CMDS_PENDING = 1 LASTSCB 0xffff CURRSCB 0xf NEXTSCB 0xff00
+qinstart = 31383 qinfifonext = 31383
+QINFIFO:
+WAITING_TID_QUEUES:
+Pending list:
+  7 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
+Total 1
+Kernel Free SCB list: 15 1 2 3 4 5 6 8 9 10 11 12 13 14 0
+Sequencer Complete DMA-inprog list:
+Sequencer Complete list:
+Sequencer DMA-Up and Complete list:
+Sequencer On QFreeze and Complete list:
+
+
+ahd1: FIFO0 Free, LONGJMP == 0x825e, SCB 0xf
+SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
+SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
+SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
+SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
+HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
+
+ahd1: FIFO1 Free, LONGJMP == 0x8063, SCB 0x7
+SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
+SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
+SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
+SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
+HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
+LQIN: 0x8 0x0 0x0 0xf 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
+ahd1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
+ahd1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x1
+ahd1: SAVED_SCSIID = 0x0 SAVED_LUN = 0x0
+
+SIMODE0[0xc]:(ENOVERRUN|ENIOERR)
+CCSCBCTL[0x0]
+ahd1: REG0 == 0x7, SINDEX = 0x192, DINDEX = 0x102
+ahd1: SCBPTR == 0xf, SCB_NEXT == 0xff00, SCB_NEXT2 == 0xff25
+CDB 28 0 6 80 90 b6
+STACK: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
+<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
+(da1:ahd1:0:1:0): SCB 7 - timed out
+(da1:ahd1:0:1:0): Queuing a BDR SCB
+ahd1: ILLEGAL_PHASE 0x80
+ahd1:A:1:0: Attempt to issue message failed
+(da1:ahd1:0:1:0): no longer in timeout, status = 25b
+Copied 18 bytes of sense data offset 12: 0x70 0x0 0x6 0x0 0x0 0x0 0x0 0xa 0x0 0x0 0x0 0x0 0x29 0x1 0x1 0x0 0x0 0x0
+(da1:ahd1:0:1:0): WRITE(10). CDB: 2a 0 0 dd 3a 1f 0 0 20 0
+(da1:ahd1:0:1:0): CAM Status: SCSI Status Error
+(da1:ahd1:0:1:0): SCSI Status: Check Condition
+(da1:ahd1:0:1:0): UNIT ATTENTION asc:29,1
+(da1:ahd1:0:1:0): Power on occurred field replaceable unit: 1
+(da1:ahd1:0:1:0): Retrying Command (per Sense Data)

在出现错误的时候,系统近似死机。但过一段时间就可以正常工作。搜了很长时间,但是好象这个ahd很容易出问题。
  是不是硬盘的问题?

解:
ahd据我所知是最稳定的SCSI HBA之一,你遇到的这个问题看起来很象是硬盘(某些型号的Seagate的某些批次)和HBA配合的问题,有可能是在负载高的时候触发。尝试升级一下硬盘的firmware;如果做不了,尝试这个命令:

camcontrol tags da0 -N 32
camcontrol tags da1 -N 32

(根据硬盘数量类推)

如果有效,可以把这些配置加到 /etc/rc.early 里面。

还有就是检查数据线接触是否良好

此外,基于InfoTrend控制器的RAID阵列的某些版本也有类似的问题,通过配置tags可以绕过它。

 

请记住技术无忧网 中文域名 http://www.技术无忧.com


文章评论

共有 0 位网友发表了评论 此处只显示部分留言 点击查看完整评论页面

300x250广告位招租