Please help me redhat 9 aic7xxx

P. Larry Nelson lnelson at uiuc.edu
Mon Dec 15 14:52:00 PST 2003


jonathan soong wrote:
> 
> >>I hope someone can help me. I'm having trouble with aic7xxx connected to a
> >>Promise UltraTrak RM4000 external RAID 5 array on my Red Hat 9 box.
> >>
> >>
> >
> >Do you have the latest firmware for your UltraTrak?  If not, that
> >is where you should start.  I've seen countless bug reports against
> >these boxes that were resolved by updating the firmware.
> >
> Yup, I saw the posts about firmware and that was the first thing I tried:
> I upgraded to the latest version before I did anything else (from
> memory, rev 1.0.0.17). :( Besides this, it was a brand new Red Hat 9
> install. The SCSI card is an Adaptec 29160 Ultra160 SCSI adapter
> connected to the Promise UltraTrak RM4000.
> 
> It is still not working and we're running off the internal IDE! :(

But have you reported this to Promise Tech Support?
I was running the latest firmware, too, and was having problems (different
from yours) until I emailed their tech support.  Two days later they sent
me a beta firmware that fixed my problem.
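
If it helps when you write to them, a condensed summary of the recovery
events may be easier for support to read than the full dump. Here is a rough
Python sketch, not a tested tool: the patterns are copied from the messages
you quoted, the function name summarize() is made up for illustration, and
other driver revisions may word these messages differently.

```python
import re
from collections import Counter

# Patterns copied from the log lines quoted in this thread.
# Assumption: other aic7xxx driver revisions may phrase these differently.
PATTERNS = {
    "recovery_initiated": re.compile(r"Recovery Initiated"),
    "scb_timed_out": re.compile(r"SCB (0x[0-9a-f]+) - timed out"),
    "bus_device_reset": re.compile(r"Bus Device Reset on"),
    "bus_reset": re.compile(r"Issued Channel \w Bus Reset"),
}


def summarize(lines):
    """Count recovery events and collect the SCB ids that timed out."""
    counts = Counter()
    timed_out = []
    for line in lines:
        for name, pattern in PATTERNS.items():
            match = pattern.search(line)
            if match:
                counts[name] += 1
                if name == "scb_timed_out":
                    timed_out.append(match.group(1))
    return counts, timed_out


# A few lines from the quoted log (wrapped lines rejoined by hand).
SAMPLE = """\
Dec 10 17:48:58 ablettjnr kernel: scsi0: Recovery Initiated
Dec 10 17:48:58 ablettjnr kernel: (scsi0:A:0:0): SCB 0xf - timed out
Dec 10 17:48:58 ablettjnr kernel: (scsi0:A:0:0): Bus Device Reset Message Sent
Dec 10 17:48:58 ablettjnr kernel: scsi0: Bus Device Reset on A:0. 32 SCBs aborted
Dec 10 17:49:58 ablettjnr kernel: (scsi0:A:0:0): SCB 0x1d - timed out
""".splitlines()

counts, timed_out = summarize(SAMPLE)
print(dict(counts), timed_out)
```

To run it over the real log instead of the sample, pass an open file, e.g.
summarize(open("/var/log/messages")), assuming that is where your syslog
writes kernel messages.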

- Larry

> Original Message Follows:
> 
> Hi there
> 
> I hope someone can help me. I'm having trouble with aic7xxx connected to a
> Promise UltraTrak RM4000 external RAID 5 array on my Red Hat 9 box.
> 
> I have tried both the stock Red Hat 9 drivers and the 6.3.3 drivers found
> at http://people.freebsd.org/~gibbs/linux/RPM/.
> 
> In both cases the module loads fine during boot (I can see the module in
> lsmod, and I can access the SCSI disk using 'fdisk /dev/sda').
> 
> I then use fdisk to create a partition on /dev/sda; this is fine too.
> 
> When I try to format the partition (mkfs.ext3 /dev/sda1), the machine hangs on
> 'Writing inodes'.
> 
> After a while I get a whole bunch of SCSI errors. They look like this:
>  <<<<<<<<<<<Dump Card State Ends >>>>>>>>>
>  (scsi0:A:0:0): SCB 0xf - timed out
>    sg[0] - Addr 0x57ea000 : Length 4096
>   Recovery SCB completes
>   scsi0: Issued Channel A Bus Reset. 32 SCBs aborted
> 
> -- SEE BELOW FOR BETTER PRINTOUT
> 
> Things i have tried:
> 1. Red Hat 9 default aic7xxx drivers and 6.3.3 drivers (installed from RPM)
> 2. Updating the firmware on the Promise UltraTrak RM4000
> 3. Fedora (the 6.3.3 RPMs did not work)
> 4. Resetting the Adaptec to defaults (going through the BIOS, 'Ctrl-A')
> 5. Resetting the machine's BIOS to defaults
> 6. 2 or 3 different SCSI cables
> 7. 2 or 3 different SCSI terminators
> 
> I would be most grateful if anyone could shed some light on this
> situation. I have been at this for a couple of days now and it's really starting
> to hurt!
> 
> Kind Regards
> 
> Jon
> 
> OUTPUT OF DMESG:
> 
>  SCSI subsystem driver Revision: 1.00
>  PCI: Found IRQ 10 for device 05:09.0
>  PCI: Sharing IRQ 10 with 00:1f.1
>  scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.3.3
>         <Adaptec 29160 Ultra160 SCSI adapter>
>         aic7892: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs
> 
> BELOW IS WHAT COMES UP ON THE SCREEN ONCE I'VE STARTED mkfs.ext3 /dev/sda1 and
> it has paused during 'Writing inodes' (I've shortened some of it where the same
> message was being printed out):
> 
> Dec 10 17:48:58 ablettjnr kernel: scsi0: Recovery Initiated
> Dec 10 17:48:58 ablettjnr kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins
> <<<<<<<<<<<<<<<<<
> Dec 10 17:48:58 ablettjnr kernel: scsi0: Dumping Card State while idle, at
> SEQADDR 0x9
> Dec 10 17:48:58 ablettjnr kernel: Card was paused
> Dec 10 17:48:58 ablettjnr kernel: ACCUM = 0x4, SINDEX = 0x64, DINDEX = 0x65,
> ARG_2 = 0x4
> Dec 10 17:48:58 ablettjnr kernel: HCNT = 0x0 SCBPTR = 0x19
> Dec 10 17:48:58 ablettjnr kernel: SCSIPHASE[0x0] SCSISIGI[0x0] ERROR[0x0]
> SCSIBUSL[0x0]
> Dec 10 17:48:58 ablettjnr kernel: LASTPHASE[0x1]:(P_BUSFREE)
> SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI)
> Dec 10 17:48:58 ablettjnr kernel: SBLKCTL[0x6]:(SELWIDE|ENAB20) SCSIRATE[0x0]
> SEQCTL[0x10]:(FASTMODE)
> Dec 10 17:48:58 ablettjnr kernel: SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED)
> SSTAT0[0x0]
> Dec 10 17:48:58 ablettjnr kernel: SSTAT1[0x8]:(BUSFREE) SSTAT2[0x0] SSTAT3[0x0]
> SIMODE0[0x8]:(ENSWRAP)
> Dec 10 17:48:58 ablettjnr kernel: SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
> SXFRCTL0[0x80]:(DFON)
> Dec 10 17:48:58 ablettjnr kernel: DFCNTRL[0x0]
> DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
> Dec 10 17:48:58 ablettjnr kernel: STACK: 0x0 0x164 0x179 0x3
> Dec 10 17:48:58 ablettjnr kernel: SCB count = 35
> Dec 10 17:48:58 ablettjnr kernel: Kernel NEXTQSCB = 16
> Dec 10 17:48:58 ablettjnr kernel: Card NEXTQSCB = 16
> Dec 10 17:48:58 ablettjnr kernel: QINFIFO entries:
> Dec 10 17:48:58 ablettjnr kernel: Waiting Queue entries:
> Dec 10 17:48:58 ablettjnr kernel: Disconnected Queue entries: 25:17 24:18 23:19
> 22:10 21:11 20:12 19:13 18:14 17:5 16:6 15:7 14:8 13:9 12:0 11:4 9:2 10:1 8:3
> 7:32 6:33 5:34 4:25 3:26 1:27 2:28 0:29 31:20 30:21 29:22 28:23 27:24 26:15
> Dec 10 17:48:58 ablettjnr kernel: QOUTFIFO entries:
> Dec 10 17:48:58 ablettjnr kernel: Sequencer Free SCB List:
> Dec 10 17:48:58 ablettjnr kernel: Sequencer SCB Info:
> Dec 10 17:48:58 ablettjnr kernel:   0
> SCB_CONTROL[0x64]:(DISCONNECTED|TAG_ENB|DISCENB) SCB_SCSIID[0x7]
> Dec 10 17:48:58 ablettjnr kernel: SCB_LUN[0x0] SCB_TAG[0x1d]
> Dec 10 17:48:58 ablettjnr kernel:  15 SCB_CONTROL[0x60]:(TAG_ENB|DISCENB)
> SCB_SCSIID[0x7] SCB_LUN[0x0]
> 
> 
> **********NOTE:: the above lines repeated hundreds of times...
> 
> 
> Dec 10 17:48:58 ablettjnr kernel: Kernel Free SCB list: 31 30
> Dec 10 17:48:58 ablettjnr kernel: scsi0: Host Status: Failed(0)
> Dec 10 17:48:58 ablettjnr kernel: DevQ(0:0:0): 0 waiting
> Dec 10 17:48:58 ablettjnr kernel: DevQ(0:1:0): 0 waiting
> Dec 10 17:48:58 ablettjnr kernel:
> Dec 10 17:48:58 ablettjnr kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends
> >>>>>>>>>>>>>>>>>>
> Dec 10 17:48:58 ablettjnr kernel: (scsi0:A:0:0): SCB 0xf - timed out
> Dec 10 17:48:58 ablettjnr kernel: sg[0] - Addr 0x57ea000 : Length 4096
> 
> 
> **********NOTE:: the above lines repeated hundreds of times... only the number and
> Addr change
> 
> 
> Dec 10 17:48:58 ablettjnr kernel: sg[101] - Addr 0x5778000 : Length 4096
> Dec 10 17:48:58 ablettjnr kernel: (scsi0:A:0:0): Queuing a BDR SCB
> Dec 10 17:48:58 ablettjnr kernel: (scsi0:A:0:0): Bus Device Reset Message Sent
> Dec 10 17:48:58 ablettjnr kernel: Recovery SCB completes
> Dec 10 17:48:58 ablettjnr kernel: scsi0: Bus Device Reset on A:0. 32 SCBs
> aborted
> Dec 10 17:49:58 ablettjnr kernel: 503b000 : Length 4096
> Dec 10 17:49:58 ablettjnr kernel: sg[12] - Addr 0x503a000 : Length 4096
> 
> 
> **********NOTE:: the above lines repeated hundreds of times... only the number and
> Addr change
> 
> 
> Dec 10 17:49:58 ablettjnr kernel: sg[98] - Addr 0x4fd8000 : Length 4096
> Dec 10 17:49:58 ablettjnr kernel: (scsi0:A:0:0): Other SCB Timeout
> Dec 10 17:49:58 ablettjnr kernel: (scsi0:A:0:0): SCB 0x1d - timed out
> 
> _______________________________________________
> aic7xxx at freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/aic7xxx
> To unsubscribe, send any mail to "aic7xxx-unsubscribe at freebsd.org"


-- 
P. Larry Nelson (217-244-9855) | Systems/Network Administrator
461 Loomis Lab                 | U of I, CITES Departmental Services
1110 W. Green St., Urbana, IL  | Consultant to: High Energy Physics Group
MailTo:lnelson at uiuc.edu        | http://www.uiuc.edu/ph/www/lnelson
-------------------------------------------------------------------------
 "Information without accountability is just noise."  - P.L. Nelson

