iscsi over HAST backed storage partial success

Kevin Day toasty at dragondata.com
Tue Apr 6 17:34:15 UTC 2010


On Mar 10, 2010, at 2:57 PM, Pawel Jakub Dawidek wrote:

> On Tue, Mar 09, 2010 at 05:03:41PM -0600, Kevin Day wrote:
>> 
>> I'm running istgt (iscsi target) using HAST backed storage. For the most part, it seems to work really well. I have ucarp running to change the IP that istgt is bound to, and modified the ucarp scripts to start/stop istgt depending on which side is the master. If I shut down the primary, the secondary takes over and all seems well.
>> 
>> However, if I reboot the secondary, the primary starts freezing up for long periods:
>> 
>> Mar  9 22:46:27 cs04 hastd: [iscsi1] (primary) Unable to r: Socket is not connected.
>> Mar  9 22:46:27 cs04 hastd: [iscsi1] (primary) Unable to co: Connection refused.
>> Mar  9 22:46:42 cs04 last message repeated 3 times
>> Mar  9 22:46:53 cs04 istgt[14298]: ABORT_TASK
>> Mar  9 22:47:35 cs04 last message repeated 3 times
>> Mar  9 22:48:02 cs04 hastd: [iscsi1] (primary) Unable to co: Operation timed out.
>> Mar  9 22:48:02 cs04 istgt[14298]: CmdSN(45748), OP=0x2a, ElapsedTime=74 cleared 
>> Mar  9 22:48:02 cs04 istgt[14298]: istgt_iscsi.c: 640:istgt_iscsi_write_pdu: ***ERROR*** iscsi_write() failed (errno=32)
>> Mar  9 22:48:02 cs04 istgt[14298]: istgt_iscsi.c:3327:istgt_iscsi_op_task: ***ERROR*** iscsi_write_pdu() failed
>> Mar  9 22:48:02 cs04 istgt[14298]: istgt_iscsi.c:3867:istgt_iscsi_execute: ***ERROR*** iscsi_op_task() failed  
>> Mar  9 22:48:02 cs04 istgt[14298]: istgt_iscsi.c:4337:worker: ***ERROR*** iscsi_execute() failed
>> Mar  9 22:48:02 cs04 istgt[14298]: CmdSN(490802), OP=0x2a, ElapsedTime=73 cleared
>> Mar  9 22:48:02 cs04 istgt[14298]: CmdSN(28387), OP=0x2a, ElapsedTime=73 cleared 
>> Mar  9 22:48:14 cs04 istgt[14298]: ABORT_TASK
>> Mar  9 22:48:52 cs04 last message repeated 2 times
>> Mar  9 22:49:22 cs04 hastd: [iscsi1] (primary) Unable to co: Operation timed out.
>> 
>> As soon as the secondary comes back online, everything starts behaving again and all is well.
> 
> Could you try the following patch?
> 
> 	http://people.freebsd.org/~pjd/patches/hastd_primary.c.patch
> 


Sorry for the long delay.

This does seem to fix that problem, yes. :)

-- Kevin



More information about the freebsd-fs mailing list