FreeBSD 9.2 UFS + GELI softdep_deallocate_dependencies: unrecovered I/O error

Alexey Tarasov me at lexasoft.ru
Tue Oct 29 07:17:44 UTC 2013


Hello.

It seems that setting kern.bio_transient_maxcnt to 8k resolved the problem.
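
For anyone hitting the same panic, here is a minimal sketch of the arithmetic behind the advice quoted below (raise the tunable to 4-8 times its current value). The starting value of 1024 is only an assumption for illustration, not a number read from this server; check the real one with `sysctl kern.bio_transient_maxcnt`. The helper name `scaled_tunable_lines` is hypothetical. The printed lines are meant for /boot/loader.conf, since this is a boot-time tunable.

```python
TUNABLE = "kern.bio_transient_maxcnt"


def scaled_tunable_lines(current, factors=(4, 8)):
    """Return candidate loader.conf lines, one per multiplier."""
    if current <= 0:
        raise ValueError("current value must be positive")
    return ['{}="{}"'.format(TUNABLE, current * f) for f in factors]


if __name__ == "__main__":
    # 1024 is an assumed current value; substitute the output of
    # `sysctl -n kern.bio_transient_maxcnt` from the affected machine.
    for line in scaled_tunable_lines(1024):
        print(line)
```

With the assumed starting value, the upper candidate comes out at 8192, which would match the "8k" above if that is what was meant. A reboot is needed for a loader.conf change to take effect.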

On 27 Oct 2013, at 23:00, Alexey Tarasov <me at lexasoft.ru> wrote:

> Hello!
> 
> Ok, I’ll try this.
> So is this a software defect in FreeBSD 9.2?
> 
> On 27 Oct 2013, at 22:46, Konstantin Belousov <kostikbel at gmail.com> wrote:
> 
>> On Sat, Oct 26, 2013 at 01:47:18PM +0400, Alexey Tarasov wrote:
>>> Hello. 
>>> 
>>> I've upgraded the server to 9.2, and now it hangs after every 2-3 hours of intensive I/O to a UFS SUJ + GELI disk. On 9.1 everything worked fine for half a year.
>>> 
>>> g_vfs_done():da1.eli[WRITE(offset=614630752256, length=32768)]error = 11 
>>> g_vfs_done():da1.eli[WRITE(offset=614631211008, length=32768)]error = 11 
>>> g_vfs_done():da1.eli[WRITE(offset=614634815488, length=32768)]error = 11 
>>> g_vfs_done():da1.eli[WRITE(offset=614642319360, length=32768)]error = 11 
>>> g_vfs_done():da1.eli[WRITE(offset=614642909184, length=32768)]error = 11 
>>> g_vfs_done():da1.eli[WRITE(offset=614643007488, length=32768)]error = 11 
>>> g_vfs_done():da1.eli[WRITE(offset=614644875264, length=32768)]error = 11 
>>> g_vfs_done():da1.eli[WRITE(offset=550691995648, length=98304)]error = 11 
>>> g_vfs_done():da1.eli[WRITE(offset=550692519936, length=32768)]error = 11 
>>> g_vfs_done():da1.eli[WRITE(offset=550704152576, length=32768)]error = 11 
>>> /data/pgsql/data/base: got error 11 while accessing filesystem 
>>> panic: softdep_deallocate_dependencies: unrecovered I/O error 
>>> cpuid = 10 
>>> KDB: stack backtrace: 
>>> #0 0xffffffff80947986 at kdb_backtrace+0x66 
>>> #1 0xffffffff8090d9ae at panic+0x1ce 
>>> #2 0xffffffff80b3ff90 at clear_remove+0 
>>> #3 0xffffffff8098fb65 at brelse+0x75 
>>> #4 0xffffffff80990978 at bufdone+0x68 
>>> #5 0xffffffff8098c83e at biodone+0xae 
>>> #6 0xffffffff80872f4c at g_io_schedule_up+0xac 
>>> #7 0xffffffff808736ac at g_up_procbody+0x5c 
>>> #8 0xffffffff808db67f at fork_exit+0x11f 
>>> #9 0xffffffff80cdc23e at fork_trampoline+0xe 
>>> Uptime: 6d15h5m7s 
>>> Dumping 7664 out of 196573 MB:..1%..11%..21%..31%..41%..51%..61%..71%..81%..91% 
>>> 
>>> Full core.txt is here: http://lexasoft.ru/core.txt.1 
>>> 
>>> The server is an HP ProLiant DL180 G6 with a P410 RAID controller.
>> 
>> Look up your current value of kern.bio_transient_maxcnt and increase
>> it 4-8 times, using the same tunable.  If this helps, fine.  If not,
>> disable unmapped I/O with the vfs.unmapped_buf_allowed tunable.
>> 
>> The real solution is to convert GEOM classes like geli to use limited
>> transient mapping windows to access the data, thus adding support for
>> unmapped I/O to them.
> 

--
Alexey Tarasov

(\__/) 
(='.'=) 
E[: | | | | :]З 
(")_(")



More information about the freebsd-stable mailing list