FreeBSD 9.2 UFS + GELI softdep_deallocate_dependencies: unrecovered I/O error
Alexey Tarasov
me at lexasoft.ru
Tue Oct 29 07:17:44 UTC 2013
Hello.
It seems that setting kern.bio_transient_maxcnt to 8k has resolved the problem.
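
The advice quoted below is to take the current value of kern.bio_transient_maxcnt and raise it 4-8 times through the same tunable. A minimal sketch of that arithmetic follows, assuming the tunable is set from /boot/loader.conf. The starting value of 1024 is a hypothetical placeholder, not a number from this thread, and the helper names are made up for illustration. The real current value would come from running `sysctl kern.bio_transient_maxcnt` on the affected machine.

```python
# Sketch: derive candidate values for kern.bio_transient_maxcnt.
# ASSUMPTION: the starting value used below is a placeholder; read the
# real one with `sysctl kern.bio_transient_maxcnt` on the affected host.


def candidate_values(current, factors=(4, 8)):
    """Return the current value multiplied by each suggested factor."""
    if current <= 0:
        raise ValueError("current value must be positive")
    return [current * f for f in factors]


def loader_conf_line(value):
    """Format a loader.conf-style entry for the tunable."""
    return 'kern.bio_transient_maxcnt="{}"'.format(value)


if __name__ == "__main__":
    current_value = 1024  # hypothetical example, not taken from this thread
    for value in candidate_values(current_value):
        print(loader_conf_line(value))
```

Only one of the printed lines would go into /boot/loader.conf, followed by a reboot. If the larger value does not help, the fallback named in the quoted advice is the vfs.unmapped_buf_allowed tunable; presumably setting it to 0 is what disables unmapped I/O, but that is worth checking before relying on it.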
On 27 Oct 2013, at 23:00, Alexey Tarasov <me at lexasoft.ru> wrote:
> Hello!
>
> OK, I’ll try this.
> So is this a software defect in FreeBSD 9.2?
>
> On 27 Oct 2013, at 22:46, Konstantin Belousov <kostikbel at gmail.com> wrote:
>
>> On Sat, Oct 26, 2013 at 01:47:18PM +0400, Alexey Tarasov wrote:
>>> Hello.
>>>
>>> I've upgraded the server to 9.2, and now it hangs after every 2-3 hours of intensive I/O to a UFS SUJ + GELI disk. On 9.1 everything worked fine for half a year.
>>>
>>> g_vfs_done():da1.eli[WRITE(offset=614630752256, length=32768)]error = 11
>>> g_vfs_done():da1.eli[WRITE(offset=614631211008, length=32768)]error = 11
>>> g_vfs_done():da1.eli[WRITE(offset=614634815488, length=32768)]error = 11
>>> g_vfs_done():da1.eli[WRITE(offset=614642319360, length=32768)]error = 11
>>> g_vfs_done():da1.eli[WRITE(offset=614642909184, length=32768)]error = 11
>>> g_vfs_done():da1.eli[WRITE(offset=614643007488, length=32768)]error = 11
>>> g_vfs_done():da1.eli[WRITE(offset=614644875264, length=32768)]error = 11
>>> g_vfs_done():da1.eli[WRITE(offset=550691995648, length=98304)]error = 11
>>> g_vfs_done():da1.eli[WRITE(offset=550692519936, length=32768)]error = 11
>>> g_vfs_done():da1.eli[WRITE(offset=550704152576, length=32768)]error = 11
>>> /data/pgsql/data/base: got error 11 while accessing filesystem
>>> panic: softdep_deallocate_dependencies: unrecovered I/O error
>>> cpuid = 10
>>> KDB: stack backtrace:
>>> #0 0xffffffff80947986 at kdb_backtrace+0x66
>>> #1 0xffffffff8090d9ae at panic+0x1ce
>>> #2 0xffffffff80b3ff90 at clear_remove+0
>>> #3 0xffffffff8098fb65 at brelse+0x75
>>> #4 0xffffffff80990978 at bufdone+0x68
>>> #5 0xffffffff8098c83e at biodone+0xae
>>> #6 0xffffffff80872f4c at g_io_schedule_up+0xac
>>> #7 0xffffffff808736ac at g_up_procbody+0x5c
>>> #8 0xffffffff808db67f at fork_exit+0x11f
>>> #9 0xffffffff80cdc23e at fork_trampoline+0xe
>>> Uptime: 6d15h5m7s
>>> Dumping 7664 out of 196573 MB:..1%..11%..21%..31%..41%..51%..61%..71%..81%..91%
>>>
>>> Full core.txt is here: http://lexasoft.ru/core.txt.1
>>>
>>> The server is an HP ProLiant DL180 G6 with a P410 RAID controller.
>>
>> Check your current value of kern.bio_transient_maxcnt and increase
>> it by a factor of 4-8, using the same tunable. If this helps, fine. If not,
>> disable unmapped I/O with the vfs.unmapped_buf_allowed tunable.
>>
>> The real solution is to convert GEOM classes like geli to use limited
>> transient mapping windows to access the data, thus adding support for
>> unmapped I/O to them.
>
> --
> Alexey Tarasov
>
> (\__/)
> (='.'=)
> E[: | | | | :]З
> (")_(")
>
--
Alexey Tarasov
(\__/)
(='.'=)
E[: | | | | :]З
(")_(")
More information about the freebsd-stable mailing list