patch which implements ZFS LZ4 compression

Fabian Keil freebsd-listen at fabiankeil.de
Sun Feb 10 12:31:23 UTC 2013


Jeremy Chadwick <jdc at koitsu.org> wrote:

> On Sat, Feb 09, 2013 at 03:19:18PM +0100, Fabian Keil wrote:
> > Jeremy Chadwick <jdc at koitsu.org> wrote:
 
> > > If you want a PR for it, I'll file one, but all it's going to contain is
> > > the contents of this Email.
> > 
> > My impression is that your emails describe symptoms and contain
> > some speculation about what the cause might be. I didn't see any
> > sched traces, so it's unclear (to me) that priorities are actual
> > the problem.
> 
> They contain no speculation.
> 
> Bob Friesenhahn, who has a lot of experience and familiarity with ZFS on
> Solaris, seemed to know exactly the behaviour I described.  Others on
> FreeBSD have reported the same behaviour as well, just not in that
> thread circa 2011.

Similar symptoms can have different causes.
 
> Regarding "sched traces", please expand and include instructions.

I'm referring to the stuff that is fed into:
/usr/src/tools/sched/schedgraph.py

It can be created with ktrace and dtrace and I believe the
"documentation" is buried in the various "the scheduler sucks"
threads.

> > It's also unclear to me why the dedup and compression issues should
> > be related. There are lots of dedup performance issues reported for
> > Solaris as well and I doubt that they can be fixed for FreeBSD without
> > significantly deviating from the ZFS upstream.
> 
> What part of Bob's statement did you not understand?  Here, let me
> repeat it verbatim:
> 
> "Solaris solved the problem by putting the zfs writer threads into a 
> special scheduling class so that they are usually lower priority than 
> normal processing.  Before this change, a desktop system would become 
> almost unusable (intermittent loss of keyboard/mouse) while writing 
> lots of data with compression enabled.  Some NFS servers encountered 
> severe enough issues that NFS clients reported NFS timeouts."

My impression from reading zfs-discuss@ is that dedup performance
and some interactivity issues actually still exist in Illumos and
that they are completely unrelated to zfs writer threads.

As I can't use dedup on my systems I don't really pay attention to
them, though.

> > I'm not saying a PR would be useless, but in my experience PRs
> > with insufficient information just stay open and if the problem
> > isn't important enough for you to provide additional information
> > filing a PR is unlikely to have a great impact:
> > http://www.freebsd.org/cgi/query-pr-summary.cgi?category=&text=zfs
> 
> Then someone in the know needs to explain exactly *what* data would help
> and (more importantly) *how* to go about providing it (i.e. what to
> enable in the kernel, what commands to issue, etc.).  Eidan has
> repeatedly insisted that PRs are a Good Thing(tm) because they allow for
> an official way to track issues vs. mailing list threads that start and
> turn into tumbleweeds (just like the one I've referenced).

And how many of those PRs are actually solved?

This is a rhetoric question and I don't expect you to look it up.

I'm not saying that PRs are a bad thing, but filing PRs is the easy part
and in my experience issues that don't spark developer interest on the
mailing lists are usually also ignored when filed as PR, especially when
they don't contain 100% of the information that may be relevant.

Even if you provide "proof" that the priorities are indeed the cause
of the problem there's a fair chance that the PR gets ignored anyway.

I currently have four somewhat ZFS-related PRs open, the first was filed
in 2007.

I still don't think that the solution is that nobody works on ZFS
improvements until my PRs are solved.

I'm looking forward to using LZ4 which promises better compression than
lzjb with less interactivity impact than gzip. It might even work for
your /dev/random test as it's supposed to better deal with poorly
compressible data.

> Without those necessary instructions, in effect what you're asking me to
> do is "prove" that the problem exists, which I have already done so.
> You just don't like the data I've provided.

I don't expect you to "prove" that the problem exists.

My impression is that the interactivity issues with gzip have
been well known for years and exist since the ZFS import.

I also don't dislike your data, all I'm saying is that
there could be other explanations.

> Bottom line: people enable compression on an fs, issue large amounts of
> write I/O to that fs (say hundreds of megabytes, or gigabytes), and
> start to see the entire system intermittently stalling hard (for
> multiple seconds at a time).  This affects everything from switching VTs
> on physical console to packets going across SSH.  The stalls vary in
> duration depending on what compression type is used (lzjb vs. gzip-1 --
> I cannot even imagine what gzip-9 would be like).  I described it as
> verbosely as I could, including going back and "re-testing" because
> people felt the "ZFSv28 import might have addressed it" (it did not):
> 
> http://lists.freebsd.org/pipermail/freebsd-fs/2011-October/012752.html

I'm aware that the interactivity issues with gzip still exist with
ZFSv"5000" and don't deny that similar issues might exist with lzjb
(even though it works for me).

I don't think anyone is denying that gzip compression currently
works poorly for some work loads. Fixing it apparently just isn't
considered a priority and thus the issue still exists.

Fabian
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 196 bytes
Desc: not available
URL: <http://lists.freebsd.org/pipermail/freebsd-stable/attachments/20130210/38ea69be/attachment.sig>


More information about the freebsd-stable mailing list