Port of OpenBSD's sdiff

LI Xin delphij at delphij.net
Tue Jun 26 09:04:07 UTC 2007


Andrey Chernov wrote:
> On Tue, Jun 26, 2007 at 10:11:58AM +0200, Ollivier Robert wrote:
>> According to Xin LI:
>>> Our current implementation is slower than many other implementation,
>>> especially the BSD licensed PCRE.  This has in turn made a lot of our
>>> utilities slow.  For instance sed -e 's/^foo [0-9]{3} bar.+$/\1/g' seems
>>> to use O(N^2) time where N is the text being processed.
>> I'm currently looking into replacing our ancient library (based on H.
>> Spencer code from decades ago) with either PCRE (which is nicely BSD
>> licensed as you say) or the new code from Mr. Spencer (taken from Tcl or
>> postgresql) or even maybe Oniguruma, the new library used by Ruby.
>>
>> I agree, anything will be better than the one we have.
> 
> Please choose variant which supports multibyte characters for sure.

If memory serves me right, all of re libraries Ollivier has mentioned
supports multibyte characters.  Maybe we should create or find some test
cases to make sure there is no regression?

Cheers,
-- 
Xin LI <delphij at delphij.net>	http://www.delphij.net/
FreeBSD - The Power to Serve!

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 249 bytes
Desc: OpenPGP digital signature
Url : http://lists.freebsd.org/pipermail/freebsd-current/attachments/20070626/df5c3973/signature.pgp


More information about the freebsd-current mailing list