Uppercase RE matching problems in FreeBSD 11

Baptiste Daroussin bapt at FreeBSD.org
Sun Nov 6 12:32:10 UTC 2016


On Sun, Nov 06, 2016 at 01:26:51PM +0100, Mark Martinec wrote:
> 2016-11-06 12:07, Baptiste Daroussin wrote:
> > Yes A-Z only means uppercase in an ASCII only world in a unicode world
> > it means
> > AaBb... Z because there are way more characters that simple A-Z. In
> > FreeBSD 11
> > we have a unicode collation instead of falling back in on LC_COLLATE=C
> > which
> > means ascii only
> > 
> > For regrexp for example one should use the classes: :upper: or :lower:.
> 
> It is a good idea to keep LC_COLLATE and LC_NUMERIC (and LC_MONETARY?) at
> "C"
> when LANG or LC_CTYPE is set to something else, otherwise unexpected
> things may happen.
> 

In scripts clearly, the collation rules, numeric rules and monetary rules may
vary depending on the locale.

Best regards,
Bapt
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 801 bytes
Desc: not available
URL: <http://lists.freebsd.org/pipermail/freebsd-stable/attachments/20161106/6f43bc0b/attachment.sig>


More information about the freebsd-stable mailing list