UTF-8 Sorting
Gerhard Schmidt
estartu at ze.tum.de
Wed Mar 12 09:51:38 UTC 2014
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hi,
I've a problem with FreeBSD, UTF-8 and Sorting.
e.g. there is a file with the following content
Meier
Müller
Öger
Ofner
Schmidt
I have set my Terminal to ISO-8859-1 Encoding and call sort on this
file I get the following output.
Meier
Müller
Ofner
Öger
Schmidt
Which is correctly sorted.
When i change my Terminal to UTF-8 encoding and convert the file to
UTF-8 and call sort again I get the following output.
Meier
Müller
Ofner
Schmidt
Öger
which is wrong.
The problem seams to be that the LC_COLLATE file in the de_DE.UTF-8
locale is linked to ../la_LN.US-ASCII/LC_COLLATE (as are all
LC_COLLATE Files in any UTF-8 locale).
After some Research i found a Mail from Kuba Lida in December 2008
(yeah that's 5 Years ago) stating the same Problem and got no response.
Why isn't there a UTF-8 LC_COLLATE file for any language. Kuba Lida
believed there was a Problem with multibyte collate files in FreeBSD.
Is this true and are there plans to fix this problem.
The same test under Linux works without problem.
Regards
Estartu
- --
-
---------------------------------------------------------------------------
Gerhard Schmidt | http://www.augusta.de/~estartu |
Fischbachweg 3 | | PGP Public Key
86856 Hiltenfingen | JabberID: estartu at augusta.de | on request
Germany | |
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1
Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/
iQCVAwUBUyApfwzx22nOTJQRAQJIbgP+MMSPepEsyG8Kx+QRDGJlfyQKK+r98/e+
ZiNPRMNjBpT7qrElJLvYfAuix3pOyqL3mq1DQJvZmqQxfoxEdy6GUf42i1Yk5gEX
T05YtaeVRoXK/TetFt0UEcC3bXuXheu63aBpO4FU2v8CPTAyBwU6DUvV/v3AzXr6
j+mwws5n7so=
=J2tH
-----END PGP SIGNATURE-----
More information about the freebsd-stable
mailing list