UTF-8 and wchar_t

Joerg Sonnenberger joerg at britannica.bec.de
Sun Feb 21 22:44:16 UTC 2010


Hi all,
reviewing some libarchive code I stumbled upon the code that converts
UTF-8 to wide strings.  Like a lot of other software, it currently
blindly assumes that wchar_t ~= UCS-4.  My question is whether FreeBSD
intentionally makes that guarantee (and should therefore define
__STDC_ISO_10646__ according to ISO C99) or, if not, what correct way of
reading UTF-8 it provides.  Unlike NetBSD, FreeBSD still lacks iconv(3)
support in base, so the usual approach of converting to the locale
charset and using mbtowc etc. is not possible.
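
To make the assumption concrete, here is a minimal sketch of a
conversion that does not take wchar_t ~= UCS-4 for granted.  It is not
the libarchive code; utf8_decode and codepoint_to_wchar are hypothetical
names chosen for illustration.  The decoder produces a plain code point,
and the value is only stored in a wchar_t when the implementation
defines __STDC_ISO_10646__.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <wchar.h>

/*
 * Decode one UTF-8 sequence from s (at most n bytes) into *cp.
 * Returns the number of bytes consumed, or 0 on malformed input.
 * Hypothetical helper for illustration, not taken from libarchive.
 */
static size_t
utf8_decode(const unsigned char *s, size_t n, uint32_t *cp)
{
	/* Smallest code point that legitimately needs len bytes. */
	static const uint32_t min_cp[] = { 0, 0, 0x80, 0x800, 0x10000 };
	size_t len, i;
	uint32_t c;

	assert(s != NULL && cp != NULL);
	if (n == 0)
		return 0;
	if (s[0] < 0x80) {
		len = 1; c = s[0];
	} else if ((s[0] & 0xE0) == 0xC0) {
		len = 2; c = s[0] & 0x1F;
	} else if ((s[0] & 0xF0) == 0xE0) {
		len = 3; c = s[0] & 0x0F;
	} else if ((s[0] & 0xF8) == 0xF0) {
		len = 4; c = s[0] & 0x07;
	} else {
		return 0;	/* stray continuation or invalid lead byte */
	}
	if (n < len)
		return 0;	/* truncated sequence */
	for (i = 1; i < len; i++) {
		if ((s[i] & 0xC0) != 0x80)
			return 0;
		c = (c << 6) | (s[i] & 0x3F);
	}
	/* Reject overlong forms, surrogates and values past U+10FFFF. */
	if (c < min_cp[len] || c > 0x10FFFF || (c >= 0xD800 && c <= 0xDFFF))
		return 0;
	*cp = c;
	return len;
}

/*
 * Store a code point in a wchar_t only if the implementation promises
 * ISO 10646 values for wchar_t.  Returns 1 on success, 0 otherwise.
 */
static int
codepoint_to_wchar(uint32_t cp, wchar_t *wc)
{
#ifdef __STDC_ISO_10646__
	if (cp > (uint32_t)WCHAR_MAX)
		return 0;
	*wc = (wchar_t)cp;
	return 1;
#else
	(void)cp;
	(void)wc;
	return 0;	/* no portable mapping without iconv(3) or similar */
#endif
}
```

On a system that does not define the macro, codepoint_to_wchar always
fails, which is exactly the case where I do not know what the correct
fallback is supposed to be.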

Joerg

PS: Please keep me in CC.


More information about the freebsd-standards mailing list