[Bug 275444] isprint() library function returns wrong when LC_CTYPE is ja_JP.SJIS (tcsh aborts by this)
Date: Sat, 02 Dec 2023 18:10:08 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=275444
--- Comment #3 from uratan <uratan@miomio.jp> ---
Created attachment 246729
--> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=246729&action=edit
list of disappeared mappings
> Could you please try replacing
> tools/tools/locale/etc/final-maps/map.SJIS file with the one in attachment
> and doing `make all install` in share/ctypedef?
> It seems to fix the \ and ~ characters for me, but checking that
> it doesn't break anything else would be helpful.
I did so, and confirmed that tcsh/csh can now printenv an environment
variable containing a '~' character with the new LC_CTYPE:
+----------------------------------------
|-r--r--r-- 1 root wheel 4700 Dec 1 04:14 /usr/share/locale/ja_JP.SJIS/LC_CTYPE
|MD5 (/usr/share/locale/ja_JP.SJIS/LC_CTYPE) = 13bb935e7ba1275a263cf0074ceaa352
+----------------------------------------
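A more direct check of the library function named in the bug title could
look like the sketch below. It assumes libc exports isprint() as a callable
symbol reachable through ctypes, and that the ja_JP.SJIS locale is installed;
the helper name printable_under is made up for this example.

```python
import ctypes
import ctypes.util
import locale


def printable_under(locale_name, codes=(0x5C, 0x7E)):
    """Ask libc's isprint() about each code with LC_CTYPE set to locale_name.

    Returns {code: bool}, or None if the locale is not installed.
    """
    previous = locale.setlocale(locale.LC_CTYPE)  # remember the current setting
    try:
        locale.setlocale(locale.LC_CTYPE, locale_name)
    except locale.Error:
        return None
    try:
        # find_library may return None; CDLL(None) then uses the running process.
        libc = ctypes.CDLL(ctypes.util.find_library("c"))
        return {code: bool(libc.isprint(code)) for code in codes}
    finally:
        locale.setlocale(locale.LC_CTYPE, previous)


if __name__ == "__main__":
    # 0x5C and 0x7E are the '\' and '~' positions discussed in this bug.
    print(printable_under("ja_JP.SJIS"))
```

If both codes are reported as printable under ja_JP.SJIS, that would agree
with the tcsh/csh result above.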
- * - * -
I analyzed your new map file by plain text processing,
because I am not familiar with either KANJI codes
or LC_CTYPE with multibyte characters.
Here are the files used:
+----------------------------------------
|MD5 (map.SJIS.orig) = 5a3ff0eb3cf5efe1fa9d44dc6a494e57 ...of 14.0R...[A]
|MD5 (map.SJIS.yuri) = 7adad9b105608e4fd16a5e9f00af7a43 ... yours ...[B]
+----------------------------------------
[A] has 11495 mappings, including 96 aliases for chars under 0x7f and
| <YEN_SIGN> \x5c
| <OVERLINE> \x7e
[B] has 7166 mappings, including the same 96 aliases for chars under 0x7f and
| <REVERSE_SOLIDUS> \x5C
| <TILDE> \x7E
So [B] has 4329 fewer mappings than [A].
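The attached file "www-disappeared.txt" (referred to as [C]) is
the list of mappings that disappeared from [A] to [B], sorted by code.
It has 4331 entries, including <YEN_SIGN> and <OVERLINE>.
That is two more than the net difference of 4329, presumably because
[B] gains <REVERSE_SOLIDUS> and <TILDE> in their place.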
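For reference, the kind of text processing described above can be sketched
as follows. This is not the script actually used; it assumes every mapping
line has the form "<NAME> \xHH..." as in the excerpts quoted here, and the
function names and the <SPACE> sample entry are invented for illustration.

```python
import re

# One mapping per line: a symbolic name, then one or more \xHH escapes.
MAPPING_RE = re.compile(r"^(<[^>]+>)\s+((?:\\x[0-9A-Fa-f]{2})+)\s*$")


def parse_map(text):
    """Return {symbolic name: byte sequence} for every mapping line."""
    mappings = {}
    for line in text.splitlines():
        match = MAPPING_RE.match(line.strip())
        if match:
            name, escaped = match.groups()
            mappings[name] = bytes.fromhex(escaped.replace("\\x", ""))
    return mappings


def disappeared(old, new):
    """Mappings whose name is in old but not in new, sorted by code."""
    return sorted((old[name], name) for name in old.keys() - new.keys())


# Tiny stand-ins for [A] and [B]; the real files have thousands of lines.
sample_a = "<SPACE> \\x20\n<YEN_SIGN> \\x5c\n<OVERLINE> \\x7e\n"
sample_b = "<SPACE> \\x20\n<REVERSE_SOLIDUS> \\x5C\n<TILDE> \\x7E\n"
gone = disappeared(parse_map(sample_a), parse_map(sample_b))

if __name__ == "__main__":
    for code, name in gone:
        print(name, code.hex())
```

Comparing by name rather than by code is a deliberate choice here: it is
what makes <YEN_SIGN> and <OVERLINE> show up as disappeared even though
\x5c and \x7e are still mapped in [B] under different names.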
I am using this web page as an (easy) reference for SJIS codes:
http://charset.7jp.net/sjis.html
The author recommends against using the chars shown on a light-blue
background in the bottom table, because they cannot be mapped to
Unicode; these include 'machine/OS dependent characters'.
I checked the disappeared mappings in [C] against the table by eye;
they are all on a light-blue background.
So your [B] should be fine, I think.
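The check by eye could be roughly approximated in code. The sketch below
uses Python's strict "shift_jis" codec as a stand-in for "can be mapped to
Unicode"; that is an assumption and is not the same criterion as the table
on the web page. The helper names and the two sample codes (one ordinary
kanji code and one vendor-extension code) are chosen for illustration only.

```python
def maps_to_unicode(code, codec="shift_jis"):
    """True if the byte sequence decodes under the given Python codec."""
    try:
        code.decode(codec)
    except UnicodeDecodeError:
        return False
    return True


def unmappable(codes, codec="shift_jis"):
    """Subset of codes that the codec cannot decode, in input order."""
    return [code for code in codes if not maps_to_unicode(code, codec)]


# 0x889f is a JIS X 0208 kanji; 0x8740 is a vendor-extension position.
sample = [b"\x88\x9f", b"\x87\x40"]

if __name__ == "__main__":
    for code in unmappable(sample):
        print(code.hex(), "does not decode under strict shift_jis")
```

Feeding the codes from [C] through unmappable() would give a second
opinion on the visual check, within the limits of the codec used.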