Re: Grep with non-ascii

From: George Mitchell <george+freebsd_at_m5p.com>
Date: Sat, 04 Feb 2023 13:36:02 UTC
On 2/3/23 23:16, Tomoaki AOKI wrote:
> [...]
> And FreeBSD already has UTF-8. ;-)
> 
> Drawbacks of UTF-8 are...
>    *Han unification. Similar-looking but not identical characters in
>     Japanese, Chinese and Korean are fatally and mistakenly unified.
> 
>    *Lack of proper support for variant forms of characters.
>     Maybe Unicode should have another 2 dimensions, one for classifying
>     wrongly unified CJK characters and another one for variants.

I confess that I don't know enough to comment on those.
> 
>    *Font sets. Very few fonts cover all of the Unicode codepoints
>     that are assigned to actual characters.
> 
>    *FreeBSD base does not have a full Unicode font for vt yet.
>     (Input methods are a different problem, though.)
> 
Yes, but FreeBSD is making progress on remedying these problems.  Many
fonts DO have support for the codepoints I need, though.  I think these
are less of a problem than the problems that UTF-8 solves.    -- George