Re: Grep with non-ascii

From: George Mitchell
Date: Fri, 03 Feb 2023 17:36:47 UTC
On 2/3/23 11:06, Tomoaki AOKI wrote:
> [...]
> If this is the case like above, the only solution is to move to
> character set containing ALL characters all over the world.
> AFAIK, the only candidates are only two, TRON code [1] and Unicode (UCS,
> ISO/IEC 10646) [2]. And TRON code is very rarely used, actual candidate
> would be Unicode only.
> Note that Unicode is usually encoded to any of UTF-8, UTF-16 or UTF-32
> for data transfer (sometimes raw UCS-2?).
> [...]
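
As a rough illustration of the quoted note about encodings, here is a
minimal Python sketch that encodes a few sample characters as UTF-8,
UTF-16 and UTF-32 and reports the byte counts. The helper name
encoded_sizes and the sample characters are made up for this example;
the "-le" codec variants are used only so that no byte order mark is
counted.

```python
# Illustrative sketch: byte length of the same text in three Unicode
# encoding forms. encoded_sizes is a hypothetical helper, not part of
# any library.
ENCODINGS = ("utf-8", "utf-16-le", "utf-32-le")  # "-le" avoids a BOM


def encoded_sizes(text):
    """Return {encoding name: number of bytes} for the given text."""
    return {name: len(text.encode(name)) for name in ENCODINGS}


if __name__ == "__main__":
    # ASCII letter, Latin letter with accent, CJK ideograph, emoji
    for ch in ("a", "\u00e9", "\u65e5", "\U0001F600"):
        print(f"U+{ord(ch):04X}", encoded_sizes(ch))
```

UTF-8 keeps ASCII at one byte per character and grows to four bytes for
characters outside the Basic Multilingual Plane, while UTF-32 always
uses four bytes per code point.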

The one positive development in the world of computing that I would
credit to Java is the earliest big push toward the adoption of UTF-8.
I strongly hope UTF-8 becomes universally used sooner rather than
later.                                                     -- George