printf(1) and UTF-8 multi-byte chars

Sun Oct 18 17:23:15 UTC 2020

On 18 Oct 2020 11:48:37 -0400
"John Levine" <johnl at iecc.com> wrote:

> I don't think there is any useful middle ground between counting bytes
> and full Unicode typesetting.

	There are good reasons for using all three levels, here are some:

Bytes: Content length headers, malloc calls - storage related
Glyphs: Truncation, apparent length, sorting - appearance related
Unicode Characters: UTF-8/16/32 conversions - encoding related

-- 
Steve O'Hara-Smith <steve at sohara.org>