Why isn’t UTF-8 allowed as the “ANSI” code page?

The “ANSI” codepage is basically legacy: Windows 9X era. All modern software should be Unicode (that is, UTF-16) based anyway.

Basically, when the Ansi code page stuff was originally designed, UTF-8 wasn’t even invented and so support for multi-byte encodings was rather haphazard (i.e. most Ansi code pages are single byte, with the exception of some East Asian code pages which are one-or-two byte). Adding support for “proper” multi-byte encodings was probably deemed not worth the effort when all new development should be done in UTF-16 anyway.

Leave a Comment