Quote:Original post by Yann LQuote:There are about 41,000 characters in CJK extension B that fall outside the BMP.
AFAIK, you can perfectly well write CJK using only the BMP. But I may be wrong.
Why do you think that the Unicode standard includes CJK extension B? To take up space for no reason? There's a reason why I included this sentence in my post:
Quote:In particular, the Chinese government requires that computer systems properly implement many characters outside the BMP and has since at least around 2000.
Quote:If you specifically target the Chinese market, then the SIP might be required.SIP?
Quote:Can't disagree here. However there are many cases, especially for hobby programmers, where an 'intermediate' Unicode support level is entirely sufficient. Including a library such as ICU can be a daunting task for a beginner, and is not always justified.
I know where you're coming from, but just adding ICU to your application isn't much harder than adding a library like SDL or SFML, and these are tasks that we, in this forum, expect beginners and hobby programmers to be able to do.
In any case, this "intermediate" Unicode support level that you're advocating has the important disadvantages of only really existing on Windows machines (non-Windows compilers tend to use 32-bit wchar_t types) and relies on some of the most poorly documented features in the standard library. Fun challenge: try to find a list of locale names usable with MSVC.