unicode: Support characters beyond the first unicode plane

We used 16-bit variables to store the 'character code' everywhere but this won't let us represent anything beyond U+FFFF. This patch changes those variables to a custom type that can be 32 or 16 bits depending on the build, and adjusts numerous internal APIs and datastructures to match. This includes: * utf8decode() and friends * font manipulation, caching, rendering, and generation * on-screen keyboard * FAT filesystem (parsing and generating utf16 LFNs) * WIN32 simulator platform code Note that this patch doesn't _enable_ >16bit unicode support; a followup patch will turn that on for appropriate targets. Appears to work on: * hosted linux, native, linux simulator in both 16/32-bit modes. Needs testing on: * windows and macos simulator (16bit+32bit) Change-Id: Iba111b27d2433019b6bff937cf1ebd2c4353a0e8
2025-12-08 12:45:26 -05:00 · 2024-12-17 08:55:21 -05:00 · 2024-12-17 08:55:21 -05:00 · a2c10f6189
commit a2c10f6189
parent 2a88253426
44 changed files with 476 additions and 330 deletions
--- a/apps/plugins/lib/simple_viewer.c
+++ b/apps/plugins/lib/simple_viewer.c
@ -62,7 +62,7 @@ static const char* get_next_line(const char *text, struct view_info *info)
    total = 0;
    while(*ptr)
    {
-        unsigned short ch;
+        ucschar_t ch;
        n = ((intptr_t)rb->utf8decode(ptr, &ch) - (intptr_t)ptr);
        if (rb->is_diacritic(ch, NULL))
            w = 0;