rockbox/firmware/arabjoin.h
Solomon Peachy d05c59f35b internals: Support characters beyond the first unicode plane (WIP)
We used 16-bit variables to store the 'character code' everywhere but
this won't let us represent anything beyond U+FFFF.

This patch changes those variables to a custom type that can be 32 or 16
bits depending on the build, and adjusts numerous internal APIs and
datastructures to match.  This includes:

 * utf8decode() and friends
 * on-screen keyboard
 * font manipulation, caching, rendering, and generation
 * VFAT code parses and generates utf16 dirents
 * WIN32 simulator reads and writes utf16 filenames

Note that this patch doesn't _enable_ >16bit unicode support; a followup
patch will turn that on for appropriate targets.

Known bugs:

  * Native players in 32-bit unicode mode generate mangled filename
    entries if they include UTF16 surrogate codepoints.  Root cause
    is unclear, and may reside in core dircache code.

Needs testing on:

 * windows simulator (16bit+32bit)

Change-Id: I193a00fe2a11a4181ddc82df2d71be52bf00b6e6
2025-08-20 16:08:43 -04:00

12 lines
428 B
C

/* Note these are not ucschar_t becuase all arabic
codepoints are <16bit, so no need to waste table space */
typedef struct {
unsigned short isolated;
unsigned short final;
unsigned short medial;
unsigned short initial;
} arab_t;
extern const arab_t zwj; /* zero-width joiner */
extern const arab_t lamaleph[]; /* lam-aleph ligatures */
extern const arab_t jointable[]; /* lookup table for arabic joining */