/orthography/

orthography

a work in progress collection of notes on orthography of Oceanic languages.

the purpose of these notes are not to assist in language learning but rather for editors or anyone dealing with these languages programatically. as of such, very little usage information is included and no pronounciation.

note that encoding here favours the use of combining diacritics rather than the precomposed unicode characters. since this site does not provide any fonts, be wary that the default sans-serif font may not support the necessary diacritics, or worse, supports them but renders them in odd places.

languages

Austronesian
Malayo-Polynesian
Eastern Malayo-Polynesian
Oceanic

general notes

when using with regex

normalisation of characters to either precomposed or sequences as per [UAX15] is necessary when the input’s preference of either one is not assured. composed forms are preferred during normalisation. see also [UTS18].