orthography
a work in progress collection of notes on orthography of Oceanic languages.
the purpose of these notes are not to assist in language learning but rather for editors or anyone dealing with these languages programatically. as of such, very little usage information is included and no pronounciation.
note that encoding here favours the use of combining diacritics rather than the precomposed unicode characters. since this site does not provide any fonts, be wary that the default sans-serif font may not support the necessary diacritics, or worse, supports them but renders them in odd places.
languages
Austronesian
Malayo-Polynesian
Eastern Malayo-Polynesian
Oceanic
- Admiralty Islands...
- Central Pacific Linkage
- Tokalau Fijian
- Polynesian
- Nuclear Polynesian
- - Anuta
- - East Futuna
- East Uvean-Niuafaoʻou
- - East Uvean
- - Niuafoʻou
- - Niuatoputapu
- Ellicean
- Pukapukic
- - Pukapuka
- Samoan-Tokelauan
- Gagana faʻa Sāmoa (Sāmoan)
- - Tokelau
- - Tuvalu
- Pukapukic
- Northern Outlier Polynesian-East Polynesian
- Carolinean Outlier Polynesian
- - Kapingamarangi
- - Nukuoro
- Solomons Northern Outlier Polynesian-East Polynesian
- Central Northern Outlier Polynesian
- - Luangiua
- Takuuic
- - Nukumanu
- - Nukuria
- - Takuu
- East Polynesian
- East Polynesian Distal
- Far East Polynesian
- - Mangareva
- - Rapanui
- Marquesan
- - North Marquesan
- - South Marquesan
- Far East Polynesian
- East Polynesian Proximal
- - Hawaiian
- Southern East Polynesian Proximal
- - Mangaia-Old Rapa
- - Māngarongaro
- Maoric
- te reo Māori
- - te reo Moriori
- - Rakahanga-Manihiki
- - Southern Cook Island Maori
- Tahitian Austral
- - Austral
- - Tahitian
- - Tuamotuan
- East Polynesian Distal
- Central Northern Outlier Polynesian
- - Sikaiana
- Carolinean Outlier Polynesian
- - Rennell-Bellona
- - Tikopia
- - Vaeakau-Taumako
- Vanuatu-Loyalty Outliers
- - Emae
- Mele-Futuna
- - Futuna-Aniwa
- - Mele-fila
- - West Uvean
- Tongic
- - Nieuan
- - Tonga (Tonga Islands)
- Nuclear Polynesian
- Polynesian
- Tokalau Fijian
- Micronesian
- Central Micronesian
- Gilbertese
- Western Micronesian
- Chuukic-Ponapeic
- ...
- Kajin M̧ajeļ (Marshallese or Ebon; oldstyle: Kajin Majōl)
- Chuukic-Ponapeic
- Central Micronesian
- North and Central Vanuatu...
- Southeast Solomonic
- Guadalcanal-Nggelic
- Nuclear Guadalcanal-Nggelic
- Nggelic
- - Bughotu
- - Gela
- North and West Guadalcanal
- - Ghari
- - Lengo
- - Malango
- Nggelic
- Southeast Guadalcanal
- - Birao
- - Talise
- Nuclear Guadalcanal-Nggelic
- Longgu-Malaita-Makira
- - Longgu
- Malaita-Makira
- Makira
- - Arosi
- - Bauro
- - Fagani
- - Kahua
- - Owa
- Malaita
- Central-Northern Malaita
- - Gula'alaa
- - Kwaio
- - Kwara'ae
- North Malaitan
- - Baeggu
- - Baelelea
- - Fataleka
- - Lau
- - To'abaita
- - Wala
- Southern Malaita
- ꞋAreꞌare
- - Dori'o
- - Oroha
- Central-Northern Malaita
- Sa'a
- Makira
- Guadalcanal-Nggelic
- Southern Melanesian...
- St. Matthias...
- Temotu...
- Western Oceanic linkage...
- Yapesic
- - Nguluwan
- - Yapese
general notes
when using with regex
normalisation of characters to either precomposed or sequences as per [UAX15] is necessary when the input’s preference of either one is not assured. composed forms are preferred during normalisation. see also [UTS18].