International Components for Unicode

UTF-16BE

List of Converter Aliases
Internal Converter Name: UTF-16BE
IANA: UTF-16BE
MIME: UTF-16BE
All Aliases:
UTF-16BE
x-utf-16be
UnicodeBigUnmarked
ibm-1200
ibm-1201
ibm-13488
ibm-13489
ibm-17584
ibm-17585
ibm-21680
ibm-21681
ibm-25776
ibm-25777
ibm-29872
ibm-29873
ibm-61955
ibm-61956
windows-1201
cp1200
cp1201
UTF16_BigEndian

Codepage layout information is not available for this converter at this time.


Information About This Converter
Type of converter: UCNV_UTF16_BigEndian
Minimum number of bytes per UChar: 2
Maximum number of bytes per UChar: 2
Substitution character: \xFF\xFD (see note below)
Is ASCII [\x20-\x7E] compatible? FALSE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
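
The fixed two-byte width and the lack of ASCII compatibility listed above can be checked with a short sketch. It uses Python's built-in `utf-16-be` codec rather than ICU itself, so it illustrates the encoding form, not ICU's converter API; the helper name `utf16be_units` is hypothetical.

```python
def utf16be_units(text: str) -> list[bytes]:
    """Split the UTF-16BE encoding of `text` into 2-byte code units.

    Hypothetical helper for illustration; uses Python's codec, not ICU.
    """
    data = text.encode("utf-16-be")
    return [data[i:i + 2] for i in range(0, len(data), 2)]


# Every code unit (a UChar in ICU terms) occupies exactly 2 bytes.
ascii_a = utf16be_units("A")            # U+0041, one code unit
euro = utf16be_units("\u20ac")          # U+20AC, one code unit
emoji = utf16be_units("\U0001F600")     # U+1F600, a surrogate pair (two code units)

# Not ASCII compatible: "A" is 00 41 in UTF-16BE but 41 in ASCII.
ascii_compatible = "A".encode("utf-16-be") == "A".encode("ascii")
```

A character outside the Basic Multilingual Plane still uses 2 bytes per code unit; it simply needs two code units.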

List of Languages Representable By This Codepage
Locale    Locale Name
af        Afrikaans
agq       Aghem
ak        Akan
am        Amharic
ar        Arabic
as        Assamese
asa       Asu
ast       Asturian
az        Azerbaijani
az_Cyrl   Azerbaijani (Cyrillic)
bas       Basaa
be        Belarusian
bem       Bemba
bez       Bena
bg        Bulgarian
bm        Bambara
bn        Bangla
bo        Tibetan
br        Breton
brx       Bodo
bs        Bosnian
bs_Cyrl   Bosnian (Cyrillic)
ca        Catalan
ccp       Chakma
ce        Chechen
ceb       Cebuano
cgg       Chiga
chr       Cherokee
ckb       Central Kurdish
cs        Czech
cy        Welsh
da        Danish
dav       Taita
de        German
de_CH     German (Switzerland)
dje       Zarma
doi       Dogri
dsb       Lower Sorbian
dua       Duala
dyo       Jola-Fonyi
dz        Dzongkha
ebu       Embu
ee        Ewe
el        Greek
en        English
eo        Esperanto
es        Spanish
et        Estonian
eu        Basque
ewo       Ewondo
fa        Persian
ff        Fulah
ff_Adlm   Fulah (Adlam)
fi        Finnish
fil       Filipino
fo        Faroese
fr        French
fur       Friulian
fy        Western Frisian
ga        Irish
gd        Scottish Gaelic
gl        Galician
gsw       Swiss German
gu        Gujarati
guz       Gusii
gv        Manx
ha        Hausa
haw       Hawaiian
he        Hebrew
hi        Hindi
hr        Croatian
hsb       Upper Sorbian
hu        Hungarian
hy        Armenian
ia        Interlingua
id        Indonesian
ig        Igbo
ii        Sichuan Yi
is        Icelandic
it        Italian
ja        Japanese
jgo       Ngomba
jmc       Machame
jv        Javanese
ka        Georgian
kab       Kabyle
kam       Kamba
kde       Makonde
kea       Kabuverdianu
kgp       Kaingang
khq       Koyra Chiini
ki        Kikuyu
kk        Kazakh
kkj       Kako
kl        Kalaallisut
kln       Kalenjin
km        Khmer
kn        Kannada
ko        Korean
kok       Konkani
ks        Kashmiri
ks_Deva   Kashmiri (Devanagari)
ksb       Shambala
ksf       Bafia
ksh       Colognian
ku        Kurdish
kw        Cornish
ky        Kyrgyz
lag       Langi
lb        Luxembourgish
lg        Ganda
lkt       Lakota
ln        Lingala
lo        Lao
lrc       Northern Luri
lt        Lithuanian
lu        Luba-Katanga
luo       Luo
luy       Luyia
lv        Latvian
mai       Maithili
mas       Masai
mer       Meru
mfe       Morisyen
mg        Malagasy
mgh       Makhuwa-Meetto
mgo       Metaʼ
mi        Māori
mk        Macedonian
ml        Malayalam
mn        Mongolian
mni       Manipuri
mr        Marathi
ms        Malay
mt        Maltese
mua       Mundang
my        Burmese
mzn       Mazanderani
naq       Nama
nd        North Ndebele
ne        Nepali
nl        Dutch
nmg       Kwasio
nnh       Ngiemboon
no        Norwegian
nus       Nuer
nyn       Nyankole
om        Oromo
or        Odia
os        Ossetic
pa        Punjabi
pa_Arab   Punjabi (Arabic)
pcm       Nigerian Pidgin
pl        Polish
ps        Pashto
ps_PK     Pashto (Pakistan)
pt        Portuguese
qu        Quechua
rm        Romansh
rn        Rundi
ro        Romanian
rof       Rombo
ru        Russian
rw        Kinyarwanda
rwk       Rwa
sa        Sanskrit
sah       Sakha
saq       Samburu
sat       Santali
sbp       Sangu
sc        Sardinian
sd        Sindhi
sd_Deva   Sindhi (Devanagari)
se        Northern Sami
seh       Sena
ses       Koyraboro Senni
sg        Sango
shi       Tachelhit
shi_Latn  Tachelhit (Latin)
si        Sinhala
sk        Slovak
sl        Slovenian
smn       Inari Sami
sn        Shona
so        Somali
sq        Albanian
sr        Serbian
sr_Latn   Serbian (Latin)
su        Sundanese
sv        Swedish
sw        Swahili
sw_CD     Swahili (Congo - Kinshasa)
sw_KE     Swahili (Kenya)
ta        Tamil
te        Telugu
teo       Teso
tg        Tajik
th        Thai
ti        Tigrinya
tk        Turkmen
to        Tongan
tr        Turkish
tt        Tatar
twq       Tasawaq
tzm       Central Atlas Tamazight
ug        Uyghur
uk        Ukrainian
ur        Urdu
uz        Uzbek
uz_Arab   Uzbek (Arabic)
uz_Cyrl   Uzbek (Cyrillic)
vai       Vai
vai_Latn  Vai (Latin)
vi        Vietnamese
vun       Vunjo
wae       Walser
wo        Wolof
xh        Xhosa
xog       Soga
yav       Yangben
yi        Yiddish
yo        Yoruba
yo_BJ     Yoruba (Benin)
yrl       Nheengatu
yue       Cantonese
yue_Hans  Cantonese (Simplified)
zgh       Standard Moroccan Tamazight
zh        Chinese
zh_Hant   Chinese (Traditional)
zu        Zulu

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
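
The set above covers every code point except the surrogate range U+D800 to U+DFFF. As a rough illustration, the sketch below expresses that rule directly and cross-checks it against Python's strict `utf-16-be` codec, which also rejects lone surrogates. This is not ICU code, and the function names `in_representable_set` and `encodes_strictly` are hypothetical.

```python
def in_representable_set(code_point: int) -> bool:
    """Mirror the set [^\\uD800-\\uDFFF]: any code point that is not a surrogate."""
    return 0 <= code_point <= 0x10FFFF and not (0xD800 <= code_point <= 0xDFFF)


def encodes_strictly(code_point: int) -> bool:
    """Return True if Python's strict UTF-16BE codec accepts the code point."""
    try:
        chr(code_point).encode("utf-16-be")
        return True
    except UnicodeEncodeError:
        # Lone surrogates are rejected under the default 'strict' error handler.
        return False


# Boundary code points around the surrogate block, plus the last code point.
samples = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
results = {cp: (in_representable_set(cp), encodes_strictly(cp)) for cp in samples}
```

For these boundary samples the set-membership rule and the codec agree.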

Note: The substitution byte sequence can be platform dependent because it depends on the endianness of the platform. Please see the Unicode FAQ for details.
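
To make the byte-order dependence concrete, the following sketch encodes U+FFFD (REPLACEMENT CHARACTER) in both byte orders with Python's built-in codecs. It assumes the listed substitution sequence \xFF\xFD is U+FFFD in big-endian order, which matches the bytes shown here; it does not call ICU.

```python
REPLACEMENT = "\ufffd"  # U+FFFD REPLACEMENT CHARACTER

# Big-endian: most significant byte first, matching the \xFF\xFD listed above.
big_endian = REPLACEMENT.encode("utf-16-be")

# Little-endian: the same code unit with its two bytes swapped.
little_endian = REPLACEMENT.encode("utf-16-le")

# The two sequences are byte-reversed copies of each other.
is_byte_swapped = big_endian == little_endian[::-1]
```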