International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
UTR22 IBM WINDOWS JAVA IANA MIME Untagged Aliases All Aliases
UTF-8   ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
UTF-8
UTF-8 UTF-8 UTF-8 cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3AE82

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󮂀
EE080
󮂁
EE081
󮂂
EE082
󮂃
EE083
󮂄
EE084
󮂅
EE085
󮂆
EE086
󮂇
EE087
󮂈
EE088
󮂉
EE089
󮂊
EE08A
󮂋
EE08B
󮂌
EE08C
󮂍
EE08D
󮂎
EE08E
󮂏
EE08F
80
90
󮂐
EE090
󮂑
EE091
󮂒
EE092
󮂓
EE093
󮂔
EE094
󮂕
EE095
󮂖
EE096
󮂗
EE097
󮂘
EE098
󮂙
EE099
󮂚
EE09A
󮂛
EE09B
󮂜
EE09C
󮂝
EE09D
󮂞
EE09E
󮂟
EE09F
90
A0
󮂠
EE0A0
󮂡
EE0A1
󮂢
EE0A2
󮂣
EE0A3
󮂤
EE0A4
󮂥
EE0A5
󮂦
EE0A6
󮂧
EE0A7
󮂨
EE0A8
󮂩
EE0A9
󮂪
EE0AA
󮂫
EE0AB
󮂬
EE0AC
󮂭
EE0AD
󮂮
EE0AE
󮂯
EE0AF
A0
B0
󮂰
EE0B0
󮂱
EE0B1
󮂲
EE0B2
󮂳
EE0B3
󮂴
EE0B4
󮂵
EE0B5
󮂶
EE0B6
󮂷
EE0B7
󮂸
EE0B8
󮂹
EE0B9
󮂺
EE0BA
󮂻
EE0BB
󮂼
EE0BC
󮂽
EE0BD
󮂾
EE0BE
󮂿
EE0BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]