International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48283

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􂃀
1020C0
􂃁
1020C1
􂃂
1020C2
􂃃
1020C3
􂃄
1020C4
􂃅
1020C5
􂃆
1020C6
􂃇
1020C7
􂃈
1020C8
􂃉
1020C9
􂃊
1020CA
􂃋
1020CB
􂃌
1020CC
􂃍
1020CD
􂃎
1020CE
􂃏
1020CF
80
90
􂃐
1020D0
􂃑
1020D1
􂃒
1020D2
􂃓
1020D3
􂃔
1020D4
􂃕
1020D5
􂃖
1020D6
􂃗
1020D7
􂃘
1020D8
􂃙
1020D9
􂃚
1020DA
􂃛
1020DB
􂃜
1020DC
􂃝
1020DD
􂃞
1020DE
􂃟
1020DF
90
A0
􂃠
1020E0
􂃡
1020E1
􂃢
1020E2
􂃣
1020E3
􂃤
1020E4
􂃥
1020E5
􂃦
1020E6
􂃧
1020E7
􂃨
1020E8
􂃩
1020E9
􂃪
1020EA
􂃫
1020EB
􂃬
1020EC
􂃭
1020ED
􂃮
1020EE
􂃯
1020EF
A0
B0
􂃰
1020F0
􂃱
1020F1
􂃲
1020F2
􂃳
1020F3
􂃴
1020F4
􂃵
1020F5
􂃶
1020F6
􂃷
1020F7
􂃸
1020F8
􂃹
1020F9
􂃺
1020FA
􂃻
1020FB
􂃼
1020FC
􂃽
1020FD
􂃾
1020FE
􂃿
1020FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]