International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38D83

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󍃀
CD0C0
󍃁
CD0C1
󍃂
CD0C2
󍃃
CD0C3
󍃄
CD0C4
󍃅
CD0C5
󍃆
CD0C6
󍃇
CD0C7
󍃈
CD0C8
󍃉
CD0C9
󍃊
CD0CA
󍃋
CD0CB
󍃌
CD0CC
󍃍
CD0CD
󍃎
CD0CE
󍃏
CD0CF
80
90
󍃐
CD0D0
󍃑
CD0D1
󍃒
CD0D2
󍃓
CD0D3
󍃔
CD0D4
󍃕
CD0D5
󍃖
CD0D6
󍃗
CD0D7
󍃘
CD0D8
󍃙
CD0D9
󍃚
CD0DA
󍃛
CD0DB
󍃜
CD0DC
󍃝
CD0DD
󍃞
CD0DE
󍃟
CD0DF
90
A0
󍃠
CD0E0
󍃡
CD0E1
󍃢
CD0E2
󍃣
CD0E3
󍃤
CD0E4
󍃥
CD0E5
󍃦
CD0E6
󍃧
CD0E7
󍃨
CD0E8
󍃩
CD0E9
󍃪
CD0EA
󍃫
CD0EB
󍃬
CD0EC
󍃭
CD0ED
󍃮
CD0EE
󍃯
CD0EF
A0
B0
󍃰
CD0F0
󍃱
CD0F1
󍃲
CD0F2
󍃳
CD0F3
󍃴
CD0F4
󍃵
CD0F5
󍃶
CD0F6
󍃷
CD0F7
󍃸
CD0F8
󍃹
CD0F9
󍃺
CD0FA
󍃻
CD0FB
󍃼
CD0FC
󍃽
CD0FD
󍃾
CD0FE
󍃿
CD0FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]