International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38F83

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󏃀
CF0C0
󏃁
CF0C1
󏃂
CF0C2
󏃃
CF0C3
󏃄
CF0C4
󏃅
CF0C5
󏃆
CF0C6
󏃇
CF0C7
󏃈
CF0C8
󏃉
CF0C9
󏃊
CF0CA
󏃋
CF0CB
󏃌
CF0CC
󏃍
CF0CD
󏃎
CF0CE
󏃏
CF0CF
80
90
󏃐
CF0D0
󏃑
CF0D1
󏃒
CF0D2
󏃓
CF0D3
󏃔
CF0D4
󏃕
CF0D5
󏃖
CF0D6
󏃗
CF0D7
󏃘
CF0D8
󏃙
CF0D9
󏃚
CF0DA
󏃛
CF0DB
󏃜
CF0DC
󏃝
CF0DD
󏃞
CF0DE
󏃟
CF0DF
90
A0
󏃠
CF0E0
󏃡
CF0E1
󏃢
CF0E2
󏃣
CF0E3
󏃤
CF0E4
󏃥
CF0E5
󏃦
CF0E6
󏃧
CF0E7
󏃨
CF0E8
󏃩
CF0E9
󏃪
CF0EA
󏃫
CF0EB
󏃬
CF0EC
󏃭
CF0ED
󏃮
CF0EE
󏃯
CF0EF
A0
B0
󏃰
CF0F0
󏃱
CF0F1
󏃲
CF0F2
󏃳
CF0F3
󏃴
CF0F4
󏃵
CF0F5
󏃶
CF0F6
󏃷
CF0F7
󏃸
CF0F8
󏃹
CF0F9
󏃺
CF0FA
󏃻
CF0FB
󏃼
CF0FC
󏃽
CF0FD
󏃾
CF0FE
󏃿
CF0FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]