International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38C9C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󌜀
CC700
󌜁
CC701
󌜂
CC702
󌜃
CC703
󌜄
CC704
󌜅
CC705
󌜆
CC706
󌜇
CC707
󌜈
CC708
󌜉
CC709
󌜊
CC70A
󌜋
CC70B
󌜌
CC70C
󌜍
CC70D
󌜎
CC70E
󌜏
CC70F
80
90
󌜐
CC710
󌜑
CC711
󌜒
CC712
󌜓
CC713
󌜔
CC714
󌜕
CC715
󌜖
CC716
󌜗
CC717
󌜘
CC718
󌜙
CC719
󌜚
CC71A
󌜛
CC71B
󌜜
CC71C
󌜝
CC71D
󌜞
CC71E
󌜟
CC71F
90
A0
󌜠
CC720
󌜡
CC721
󌜢
CC722
󌜣
CC723
󌜤
CC724
󌜥
CC725
󌜦
CC726
󌜧
CC727
󌜨
CC728
󌜩
CC729
󌜪
CC72A
󌜫
CC72B
󌜬
CC72C
󌜭
CC72D
󌜮
CC72E
󌜯
CC72F
A0
B0
󌜰
CC730
󌜱
CC731
󌜲
CC732
󌜳
CC733
󌜴
CC734
󌜵
CC735
󌜶
CC736
󌜷
CC737
󌜸
CC738
󌜹
CC739
󌜺
CC73A
󌜻
CC73B
󌜼
CC73C
󌜽
CC73D
󌜾
CC73E
󌜿
CC73F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]