International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B99C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𹜀
39700
𹜁
39701
𹜂
39702
𹜃
39703
𹜄
39704
𹜅
39705
𹜆
39706
𹜇
39707
𹜈
39708
𹜉
39709
𹜊
3970A
𹜋
3970B
𹜌
3970C
𹜍
3970D
𹜎
3970E
𹜏
3970F
80
90
𹜐
39710
𹜑
39711
𹜒
39712
𹜓
39713
𹜔
39714
𹜕
39715
𹜖
39716
𹜗
39717
𹜘
39718
𹜙
39719
𹜚
3971A
𹜛
3971B
𹜜
3971C
𹜝
3971D
𹜞
3971E
𹜟
3971F
90
A0
𹜠
39720
𹜡
39721
𹜢
39722
𹜣
39723
𹜤
39724
𹜥
39725
𹜦
39726
𹜧
39727
𹜨
39728
𹜩
39729
𹜪
3972A
𹜫
3972B
𹜬
3972C
𹜭
3972D
𹜮
3972E
𹜯
3972F
A0
B0
𹜰
39730
𹜱
39731
𹜲
39732
𹜳
39733
𹜴
39734
𹜵
39735
𹜶
39736
𹜷
39737
𹜸
39738
𹜹
39739
𹜺
3973A
𹜻
3973B
𹜼
3973C
𹜽
3973D
𹜾
3973E
𹜿
3973F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]