International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name   All Aliases
UTF-8                     UTF-8
                          ibm-1208
                          ibm-1209
                          ibm-5304
                          ibm-5305
                          ibm-13496
                          ibm-13497
                          ibm-17592
                          ibm-17593
                          windows-65001
                          cp1208
                          x-UTF_8J
                          unicode-1-1-utf-8
                          unicode-2-0-utf-8
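
As an illustration of how an alias list like the one above might be used, the following Python sketch maps each alias to the internal converter name. The `normalize` rule (lowercase, letters and digits only) is a simplifying assumption made for this example and is not ICU's actual alias-matching algorithm; `UTF8_ALIASES`, `normalize`, and `resolve` are hypothetical names.

```python
# Aliases copied from the alias list above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase and keep only letters and digits (assumed, simplified rule)."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


# Map every normalized alias to the internal converter name.
ALIAS_TO_CONVERTER = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CONVERTER.get(normalize(name))
```

With this sketch, `resolve("IBM-1208")` and `resolve("x-utf_8j")` both return `"UTF-8"`, while a name outside the list returns `None`.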


Codepage Layout

Currently showing the codepage starting with the bytes F489A2

Each cell gives the code point (hexadecimal) for the four-byte sequence F4 89 A2 followed by the trail byte at row + column.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80     109880 109881 109882 109883 109884 109885 109886 109887 109888 109889 10988A 10988B 10988C 10988D 10988E 10988F
90     109890 109891 109892 109893 109894 109895 109896 109897 109898 109899 10989A 10989B 10989C 10989D 10989E 10989F
A0     1098A0 1098A1 1098A2 1098A3 1098A4 1098A5 1098A6 1098A7 1098A8 1098A9 1098AA 1098AB 1098AC 1098AD 1098AE 1098AF
B0     1098B0 1098B1 1098B2 1098B3 1098B4 1098B5 1098B6 1098B7 1098B8 1098B9 1098BA 1098BB 1098BC 1098BD 1098BE 1098BF
C0-F0  (empty)
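
The layout above can be reproduced with a short Python sketch that decodes the three lead bytes F4 89 A2 followed by each trail byte from 80 to BF. It uses Python's built-in UTF-8 codec rather than ICU, so it only illustrates the byte-to-code-point mapping; `LEAD` and `code_point` are names chosen for this example.

```python
LEAD = bytes([0xF4, 0x89, 0xA2])  # the three lead bytes shown on this page


def code_point(trail: int) -> int:
    """Decode LEAD + trail as UTF-8 and return the resulting code point."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


# Print one row per high nibble of the trail byte (80, 90, A0, B0).
for row in range(0x80, 0xC0, 0x10):
    cells = " ".join(f"{code_point(row + col):06X}" for col in range(16))
    print(f"{row:02X}  {cells}")
```

Trail bytes outside 80 to BF are not valid continuation bytes, which is why the remaining rows of the layout are empty.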

Information About This Converter

Type of converter                        UCNV_UTF8
Minimum number of bytes per UChar        1
Maximum number of bytes per UChar        3
Substitution character                   \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?         TRUE
Is ASCII [\u0020-\u007E] ambiguous?      FALSE
Contains ambiguous aliases?              FALSE
Always generates Unicode NFC?            UNKNOWN
Contains BiDi characters?                TRUE
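
One way to read the byte counts and the substitution character listed above is sketched below in Python. It assumes that a UChar is a 16-bit UTF-16 code unit, so a four-byte UTF-8 sequence, which corresponds to two code units, averages two bytes per UChar. The helper `bytes_per_uchar` and the sample characters are chosen for this example only.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII letter, Latin-1 letter, euro sign, and a supplementary code point.
samples = ["A", "\u00e9", "\u20ac", "\U00109880"]
ratios = [bytes_per_uchar(ch) for ch in samples]

# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
substitution = "\ufffd".encode("utf-8")

# Python's codec also substitutes U+FFFD when asked to replace invalid input.
replaced = b"\xff".decode("utf-8", errors="replace")
```

Here the ratios range from 1 (ASCII) to 3 (for example the euro sign), matching the minimum and maximum in the table under the stated assumption.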

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
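
The set above excludes only the surrogate range D800 to DFFF. A minimal Python sketch of the same boundary follows, using Python's strict UTF-8 codec as a stand-in for the ICU converter; `is_representable` is a hypothetical helper name.

```python
def is_representable(code_point: int) -> bool:
    """True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for surrogate code points D800..DFFF.
        return False
```

For example, `is_representable(0xD7FF)` and `is_representable(0xE000)` are true, while `is_representable(0xD800)` is false.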