International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
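
As an illustration of how an alias list like this might be used, here is a minimal Python sketch that maps any listed alias back to the internal converter name. The names `ALIASES`, `CANONICAL`, and `canonical_name` are hypothetical, and the case-insensitive comparison is a simplifying assumption, not ICU's own matching rules.

```python
from typing import Optional

# Aliases copied from the list above; the internal converter name is UTF-8.
CANONICAL = "UTF-8"
ALIASES = (
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
)

# Assumption: aliases are compared case-insensitively after trimming spaces.
_LOOKUP = {alias.lower(): CANONICAL for alias in ALIASES}


def canonical_name(alias: str) -> Optional[str]:
    """Return the internal converter name for a known alias, else None."""
    return _LOOKUP.get(alias.strip().lower())


print(canonical_name("IBM-1208"))
print(canonical_name("latin-1"))
```

An alias that is not in the table yields `None`, so callers can tell an unknown name apart from a valid one.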


Codepage Layout

Currently showing the codepage starting with the bytes F0A281

Rows give the high nibble of the final byte; columns give the low nibble. Each cell shows the character followed by its code point.

       00        01        02        03        04        05        06        07        08        09        0A        0B        0C        0D        0E        0F
00-70  (no entries)
80     𢁀 22040  𢁁 22041  𢁂 22042  𢁃 22043  𢁄 22044  𢁅 22045  𢁆 22046  𢁇 22047  𢁈 22048  𢁉 22049  𢁊 2204A  𢁋 2204B  𢁌 2204C  𢁍 2204D  𢁎 2204E  𢁏 2204F
90     𢁐 22050  𢁑 22051  𢁒 22052  𢁓 22053  𢁔 22054  𢁕 22055  𢁖 22056  𢁗 22057  𢁘 22058  𢁙 22059  𢁚 2205A  𢁛 2205B  𢁜 2205C  𢁝 2205D  𢁞 2205E  𢁟 2205F
A0     𢁠 22060  𢁡 22061  𢁢 22062  𢁣 22063  𢁤 22064  𢁥 22065  𢁦 22066  𢁧 22067  𢁨 22068  𢁩 22069  𢁪 2206A  𢁫 2206B  𢁬 2206C  𢁭 2206D  𢁮 2206E  𢁯 2206F
B0     𢁰 22070  𢁱 22071  𢁲 22072  𢁳 22073  𢁴 22074  𢁵 22075  𢁶 22076  𢁷 22077  𢁸 22078  𢁹 22079  𢁺 2207A  𢁻 2207B  𢁼 2207C  𢁽 2207D  𢁾 2207E  𢁿 2207F
C0-F0  (no entries)
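
The code points in this layout follow from the four-byte UTF-8 form: the lead bytes F0 A2 81 fix the upper bits, and the final byte (80 through BF) supplies the low six bits. A minimal Python sketch, assuming well-formed input and using a hypothetical helper named `decode_four_byte`, unpacks the payload bits by hand and cross-checks one value against Python's built-in decoder:

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence.

    Assumes the bytes already form a well-formed sequence;
    no validation is performed here.
    """
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each trailing byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


# The page shows lead bytes F0 A2 81; the final byte ranges over 80..BF.
first = decode_four_byte(0xF0, 0xA2, 0x81, 0x80)
last = decode_four_byte(0xF0, 0xA2, 0x81, 0xBF)
print(f"{first:X}..{last:X}")  # prints 22040..2207F

# Cross-check the first cell against the standard library decoder.
assert bytes([0xF0, 0xA2, 0x81, 0x80]).decode("utf-8") == chr(first)
```

Final bytes outside 80 through BF cannot end a UTF-8 sequence, which is consistent with the other rows of the layout being empty.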

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
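
The maximum of 3 bytes per UChar may look surprising given that UTF-8 sequences can be four bytes long. In ICU a UChar is a 16-bit UTF-16 code unit, so a four-byte sequence corresponds to two UChars (a surrogate pair). The following Python sketch illustrates the ratio for a few sample code points and confirms that the substitution bytes are the UTF-8 form of U+FFFD. The helper name `utf8_bytes_per_utf16_unit` and the sample values are chosen for illustration only, and the decoder behavior shown is Python's, not ICU's.

```python
def utf8_bytes_per_utf16_unit(cp: int) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one code point."""
    ch = chr(cp)
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# Sample code points: ASCII, a 2-byte form, a 3-byte form, a 4-byte form.
for cp in (0x41, 0x7FF, 0xFFFD, 0x22040):
    print(f"U+{cp:04X}: {utf8_bytes_per_utf16_unit(cp)} bytes per UTF-16 unit")

# The substitution bytes EF BF BD are U+FFFD REPLACEMENT CHARACTER in UTF-8.
assert "\uFFFD".encode("utf-8") == b"\xEF\xBF\xBD"

# Python's own decoder also substitutes U+FFFD for an invalid byte.
assert b"\xFF".decode("utf-8", errors="replace") == "\uFFFD"
```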

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
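
The set above reads as "every code point except the surrogates U+D800 through U+DFFF". A minimal Python sketch, using a hypothetical helper named `is_representable`, checks this against Python's strict UTF-8 encoder, which rejects lone surrogates:

```python
def is_representable(cp: int) -> bool:
    """True if the code point (0..0x10FFFF) encodes as well-formed UTF-8."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict encoder refuses surrogate code points.
        return False


# Boundary checks around the excluded surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x22040, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```

Only code points in the range 0 through 0x10FFFF are valid arguments; `chr` raises `ValueError` for anything larger.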