International Components for Unicode

UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
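
As a rough illustration of how an alias list like the one above can be used, the sketch below resolves any listed alias to the internal converter name. The names `ALIASES`, `CANONICAL`, and `resolve` are hypothetical, and the case-insensitive matching is an assumption made for this example; ICU applies its own name-matching rules, which are not modeled here.

```python
# Aliases copied from the list above; the lookup helper itself is hypothetical.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption: compare names case-insensitively. This is a simplification,
# not a reproduction of ICU's matching behaviour.
_LOOKUP = {alias.lower(): CANONICAL for alias in ALIASES}


def resolve(name):
    """Return the internal converter name for a listed alias, else None."""
    return _LOOKUP.get(name.lower())


print(resolve("IBM-1208"))     # a listed alias, different case
print(resolve("iso-8859-1"))   # not in this converter's alias list
```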


Codepage Layout

Currently showing the codepage starting with the bytes F0BBB1. Rows give the high nibble of the final byte and columns the low nibble; each cell shows the Unicode code point with its character beneath. Rows 00-70 and C0-F0 contain no entries.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  3BC40 3BC41 3BC42 3BC43 3BC44 3BC45 3BC46 3BC47 3BC48 3BC49 3BC4A 3BC4B 3BC4C 3BC4D 3BC4E 3BC4F
    𻱀     𻱁     𻱂     𻱃     𻱄     𻱅     𻱆     𻱇     𻱈     𻱉     𻱊     𻱋     𻱌     𻱍     𻱎     𻱏
90  3BC50 3BC51 3BC52 3BC53 3BC54 3BC55 3BC56 3BC57 3BC58 3BC59 3BC5A 3BC5B 3BC5C 3BC5D 3BC5E 3BC5F
    𻱐     𻱑     𻱒     𻱓     𻱔     𻱕     𻱖     𻱗     𻱘     𻱙     𻱚     𻱛     𻱜     𻱝     𻱞     𻱟
A0  3BC60 3BC61 3BC62 3BC63 3BC64 3BC65 3BC66 3BC67 3BC68 3BC69 3BC6A 3BC6B 3BC6C 3BC6D 3BC6E 3BC6F
    𻱠     𻱡     𻱢     𻱣     𻱤     𻱥     𻱦     𻱧     𻱨     𻱩     𻱪     𻱫     𻱬     𻱭     𻱮     𻱯
B0  3BC70 3BC71 3BC72 3BC73 3BC74 3BC75 3BC76 3BC77 3BC78 3BC79 3BC7A 3BC7B 3BC7C 3BC7D 3BC7E 3BC7F
    𻱰     𻱱     𻱲     𻱳     𻱴     𻱵     𻱶     𻱷     𻱸     𻱹     𻱺     𻱻     𻱼     𻱽     𻱾     𻱿
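
The code points in the layout follow directly from the UTF-8 bit layout for four-byte sequences. The sketch below, in Python rather than ICU's own API, recomputes them from the prefix F0 BB B1 and cross-checks the result against Python's built-in decoder. The helper name `decode_four_byte` is made up for this example.

```python
def decode_four_byte(b0, b1, b2, b3):
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point."""
    return (
        ((b0 & 0x07) << 18)    # lead byte 11110xxx carries 3 bits
        | ((b1 & 0x3F) << 12)  # each continuation byte 10xxxxxx carries 6 bits
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


PREFIX = (0xF0, 0xBB, 0xB1)

# Only 80..BF are valid continuation bytes, which is why only rows 80-B0
# of the layout are populated.
code_points = [decode_four_byte(*PREFIX, b3) for b3 in range(0x80, 0xC0)]

# Cross-check the arithmetic against Python's own UTF-8 decoder.
for b3, cp in zip(range(0x80, 0xC0), code_points):
    assert ord(bytes([*PREFIX, b3]).decode("utf-8")) == cp

print(f"{code_points[0]:X}..{code_points[-1]:X}")  # prints 3BC40..3BC7F
```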

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
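
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. The following Python sketch (not ICU code; the helper name `bytes_per_utf16_unit` is made up here) shows why the maximum is 3 even though some UTF-8 sequences are four bytes long, and confirms that the substitution bytes are the encoding of U+FFFD.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"


def bytes_per_utf16_unit(ch):
    """UTF-8 byte count divided by the number of UTF-16 code units."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII: 1 byte per unit. BMP characters: up to 3 bytes per unit.
# Supplementary characters: 4 bytes spread over 2 units, so 2 per unit.
for ch in ["A", "\u00E9", "\u20AC", "\U0003BC40"]:
    print(f"U+{ord(ch):04X}: {bytes_per_utf16_unit(ch)}")

# The substitution character is U+FFFD REPLACEMENT CHARACTER.
print(f"U+{ord(SUBSTITUTION.decode('utf-8')):04X}")  # prints U+FFFD
```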

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
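
The set above covers every code point except the surrogate range U+D800 to U+DFFF. A short Python sketch of the same rule follows; the helper names `is_representable` and `try_encode` are hypothetical, and Python's strict UTF-8 codec stands in for the ICU converter.

```python
def is_representable(cp):
    """True if cp is in the set [^\\uD800-\\uDFFF] within the Unicode range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def try_encode(cp):
    """Encode one code point to UTF-8, or return None if the codec refuses."""
    try:
        return chr(cp).encode("utf-8")
    except UnicodeEncodeError:
        return None


# Code points on either side of the surrogate block encode normally;
# the surrogates themselves are rejected.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000):
    encoded = try_encode(cp)
    shown = encoded.hex().upper() if encoded is not None else "not representable"
    print(f"U+{cp:04X}: {shown}")
```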