International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18CB1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񌱀
4CC40
񌱁
4CC41
񌱂
4CC42
񌱃
4CC43
񌱄
4CC44
񌱅
4CC45
񌱆
4CC46
񌱇
4CC47
񌱈
4CC48
񌱉
4CC49
񌱊
4CC4A
񌱋
4CC4B
񌱌
4CC4C
񌱍
4CC4D
񌱎
4CC4E
񌱏
4CC4F
80
90
񌱐
4CC50
񌱑
4CC51
񌱒
4CC52
񌱓
4CC53
񌱔
4CC54
񌱕
4CC55
񌱖
4CC56
񌱗
4CC57
񌱘
4CC58
񌱙
4CC59
񌱚
4CC5A
񌱛
4CC5B
񌱜
4CC5C
񌱝
4CC5D
񌱞
4CC5E
񌱟
4CC5F
90
A0
񌱠
4CC60
񌱡
4CC61
񌱢
4CC62
񌱣
4CC63
񌱤
4CC64
񌱥
4CC65
񌱦
4CC66
񌱧
4CC67
񌱨
4CC68
񌱩
4CC69
񌱪
4CC6A
񌱫
4CC6B
񌱬
4CC6C
񌱭
4CC6D
񌱮
4CC6E
񌱯
4CC6F
A0
B0
񌱰
4CC70
񌱱
4CC71
񌱲
4CC72
񌱳
4CC73
񌱴
4CC74
񌱵
4CC75
񌱶
4CC76
񌱷
4CC77
񌱸
4CC78
񌱹
4CC79
񌱺
4CC7A
񌱻
4CC7B
񌱼
4CC7C
񌱽
4CC7D
񌱾
4CC7E
񌱿
4CC7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]