International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
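
The alias list above can be read as a lookup table from any alias to the internal converter name. The following is a minimal sketch in Python, not ICU's own API. The `normalize` and `resolve` helpers are hypothetical names, and the matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for illustration.

```python
import re

# Alias list copied from the table above; the internal converter name is "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed matching rule)."""
    return re.sub(r"[-_ ]", "", name).lower()


# Map every normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))


if __name__ == "__main__":
    for candidate in ("IBM-1208", "windows-65001", "x_utf-8j", "latin-1"):
        print(candidate, "->", resolve(candidate))
```

Normalizing both the stored aliases and the query keeps the lookup a single dictionary access, at the cost of treating names that differ only in punctuation as identical.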


Codepage Layout

Currently showing the codepage starting with the bytes F189AD

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     񉭀     񉭁     񉭂     񉭃     񉭄     񉭅     񉭆     񉭇     񉭈     񉭉     񉭊     񉭋     񉭌     񉭍     񉭎     񉭏
       49B40 49B41 49B42 49B43 49B44 49B45 49B46 49B47 49B48 49B49 49B4A 49B4B 49B4C 49B4D 49B4E 49B4F
90     񉭐     񉭑     񉭒     񉭓     񉭔     񉭕     񉭖     񉭗     񉭘     񉭙     񉭚     񉭛     񉭜     񉭝     񉭞     񉭟
       49B50 49B51 49B52 49B53 49B54 49B55 49B56 49B57 49B58 49B59 49B5A 49B5B 49B5C 49B5D 49B5E 49B5F
A0     񉭠     񉭡     񉭢     񉭣     񉭤     񉭥     񉭦     񉭧     񉭨     񉭩     񉭪     񉭫     񉭬     񉭭     񉭮     񉭯
       49B60 49B61 49B62 49B63 49B64 49B65 49B66 49B67 49B68 49B69 49B6A 49B6B 49B6C 49B6D 49B6E 49B6F
B0     񉭰     񉭱     񉭲     񉭳     񉭴     񉭵     񉭶     񉭷     񉭸     񉭹     񉭺     񉭻     񉭼     񉭽     񉭾     񉭿
       49B70 49B71 49B72 49B73 49B74 49B75 49B76 49B77 49B78 49B79 49B7A 49B7B 49B7C 49B7D 49B7E 49B7F
C0-F0  (empty)
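
The relationship between the byte prefix F189AD and the code points in the layout can be checked by hand. The sketch below, in Python rather than ICU, decodes a four-byte UTF-8 sequence with bit masks and compares the result with Python's built-in decoder. The helper name `decode_utf8_4byte` is hypothetical.

```python
def decode_utf8_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point using bit masks."""
    if len(seq) != 4 or seq[0] & 0xF8 != 0xF0:
        raise ValueError("expected a lead byte of the form 11110xxx and 4 bytes")
    if any(b & 0xC0 != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must have the form 10xxxxxx")
    return (
        ((seq[0] & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((seq[1] & 0x3F) << 12)  # 6 payload bits from each trail byte
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


PREFIX = bytes.fromhex("F189AD")

# The fourth byte ranges over 0x80..0xBF, which is why only rows 80 to B0 are filled.
first = decode_utf8_4byte(PREFIX + b"\x80")
last = decode_utf8_4byte(PREFIX + b"\xBF")

if __name__ == "__main__":
    print(f"first: U+{first:04X}, last: U+{last:04X}")
    # Cross-check against the built-in UTF-8 decoder.
    print((PREFIX + b"\x80").decode("utf-8") == chr(first))
```

The 64 possible values of the final byte account for the 64 populated cells, running from 49B40 to 49B7F.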

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
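
The byte counts and the substitution character listed above can be illustrated with Python's standard codecs rather than ICU itself. In ICU, a UChar is a 16-bit UTF-16 code unit, so a supplementary character occupies two UChars and four UTF-8 bytes, which works out to two bytes per UChar. The helper name `bytes_per_uchar` is hypothetical, and Python's `errors="replace"` handler is used here only as an analogue of converter substitution.

```python
# U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes listed above.
SUBSTITUTION = "\uFFFD".encode("utf-8")


def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units for one character."""
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return len(ch.encode("utf-8")) / utf16_units


if __name__ == "__main__":
    print(SUBSTITUTION)                     # substitution bytes
    print(bytes_per_uchar("A"))             # ASCII: the minimum case
    print(bytes_per_uchar("\u20AC"))        # U+20AC: a 3-byte BMP character
    print(bytes_per_uchar(chr(0x49B40)))    # supplementary: 4 bytes over 2 UChars
    # An invalid byte decodes to U+FFFD when replacement is requested.
    print(b"\xff".decode("utf-8", errors="replace") == "\uFFFD")
```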


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
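
The set [^\uD800-\uDFFF] says that every code point except the surrogate range is representable. A minimal sketch of that membership test in Python follows. The function names are hypothetical, and the comparison against Python's own UTF-8 encoder is only a sanity check, not a statement about ICU's behavior.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range D800-DFFF."""
    if not 0 <= cp <= 0x10FFFF:
        return False
    return not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """Ask Python's strict UTF-8 encoder whether it accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Boundary values around the excluded range, plus the ends of the code space.
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x49B40, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp), python_can_encode(cp))
```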