International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A99B

Each row is the high nibble of the final byte and each column is its low nibble; each cell gives the resulting code point in hex.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  A96C0 A96C1 A96C2 A96C3 A96C4 A96C5 A96C6 A96C7 A96C8 A96C9 A96CA A96CB A96CC A96CD A96CE A96CF
90  A96D0 A96D1 A96D2 A96D3 A96D4 A96D5 A96D6 A96D7 A96D8 A96D9 A96DA A96DB A96DC A96DD A96DE A96DF
A0  A96E0 A96E1 A96E2 A96E3 A96E4 A96E5 A96E6 A96E7 A96E8 A96E9 A96EA A96EB A96EC A96ED A96EE A96EF
B0  A96F0 A96F1 A96F2 A96F3 A96F4 A96F5 A96F6 A96F7 A96F8 A96F9 A96FA A96FB A96FC A96FD A96FE A96FF

Rows 00-70 and C0-F0 contain no entries.
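
The mapping in this layout follows from the bit structure of a 4-byte UTF-8 sequence. The sketch below is an illustration, not ICU code: the helper name `decode_utf8_4byte` is hypothetical, and it deliberately skips overlong and range validation. It decodes the first and last cells of the layout by hand and checks the first one against Python's built-in decoder.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point.

    Hypothetical helper for illustration; it checks only the byte
    patterns, not overlong forms or the U+10FFFF upper limit.
    """
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((t & 0xC0) != 0x80 for t in b[1:]):
        raise ValueError("trail bytes must be in the range 80-BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


prefix = bytes.fromhex("F2A99B")
first = decode_utf8_4byte(prefix + b"\x80")  # row 80, column 00
last = decode_utf8_4byte(prefix + b"\xBF")   # row B0, column 0F
builtin_first = (prefix + b"\x80").decode("utf-8")

print(f"{first:X} {last:X}")  # A96C0 A96FF
```

Only final bytes 80 through BF are valid trail bytes, which is why the other rows of the layout are empty.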

Information About This Converter

  · Type of converter: UCNV_UTF8
  · Minimum number of bytes per UChar: 1
  · Maximum number of bytes per UChar: 3
  · Substitution character: \xEF\xBF\xBD
  · Is ASCII [\x20-\x7E] compatible? TRUE
  · Is ASCII [\u0020-\u007E] ambiguous? FALSE
  · Contains ambiguous aliases? FALSE
  · Always generates Unicode NFC? UNKNOWN
  · Contains BiDi characters? TRUE
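
The byte counts above are stated per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. The following sketch uses only Python's standard codecs, not ICU, and the helper name `utf8_bytes_per_utf16_unit` is hypothetical. It shows why the maximum is 3 and not 4: a supplementary code point takes 4 UTF-8 bytes but also occupies 2 UTF-16 code units.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """Ratio of UTF-8 bytes to UTF-16 code units for one character.

    Hypothetical helper for illustration; ICU itself is not involved.
    """
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


ratios = {
    "U+0041": utf8_bytes_per_utf16_unit("A"),          # 1 byte / 1 unit
    "U+00E9": utf8_bytes_per_utf16_unit("\u00e9"),     # 2 bytes / 1 unit
    "U+FFFF": utf8_bytes_per_utf16_unit("\uffff"),     # 3 bytes / 1 unit
    "U+A96C0": utf8_bytes_per_utf16_unit(chr(0xA96C0)),  # 4 bytes / 2 units
}

# The listed substitution character is the UTF-8 form of U+FFFD.
substitution = "\ufffd".encode("utf-8")

print(ratios)
print(substitution)  # b'\xef\xbf\xbd'
```

The largest ratio, 3, occurs for code points in the range U+0800 to U+FFFF outside the surrogate block.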

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
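
The set above covers every code point except the surrogates U+D800 through U+DFFF. As a rough check, the sketch below uses Python's strict UTF-8 codec as a stand-in for the converter (an assumption; ICU is not called) and a hypothetical helper, `is_utf8_representable`, to test the boundaries and count the representable code points.

```python
def is_utf8_representable(cp: int) -> bool:
    """True if the code point encodes under Python's strict UTF-8 codec.

    Hypothetical helper; Python's codec stands in for the ICU converter.
    """
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


boundaries = {cp: is_utf8_representable(cp) for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF)}

# Walk the whole code space, U+0000 through U+10FFFF.
representable_count = sum(is_utf8_representable(cp) for cp in range(0x110000))

print(boundaries)
print(representable_count)  # 1112064, i.e. 0x110000 minus 0x800 surrogates
```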