International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4869B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􆛀
1066C0
􆛁
1066C1
􆛂
1066C2
􆛃
1066C3
􆛄
1066C4
􆛅
1066C5
􆛆
1066C6
􆛇
1066C7
􆛈
1066C8
􆛉
1066C9
􆛊
1066CA
􆛋
1066CB
􆛌
1066CC
􆛍
1066CD
􆛎
1066CE
􆛏
1066CF
80
90
􆛐
1066D0
􆛑
1066D1
􆛒
1066D2
􆛓
1066D3
􆛔
1066D4
􆛕
1066D5
􆛖
1066D6
􆛗
1066D7
􆛘
1066D8
􆛙
1066D9
􆛚
1066DA
􆛛
1066DB
􆛜
1066DC
􆛝
1066DD
􆛞
1066DE
􆛟
1066DF
90
A0
􆛠
1066E0
􆛡
1066E1
􆛢
1066E2
􆛣
1066E3
􆛤
1066E4
􆛥
1066E5
􆛦
1066E6
􆛧
1066E7
􆛨
1066E8
􆛩
1066E9
􆛪
1066EA
􆛫
1066EB
􆛬
1066EC
􆛭
1066ED
􆛮
1066EE
􆛯
1066EF
A0
B0
􆛰
1066F0
􆛱
1066F1
􆛲
1066F2
􆛳
1066F3
􆛴
1066F4
􆛵
1066F5
􆛶
1066F6
􆛷
1066F7
􆛸
1066F8
􆛹
1066F9
􆛺
1066FA
􆛻
1066FB
􆛼
1066FC
􆛽
1066FD
􆛾
1066FE
􆛿
1066FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]