International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1A798

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񧘀
67600
񧘁
67601
񧘂
67602
񧘃
67603
񧘄
67604
񧘅
67605
񧘆
67606
񧘇
67607
񧘈
67608
񧘉
67609
񧘊
6760A
񧘋
6760B
񧘌
6760C
񧘍
6760D
񧘎
6760E
񧘏
6760F
80
90
񧘐
67610
񧘑
67611
񧘒
67612
񧘓
67613
񧘔
67614
񧘕
67615
񧘖
67616
񧘗
67617
񧘘
67618
񧘙
67619
񧘚
6761A
񧘛
6761B
񧘜
6761C
񧘝
6761D
񧘞
6761E
񧘟
6761F
90
A0
񧘠
67620
񧘡
67621
񧘢
67622
񧘣
67623
񧘤
67624
񧘥
67625
񧘦
67626
񧘧
67627
񧘨
67628
񧘩
67629
񧘪
6762A
񧘫
6762B
񧘬
6762C
񧘭
6762D
񧘮
6762E
񧘯
6762F
A0
B0
񧘰
67630
񧘱
67631
񧘲
67632
񧘳
67633
񧘴
67634
񧘵
67635
񧘶
67636
񧘷
67637
񧘸
67638
񧘹
67639
񧘺
6763A
񧘻
6763B
񧘼
6763C
񧘽
6763D
񧘾
6763E
񧘿
6763F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]