International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09A98

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𚘀
1A600
𚘁
1A601
𚘂
1A602
𚘃
1A603
𚘄
1A604
𚘅
1A605
𚘆
1A606
𚘇
1A607
𚘈
1A608
𚘉
1A609
𚘊
1A60A
𚘋
1A60B
𚘌
1A60C
𚘍
1A60D
𚘎
1A60E
𚘏
1A60F
80
90
𚘐
1A610
𚘑
1A611
𚘒
1A612
𚘓
1A613
𚘔
1A614
𚘕
1A615
𚘖
1A616
𚘗
1A617
𚘘
1A618
𚘙
1A619
𚘚
1A61A
𚘛
1A61B
𚘜
1A61C
𚘝
1A61D
𚘞
1A61E
𚘟
1A61F
90
A0
𚘠
1A620
𚘡
1A621
𚘢
1A622
𚘣
1A623
𚘤
1A624
𚘥
1A625
𚘦
1A626
𚘧
1A627
𚘨
1A628
𚘩
1A629
𚘪
1A62A
𚘫
1A62B
𚘬
1A62C
𚘭
1A62D
𚘮
1A62E
𚘯
1A62F
A0
B0
𚘰
1A630
𚘱
1A631
𚘲
1A632
𚘳
1A633
𚘴
1A634
𚘵
1A635
𚘶
1A636
𚘷
1A637
𚘸
1A638
𚘹
1A639
𚘺
1A63A
𚘻
1A63B
𚘼
1A63C
𚘽
1A63D
𚘾
1A63E
𚘿
1A63F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]