International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
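
As a rough illustration of how an alias table like this one can be used, the sketch below maps every listed alias to the internal converter name. The helper names `normalize` and `resolve_alias` are hypothetical, and the loose matching rule (ignore case and everything except letters and digits) is an assumption made for this sketch. It is not a reproduction of ICU's own lookup code.

```python
import re

# Aliases copied from the list above; "UTF-8" is the internal converter name.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical loose key: lowercase, then drop all but ASCII letters and digits."""
    return re.sub(r"[^0-9a-z]", "", name.lower())


# Lookup table from normalized alias to the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve_alias(name: str):
    """Return the internal converter name for a known alias, else None."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


if __name__ == "__main__":
    for candidate in ("IBM_1208", "Windows-65001", "latin-1"):
        print(candidate, "->", resolve_alias(candidate))
```

With this table, spellings such as `IBM_1208` or `utf8` resolve to `UTF-8`, while a name that is not in the list resolves to `None`.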


Codepage Layout

Currently showing the codepage starting with the bytes F3 B9 98

Each cell gives the code point (hex) selected by the final byte. The row label is the high nibble of the final byte and the column is the low nibble. Rows 00 through 70 and C0 through F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  F9600 F9601 F9602 F9603 F9604 F9605 F9606 F9607 F9608 F9609 F960A F960B F960C F960D F960E F960F
90  F9610 F9611 F9612 F9613 F9614 F9615 F9616 F9617 F9618 F9619 F961A F961B F961C F961D F961E F961F
A0  F9620 F9621 F9622 F9623 F9624 F9625 F9626 F9627 F9628 F9629 F962A F962B F962C F962D F962E F962F
B0  F9630 F9631 F9632 F9633 F9634 F9635 F9636 F9637 F9638 F9639 F963A F963B F963C F963D F963E F963F
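
The layout above follows directly from the four-byte UTF-8 form: the lead bytes F3 B9 98 fix the upper bits of the code point, and the final byte (80 to BF) supplies the low six bits. The sketch below decodes a cell by hand and cross-checks it against Python's built-in codec. The names `LEAD` and `decode_cell` are introduced here for illustration only.

```python
# The three lead bytes named in the layout above.
LEAD = bytes.fromhex("F3B998")


def decode_cell(trail: int) -> int:
    """Return the code point encoded by LEAD followed by one final byte."""
    if not 0x80 <= trail <= 0xBF:
        # Only continuation bytes are valid here, which is why the other rows are empty.
        raise ValueError("final byte must be in the range 80..BF")
    b0, b1, b2 = LEAD
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the 11110xxx lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each 10xxxxxx byte
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )


if __name__ == "__main__":
    for trail in (0x80, 0x8F, 0xBF):
        cp = decode_cell(trail)
        # Cross-check the manual arithmetic against the built-in UTF-8 decoder.
        assert (LEAD + bytes([trail])).decode("utf-8") == chr(cp)
        print(f"{trail:02X} -> U+{cp:05X}")
```

The 64 valid final bytes account for the 64 populated cells, U+F9600 through U+F963F.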

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
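
The byte counts above are stated per UChar, which in ICU is a 16-bit UTF-16 code unit. That would explain why the maximum is 3 even though UTF-8 sequences can be 4 bytes long: a 4-byte sequence corresponds to a surrogate pair, that is, two UChars. The sketch below checks this reading with Python's standard codecs, and also confirms that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD. The helper `bytes_per_uchar` is hypothetical and not part of ICU.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # each code unit is 2 bytes
    return utf8_len / utf16_units


# One sample per UTF-8 sequence length: 1, 2, 3, and 4 bytes.
SAMPLES = ["A", "\u00E9", "\u20AC", "\U000F9600"]

if __name__ == "__main__":
    for ch in SAMPLES:
        print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")

    # The substitution character bytes decode to U+FFFD REPLACEMENT CHARACTER.
    print(repr(b"\xEF\xBF\xBD".decode("utf-8")))
```

Under this reading, the 4-byte sample contributes only 2 bytes per UChar, so the largest ratio comes from the 3-byte sequences in the Basic Multilingual Plane.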

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
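
The set above excludes only the surrogate range U+D800 to U+DFFF. As a quick check outside ICU, the sketch below uses Python's strict UTF-8 encoder, which also rejects lone surrogates, to probe code points on either side of that range. The function name `utf8_encodable` is introduced here for illustration.

```python
def utf8_encodable(cp: int) -> bool:
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates, U+D800..U+DFFF.
        return False


if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print(f"U+{cp:04X}: {utf8_encodable(cp)}")
```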