International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497,
  ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8,
  unicode-2-0-utf-8
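
An alias list like this one is typically used to resolve a requested charset name to the internal converter name. The following is a minimal Python sketch of that idea, assuming a plain case-insensitive match. `resolve_converter_name` and `UTF8_ALIASES` are hypothetical names used for illustration, not ICU APIs, and ICU's own name matching may be more lenient than this.

```python
# Aliases copied from the list above; every one maps to the internal name "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Hypothetical lookup table keyed by the lower-cased alias.
_ALIAS_TO_INTERNAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def resolve_converter_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return _ALIAS_TO_INTERNAL.get(name.strip().lower())


if __name__ == "__main__":
    for requested in ("CP1208", "windows-65001", "latin-1"):
        print(requested, "->", resolve_converter_name(requested))
```

Names that are not in the list, such as `latin-1` here, resolve to `None` rather than raising an error.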


Codepage Layout

Currently showing the codepage starting with the bytes F3 A9 98. Rows give the high nibble of the final byte, columns the low nibble.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70 (empty)
80    E9600 E9601 E9602 E9603 E9604 E9605 E9606 E9607 E9608 E9609 E960A E960B E960C E960D E960E E960F
90    E9610 E9611 E9612 E9613 E9614 E9615 E9616 E9617 E9618 E9619 E961A E961B E961C E961D E961E E961F
A0    E9620 E9621 E9622 E9623 E9624 E9625 E9626 E9627 E9628 E9629 E962A E962B E962C E962D E962E E962F
B0    E9630 E9631 E9632 E9633 E9634 E9635 E9636 E9637 E9638 E9639 E963A E963B E963C E963D E963E E963F
C0-F0 (empty)
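
The code points in the layout follow from the UTF-8 bit layout for four-byte sequences: the lead byte contributes 3 payload bits and each of the three following bytes contributes 6. The Python sketch below reproduces a few cells of the layout and cross-checks them against Python's built-in decoder. `decode_utf8_4byte` is a hypothetical helper that performs no validation; it is meant only to show the arithmetic.

```python
def decode_utf8_4byte(b1, b2, b3, b4):
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point."""
    return (
        ((b1 & 0x07) << 18)    # lead byte 11110xxx: low 3 bits
        | ((b2 & 0x3F) << 12)  # continuation byte 10xxxxxx: low 6 bits
        | ((b3 & 0x3F) << 6)
        | (b4 & 0x3F)
    )


# The three leading bytes shown for this codepage.
PREFIX = (0xF3, 0xA9, 0x98)

if __name__ == "__main__":
    # Only 0x80-0xBF are valid as the final byte, so only rows 80-B0 are populated.
    for last in (0x80, 0x8F, 0xBF):
        sequence = bytes([*PREFIX, last])
        code_point = decode_utf8_4byte(*PREFIX, last)
        # Cross-check the manual arithmetic against the built-in decoder.
        assert code_point == ord(sequence.decode("utf-8"))
        print(f"{sequence.hex().upper()} -> U+{code_point:05X}")
```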

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
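
Several of these properties can be checked with a short Python sketch. It uses Python's built-in UTF-8 codec rather than ICU, on the assumption that the two agree for these cases, and it treats a UChar as one UTF-16 code unit. `bytes_per_uchar` is a hypothetical helper, and the sample characters are arbitrary choices.

```python
def bytes_per_uchar(ch):
    """UTF-8 bytes used per UTF-16 code unit (UChar) for a single character."""
    utf8_length = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_length / utf16_units


# The substitution character U+FFFD encodes to the three bytes listed above.
SUBSTITUTION_BYTES = "\uFFFD".encode("utf-8")

# One sample per UTF-8 length: 1, 2, 3, and 4 bytes.
# The 4-byte sample needs two UChars, so it still averages 2 bytes per UChar.
SAMPLES = ["A", "\u00E9", "\u20AC", "\U000E9600"]
RATIOS = [bytes_per_uchar(ch) for ch in SAMPLES]

# ASCII compatibility: bytes 0x20-0x7E decode to the code points of the same value.
ASCII_BYTES = bytes(range(0x20, 0x7F))
ASCII_COMPATIBLE = [ord(c) for c in ASCII_BYTES.decode("utf-8")] == list(ASCII_BYTES)

if __name__ == "__main__":
    print("substitution bytes:", SUBSTITUTION_BYTES.hex().upper())
    print("bytes per UChar:", RATIOS)
    print("ASCII compatible:", ASCII_COMPATIBLE)
```

The ratio never exceeds 3 because any character that needs four UTF-8 bytes also needs two UTF-16 code units.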

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
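
The set `[^\uD800-\uDFFF]` means every code point except the surrogates U+D800 to U+DFFF. The Python sketch below checks this against Python's built-in UTF-8 codec, which is used here as a stand-in for the ICU converter on the assumption that both reject lone surrogates. `is_representable` is a hypothetical helper.

```python
def is_representable(code_point):
    """Return True if the code point can be encoded as UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 encoder rejects lone surrogates.
        return False


# Scan the whole Unicode range and collect the code points that cannot be encoded.
EXCLUDED = [cp for cp in range(0x110000) if not is_representable(cp)]

if __name__ == "__main__":
    print(f"first excluded: U+{EXCLUDED[0]:04X}")
    print(f"last excluded:  U+{EXCLUDED[-1]:04X}")
    print("count:", len(EXCLUDED))
```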