International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
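
The alias list can be treated as a lookup table from any alias to the internal converter name. The following is a minimal sketch of that idea, not ICU's implementation: the alias strings are copied from the list above, while `normalize` and `canonical_name` are hypothetical helpers, and the matching rule (ignore case, '-', '_' and spaces) is a deliberately simplified assumption about loose name comparison. In ICU4C the real data is available through functions such as ucnv_countAliases and ucnv_getAlias.

```python
# Aliases copied from the list above. The lookup logic below is an
# illustrative simplification, not ICU's actual alias matching.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed loose matching)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(alias: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(normalize(alias))


if __name__ == "__main__":
    for candidate in ("IBM_1208", "utf8", "Windows-65001", "latin-1"):
        print(candidate, "->", canonical_name(candidate))
```

Under these assumptions, spelling variants such as "IBM_1208" and "utf8" resolve to UTF-8, while a name that is not in the list yields None.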


Codepage Layout

Currently showing the codepage starting with the bytes F3B39C

Rows and columns give the high and low nibble of the trailing byte. Each cell
shows the code point (hex) of the character encoded by F3 B3 9C followed by
that byte. Trailing bytes 00-7F and C0-FF have no entries.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70 (no entries)
80    F3700 F3701 F3702 F3703 F3704 F3705 F3706 F3707 F3708 F3709 F370A F370B F370C F370D F370E F370F
90    F3710 F3711 F3712 F3713 F3714 F3715 F3716 F3717 F3718 F3719 F371A F371B F371C F371D F371E F371F
A0    F3720 F3721 F3722 F3723 F3724 F3725 F3726 F3727 F3728 F3729 F372A F372B F372C F372D F372E F372F
B0    F3730 F3731 F3732 F3733 F3734 F3735 F3736 F3737 F3738 F3739 F373A F373B F373C F373D F373E F373F
C0-F0 (no entries)
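
The code points in this layout follow from the standard UTF-8 bit layout for four-byte sequences: 3 payload bits from the lead byte and 6 from each continuation byte. A short sketch of that arithmetic, cross-checked against Python's built-in UTF-8 codec (used here as a stand-in for the ICU converter; `decode_4byte` is a hypothetical helper name):

```python
def decode_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point."""
    return (
        ((b0 & 0x07) << 18)    # 3 bits from the lead byte 11110xxx
        | ((b1 & 0x3F) << 12)  # 6 bits from each continuation byte 10xxxxxx
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


LEAD = (0xF3, 0xB3, 0x9C)  # the lead bytes shown on this page

first = decode_4byte(*LEAD, 0x80)  # lowest valid trailing byte
last = decode_4byte(*LEAD, 0xBF)   # highest valid trailing byte

# Cross-check the arithmetic against Python's own UTF-8 decoder.
assert bytes([*LEAD, 0x80]).decode("utf-8") == chr(first)
assert bytes([*LEAD, 0xBF]).decode("utf-8") == chr(last)

print(f"{first:X}..{last:X}")  # F3700..F373F
```

Only trailing bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout are empty.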

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
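
Several of these values can be checked by hand. The sketch below assumes that a UChar is one UTF-16 code unit and uses Python's built-in codecs rather than ICU, so it illustrates the numbers but does not show how ICU computes them; `utf8_bytes_per_uchar` is a hypothetical helper.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_count = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / uchar_count


# One sample per UTF-8 sequence length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U000F3700"]
ratios = [utf8_bytes_per_uchar(ch) for ch in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The substitution character bytes are the UTF-8 form of U+FFFD.
substitution = "\ufffd".encode("utf-8")
print(substitution)  # b'\xef\xbf\xbd'

# ASCII compatibility: bytes 0x20-0x7E decode to the same code points.
ascii_bytes = bytes(range(0x20, 0x7F))
ascii_compatible = ascii_bytes.decode("utf-8") == "".join(map(chr, range(0x20, 0x7F)))
print(ascii_compatible)  # True
```

A four-byte sequence encodes a supplementary character, which occupies two UTF-16 code units, so its ratio is 2 and the per-UChar maximum of 3 comes from three-byte sequences.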

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
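
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check, Python's strict UTF-8 codec (again a stand-in for the ICU converter, which is an assumption of this sketch) refuses to encode exactly those code points; `is_representable` is a hypothetical helper.

```python
def is_representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 codec rejects lone surrogates.
        return False


# Collect every code point the codec refuses to encode.
rejected = [cp for cp in range(0x110000) if not is_representable(cp)]
print(f"{rejected[0]:04X}..{rejected[-1]:04X}, {len(rejected)} code points")
# D800..DFFF, 2048 code points
```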