International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F398A0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󘠀
D8800
󘠁
D8801
󘠂
D8802
󘠃
D8803
󘠄
D8804
󘠅
D8805
󘠆
D8806
󘠇
D8807
󘠈
D8808
󘠉
D8809
󘠊
D880A
󘠋
D880B
󘠌
D880C
󘠍
D880D
󘠎
D880E
󘠏
D880F
80
90
󘠐
D8810
󘠑
D8811
󘠒
D8812
󘠓
D8813
󘠔
D8814
󘠕
D8815
󘠖
D8816
󘠗
D8817
󘠘
D8818
󘠙
D8819
󘠚
D881A
󘠛
D881B
󘠜
D881C
󘠝
D881D
󘠞
D881E
󘠟
D881F
90
A0
󘠠
D8820
󘠡
D8821
󘠢
D8822
󘠣
D8823
󘠤
D8824
󘠥
D8825
󘠦
D8826
󘠧
D8827
󘠨
D8828
󘠩
D8829
󘠪
D882A
󘠫
D882B
󘠬
D882C
󘠭
D882D
󘠮
D882E
󘠯
D882F
A0
B0
󘠰
D8830
󘠱
D8831
󘠲
D8832
󘠳
D8833
󘠴
D8834
󘠵
D8835
󘠶
D8836
󘠷
D8837
󘠸
D8838
󘠹
D8839
󘠺
D883A
󘠻
D883B
󘠼
D883C
󘠽
D883D
󘠾
D883E
󘠿
D883F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]