International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B8A0

Rows give the high nibble of the final byte, columns give the low nibble. Each cell shows the Unicode code point (hexadecimal) that the four-byte sequence maps to.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (no characters)
80     F8800  F8801  F8802  F8803  F8804  F8805  F8806  F8807  F8808  F8809  F880A  F880B  F880C  F880D  F880E  F880F
90     F8810  F8811  F8812  F8813  F8814  F8815  F8816  F8817  F8818  F8819  F881A  F881B  F881C  F881D  F881E  F881F
A0     F8820  F8821  F8822  F8823  F8824  F8825  F8826  F8827  F8828  F8829  F882A  F882B  F882C  F882D  F882E  F882F
B0     F8830  F8831  F8832  F8833  F8834  F8835  F8836  F8837  F8838  F8839  F883A  F883B  F883C  F883D  F883E  F883F
C0-F0  (no characters)
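
The mapping in the layout above can be reproduced by hand. The following is a minimal Python sketch, not ICU code: `decode_four_byte` is a hypothetical helper that only handles well-formed four-byte sequences and skips the overlong and range checks a real decoder performs. It shows how the prefix F3 B8 A0 plus a trail byte in 80..BF yields the code points F8800..F883F.

```python
def decode_four_byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence by hand and return its code point.

    Illustrative only: no overlong or upper-range (> U+10FFFF) checks.
    """
    if len(seq) != 4 or seq[0] & 0xF8 != 0xF0:
        raise ValueError("expected a 4-byte sequence with lead byte F0..F7")
    if any(b & 0xC0 != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    # Lead byte contributes 3 bits, each trail byte contributes 6 bits.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


prefix = bytes.fromhex("F3B8A0")
first = decode_four_byte(prefix + b"\x80")
last = decode_four_byte(prefix + b"\xBF")
print(f"{first:X} {last:X}")  # F8800 F883F

# Cross-check against Python's built-in UTF-8 decoder.
assert chr(first) == (prefix + b"\x80").decode("utf-8")
```

Final bytes outside 80..BF are not valid trail bytes, which is why the other rows of the layout are empty.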

Information About This Converter
Type of converter:                      UCNV_UTF8
Minimum number of bytes per UChar:      1
Maximum number of bytes per UChar:      3
Substitution character:                 \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?        TRUE
Is ASCII [\u0020-\u007E] ambiguous?     FALSE
Contains ambiguous aliases?             FALSE
Always generates Unicode NFC?           UNKNOWN
Contains BiDi characters?               TRUE
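
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though a UTF-8 sequence can be 4 bytes long. The sketch below illustrates this with Python's standard codecs rather than ICU; `utf8_bytes_per_utf16_unit` is a hypothetical helper introduced only for this example.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """Return UTF-8 byte length divided by UTF-16 code-unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, Latin-1 supplement, BMP symbol, and a plane-15 private-use character.
for ch in ["A", "\u00E9", "\u20AC", "\U000F8800"]:
    print(f"U+{ord(ch):04X}: {utf8_bytes_per_utf16_unit(ch)}")

# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
assert b"\xEF\xBF\xBD".decode("utf-8") == "\uFFFD"
```

A supplementary character takes 4 UTF-8 bytes but 2 UTF-16 code units, so it averages 2 bytes per UChar. The worst case of 3 comes from BMP characters in the range U+0800..U+FFFF.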

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
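
The set above reads as "every code point except the surrogates U+D800..U+DFFF". As a rough check outside ICU, Python's strict UTF-8 codec applies the same rule; `representable_in_utf8` below is a hypothetical helper for illustration, not an ICU function.

```python
def representable_in_utf8(code_point: int) -> bool:
    """Return True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates (U+D800..U+DFFF).
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xF8800, 0x10FFFF):
    print(f"U+{cp:04X}: {representable_in_utf8(cp)}")
```

Only the two surrogate boundary values are rejected; the code points on either side of the range, and the private-use code points shown in the codepage layout, encode normally.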