International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
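
As a rough illustration of how an alias list like this might be used, the sketch below resolves any listed alias to the internal converter name. The alias strings are copied from the list above; the `canonical_name` helper and its plain case-insensitive matching are assumptions made for this example and are not ICU's own lookup routine.

```python
# Aliases copied from the list above; each one names the same converter.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Hypothetical lookup table keyed by the lowercased alias.
_LOOKUP = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(alias):
    """Return the internal converter name for a known alias, else None.

    Matching here is only case-insensitive, which is a simplification.
    """
    return _LOOKUP.get(alias.strip().lower())


print(canonical_name("IBM-1208"))
print(canonical_name("latin-1"))
```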


Codepage Layout

Currently showing the codepage starting with the bytes F3 88 A0

Rows give the high nibble of the final byte and columns give the low nibble. Each cell shows the Unicode code point (hexadecimal) for that byte sequence. Rows 00-70 and C0-F0 contain no entries.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  C8800 C8801 C8802 C8803 C8804 C8805 C8806 C8807 C8808 C8809 C880A C880B C880C C880D C880E C880F
90  C8810 C8811 C8812 C8813 C8814 C8815 C8816 C8817 C8818 C8819 C881A C881B C881C C881D C881E C881F
A0  C8820 C8821 C8822 C8823 C8824 C8825 C8826 C8827 C8828 C8829 C882A C882B C882C C882D C882E C882F
B0  C8830 C8831 C8832 C8833 C8834 C8835 C8836 C8837 C8838 C8839 C883A C883B C883C C883D C883E C883F
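
The layout above can be checked with a short sketch that appends a final byte to the lead bytes F3 88 A0 and decodes the result as UTF-8. It uses Python's built-in codec rather than ICU, and the helper names `decode_cell` and `is_mapped` are made up for this example.

```python
LEAD = bytes.fromhex("F388A0")  # lead bytes shown in the layout above


def decode_cell(lead, trail):
    """Decode lead bytes plus one final byte as UTF-8; return the code point."""
    return ord((lead + bytes([trail])).decode("utf-8"))


def is_mapped(trail):
    """True if the final byte completes a valid UTF-8 sequence after LEAD."""
    try:
        decode_cell(LEAD, trail)
        return True
    except UnicodeDecodeError:
        return False


first = decode_cell(LEAD, 0x80)
last = decode_cell(LEAD, 0xBF)
print(f"U+{first:04X} .. U+{last:04X}")  # U+C8800 .. U+C883F

# Only final bytes 80..BF are continuation bytes, so only those rows are filled.
mapped = [b for b in range(256) if is_mapped(b)]
print(f"{mapped[0]:02X}..{mapped[-1]:02X}, {len(mapped)} cells")
```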

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
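
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. The sketch below, which uses Python's codecs rather than ICU, shows how a 4-byte UTF-8 sequence still stays within the maximum because it corresponds to two UChars. The helper name `bytes_per_uchar` is an assumption for this example.

```python
def bytes_per_uchar(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


for ch in ["A", "\u00e9", "\u20ac", "\U000C8800"]:
    n_bytes, n_units = bytes_per_uchar(ch)
    print(f"U+{ord(ch):04X}: {n_bytes} bytes / {n_units} UChar(s)")

# The substitution character listed above is the UTF-8 form of U+FFFD.
SUBSTITUTION = "\ufffd".encode("utf-8")
print(SUBSTITUTION)
```

A supplementary character such as U+C8800 takes 4 bytes but spans 2 UChars, so it averages 2 bytes per UChar. The 3-byte maximum is reached by characters in the range U+0800 to U+FFFF.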

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
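
The set above excludes only the surrogate range U+D800 to U+DFFF. A minimal sketch of the same rule, using Python's strict UTF-8 encoder as a stand-in for the ICU converter (an assumption; the two happen to agree on this point), with a made-up helper name `representable`:

```python
def representable(code_point):
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Collect every code point the encoder rejects.
rejected = [cp for cp in range(0x110000) if not representable(cp)]
print(f"U+{rejected[0]:04X} .. U+{rejected[-1]:04X}, {len(rejected)} code points")
```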