International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A0A0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󠠀
E0800
󠠁
E0801
󠠂
E0802
󠠃
E0803
󠠄
E0804
󠠅
E0805
󠠆
E0806
󠠇
E0807
󠠈
E0808
󠠉
E0809
󠠊
E080A
󠠋
E080B
󠠌
E080C
󠠍
E080D
󠠎
E080E
󠠏
E080F
80
90
󠠐
E0810
󠠑
E0811
󠠒
E0812
󠠓
E0813
󠠔
E0814
󠠕
E0815
󠠖
E0816
󠠗
E0817
󠠘
E0818
󠠙
E0819
󠠚
E081A
󠠛
E081B
󠠜
E081C
󠠝
E081D
󠠞
E081E
󠠟
E081F
90
A0
󠠠
E0820
󠠡
E0821
󠠢
E0822
󠠣
E0823
󠠤
E0824
󠠥
E0825
󠠦
E0826
󠠧
E0827
󠠨
E0828
󠠩
E0829
󠠪
E082A
󠠫
E082B
󠠬
E082C
󠠭
E082D
󠠮
E082E
󠠯
E082F
A0
B0
󠠰
E0830
󠠱
E0831
󠠲
E0832
󠠳
E0833
󠠴
E0834
󠠵
E0835
󠠶
E0836
󠠷
E0837
󠠸
E0838
󠠹
E0839
󠠺
E083A
󠠻
E083B
󠠼
E083C
󠠽
E083D
󠠾
E083E
󠠿
E083F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]