International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1B5A0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񵠀
75800
񵠁
75801
񵠂
75802
񵠃
75803
񵠄
75804
񵠅
75805
񵠆
75806
񵠇
75807
񵠈
75808
񵠉
75809
񵠊
7580A
񵠋
7580B
񵠌
7580C
񵠍
7580D
񵠎
7580E
񵠏
7580F
80
90
񵠐
75810
񵠑
75811
񵠒
75812
񵠓
75813
񵠔
75814
񵠕
75815
񵠖
75816
񵠗
75817
񵠘
75818
񵠙
75819
񵠚
7581A
񵠛
7581B
񵠜
7581C
񵠝
7581D
񵠞
7581E
񵠟
7581F
90
A0
񵠠
75820
񵠡
75821
񵠢
75822
񵠣
75823
񵠤
75824
񵠥
75825
񵠦
75826
񵠧
75827
񵠨
75828
񵠩
75829
񵠪
7582A
񵠫
7582B
񵠬
7582C
񵠭
7582D
񵠮
7582E
񵠯
7582F
A0
B0
񵠰
75830
񵠱
75831
񵠲
75832
񵠳
75833
񵠴
75834
񵠵
75835
񵠶
75836
񵠷
75837
񵠸
75838
񵠹
75839
񵠺
7583A
񵠻
7583B
񵠼
7583C
񵠽
7583D
񵠾
7583E
񵠿
7583F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]