International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
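
As an illustration of how an alias list like this can be used, the following minimal sketch maps any listed alias back to the internal converter name. The helper names (`normalize`, `canonical_name`) are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption for this example, not a description of ICU's exact comparison algorithm.

```python
# Aliases copied from the list above for the internal converter "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Assumed loose matching: lowercase, drop '-', '_' and spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Lookup table from normalized alias to the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


print(canonical_name("IBM_1208"))   # an alias from the list, spelled loosely
print(canonical_name("latin-1"))    # not in the list
```

A dictionary keyed on the normalized form keeps lookups constant-time and makes the matching rule easy to swap out if stricter comparison is wanted.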


Codepage Layout

Currently showing the codepage starting with the bytes F2 9B A0. Each cell gives the Unicode code point (hexadecimal) of the character encoded by those bytes followed by the trailing byte (row plus column). Rows 00 to 70 and C0 to F0 are empty.

Row   00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80    9B800  9B801  9B802  9B803  9B804  9B805  9B806  9B807  9B808  9B809  9B80A  9B80B  9B80C  9B80D  9B80E  9B80F
90    9B810  9B811  9B812  9B813  9B814  9B815  9B816  9B817  9B818  9B819  9B81A  9B81B  9B81C  9B81D  9B81E  9B81F
A0    9B820  9B821  9B822  9B823  9B824  9B825  9B826  9B827  9B828  9B829  9B82A  9B82B  9B82C  9B82D  9B82E  9B82F
B0    9B830  9B831  9B832  9B833  9B834  9B835  9B836  9B837  9B838  9B839  9B83A  9B83B  9B83C  9B83D  9B83E  9B83F
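
The relationship between the lead bytes F2 9B A0, the trailing byte, and the code points in the grid follows from the standard four-byte UTF-8 bit layout. The sketch below works it out by hand; `decode_four_byte` is a hypothetical helper written for this example, and it only checks the byte markers (it does not reject overlong or out-of-range sequences as a full decoder must).

```python
def decode_four_byte(seq: bytes) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence into a code point."""
    b0, b1, b2, b3 = seq
    # Lead byte must look like 11110xxx, trailing bytes like 10xxxxxx.
    if b0 & 0xF8 != 0xF0 or any(b & 0xC0 != 0x80 for b in (b1, b2, b3)):
        raise ValueError("not a four-byte UTF-8 sequence")
    # 3 payload bits from the lead byte, 6 from each trailing byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


prefix = bytes.fromhex("F29BA0")
first = decode_four_byte(prefix + b"\x80")  # lowest valid trailing byte
last = decode_four_byte(prefix + b"\xBF")   # highest valid trailing byte
print(f"U+{first:X} .. U+{last:X}")  # prints "U+9B800 .. U+9B83F"
```

Only trailing bytes 80 to BF carry the `10xxxxxx` continuation marker, which is why the grid is populated in rows 80 through B0 and empty everywhere else.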

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
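
Several of these properties can be checked with a few lines of code. The sketch below assumes that "UChar" means one UTF-16 code unit, as it does in ICU, which would explain why the maximum is 3 rather than 4: a four-byte UTF-8 sequence corresponds to two UTF-16 code units. The helper name `utf8_bytes_per_utf16_unit` and the sample characters are chosen for illustration only.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count of a character divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 sequence length: 1, 2, 3, and 4 bytes.
samples = ["A", "\u00E9", "\u20AC", "\U0009B800"]
ratios = [utf8_bytes_per_utf16_unit(ch) for ch in samples]
print(ratios)  # prints "[1.0, 2.0, 3.0, 2.0]"

# The substitution character listed above is U+FFFD encoded in UTF-8.
substitution = "\uFFFD".encode("utf-8")
print(substitution.hex().upper())  # prints "EFBFBD"

# ASCII compatibility: each byte in 0x20..0x7E decodes to the same code point.
ascii_ok = all(bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F))
print(ascii_ok)  # prints "True"
```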

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
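
The set above says that every code point except the surrogate range U+D800 to U+DFFF is representable. As a rough check, the sketch below uses Python's built-in strict UTF-8 encoder, which applies the same rule; `is_representable` is a hypothetical helper for this example and says nothing about how ICU implements the check.

```python
def is_representable(code_point: int) -> bool:
    """True if the code point can be encoded by a strict UTF-8 encoder."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python raises this for lone surrogates (U+D800..U+DFFF).
        return False


# Probe both edges of the surrogate range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```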