International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48BA0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􋠀
10B800
􋠁
10B801
􋠂
10B802
􋠃
10B803
􋠄
10B804
􋠅
10B805
􋠆
10B806
􋠇
10B807
􋠈
10B808
􋠉
10B809
􋠊
10B80A
􋠋
10B80B
􋠌
10B80C
􋠍
10B80D
􋠎
10B80E
􋠏
10B80F
80
90
􋠐
10B810
􋠑
10B811
􋠒
10B812
􋠓
10B813
􋠔
10B814
􋠕
10B815
􋠖
10B816
􋠗
10B817
􋠘
10B818
􋠙
10B819
􋠚
10B81A
􋠛
10B81B
􋠜
10B81C
􋠝
10B81D
􋠞
10B81E
􋠟
10B81F
90
A0
􋠠
10B820
􋠡
10B821
􋠢
10B822
􋠣
10B823
􋠤
10B824
􋠥
10B825
􋠦
10B826
􋠧
10B827
􋠨
10B828
􋠩
10B829
􋠪
10B82A
􋠫
10B82B
􋠬
10B82C
􋠭
10B82D
􋠮
10B82E
􋠯
10B82F
A0
B0
􋠰
10B830
􋠱
10B831
􋠲
10B832
􋠳
10B833
􋠴
10B834
􋠵
10B835
􋠶
10B836
􋠷
10B837
􋠸
10B838
􋠹
10B839
􋠺
10B83A
􋠻
10B83B
􋠼
10B83C
􋠽
10B83D
􋠾
10B83E
􋠿
10B83F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]