International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage for byte sequences starting with the lead bytes F2 A3 A0

Each cell is the Unicode code point (hex) encoded by F2 A3 A0 followed by the trail byte at row + column. Rows 00-70 and C0-F0 are empty.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  A3800 A3801 A3802 A3803 A3804 A3805 A3806 A3807 A3808 A3809 A380A A380B A380C A380D A380E A380F
90  A3810 A3811 A3812 A3813 A3814 A3815 A3816 A3817 A3818 A3819 A381A A381B A381C A381D A381E A381F
A0  A3820 A3821 A3822 A3823 A3824 A3825 A3826 A3827 A3828 A3829 A382A A382B A382C A382D A382E A382F
B0  A3830 A3831 A3832 A3833 A3834 A3835 A3836 A3837 A3838 A3839 A383A A383B A383C A383D A383E A383F
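
The mapping from bytes to code points in this layout can be checked by hand. The sketch below uses Python's built-in UTF-8 codec rather than ICU (an assumption: both follow the same UTF-8 bit layout for well-formed four-byte sequences), and the helper names `cell_code_point` and `manual_code_point` are illustrative only.

```python
# Lead bytes shown by the codepage layout: F2 A3 A0.
LEAD = bytes([0xF2, 0xA3, 0xA0])

def cell_code_point(trail: int) -> int:
    """Decode F2 A3 A0 <trail> with Python's UTF-8 codec (not ICU)."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))

def manual_code_point(seq: bytes) -> int:
    """Assemble a four-byte UTF-8 sequence by hand: 3 + 6 + 6 + 6 payload bits."""
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )

first = cell_code_point(0x80)  # row 80, column 00
last = cell_code_point(0xBF)   # row B0, column 0F
print(f"{first:X} {last:X}")   # prints "A3800 A383F"
```

Only trail bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout stay empty.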

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
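
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 and not 4: a supplementary character takes four UTF-8 bytes but also two UChars. A rough check, again assuming Python's built-in codecs as a stand-in for ICU (the helper `bytes_per_utf16_unit` is hypothetical):

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count of one character divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units

# ASCII, Latin-1 Supplement, a three-byte BMP character, a supplementary character.
samples = ["A", "\u00E9", "\u20AC", "\U000A3800"]
ratios = [bytes_per_utf16_unit(c) for c in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The substitution character listed above is U+FFFD encoded in UTF-8.
substitution = "\uFFFD".encode("utf-8")
print(substitution)  # b'\xef\xbf\xbd'
```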

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
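
The set `[^\uD800-\uDFFF]` means every code point except the surrogates. A minimal sketch of that rule, assuming Python's strict UTF-8 codec behaves like the converter on this point (the function `is_representable` is illustrative, not an ICU API):

```python
def is_representable(code_point: int) -> bool:
    """Return True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates U+D800..U+DFFF.
        return False

# Boundaries of the excluded range, plus the last valid code point.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp))
```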