International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
UTR22 IBM WINDOWS JAVA IANA MIME Untagged Aliases All Aliases
UTF-8   ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
UTF-8
UTF-8 UTF-8 UTF-8 cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3AE92

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󮒀
EE480
󮒁
EE481
󮒂
EE482
󮒃
EE483
󮒄
EE484
󮒅
EE485
󮒆
EE486
󮒇
EE487
󮒈
EE488
󮒉
EE489
󮒊
EE48A
󮒋
EE48B
󮒌
EE48C
󮒍
EE48D
󮒎
EE48E
󮒏
EE48F
80
90
󮒐
EE490
󮒑
EE491
󮒒
EE492
󮒓
EE493
󮒔
EE494
󮒕
EE495
󮒖
EE496
󮒗
EE497
󮒘
EE498
󮒙
EE499
󮒚
EE49A
󮒛
EE49B
󮒜
EE49C
󮒝
EE49D
󮒞
EE49E
󮒟
EE49F
90
A0
󮒠
EE4A0
󮒡
EE4A1
󮒢
EE4A2
󮒣
EE4A3
󮒤
EE4A4
󮒥
EE4A5
󮒦
EE4A6
󮒧
EE4A7
󮒨
EE4A8
󮒩
EE4A9
󮒪
EE4AA
󮒫
EE4AB
󮒬
EE4AC
󮒭
EE4AD
󮒮
EE4AE
󮒯
EE4AF
A0
B0
󮒰
EE4B0
󮒱
EE4B1
󮒲
EE4B2
󮒳
EE4B3
󮒴
EE4B4
󮒵
EE4B5
󮒶
EE4B6
󮒷
EE4B7
󮒸
EE4B8
󮒹
EE4B9
󮒺
EE4BA
󮒻
EE4BB
󮒼
EE4BC
󮒽
EE4BD
󮒾
EE4BE
󮒿
EE4BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]