International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2AB92

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򫒀
AB480
򫒁
AB481
򫒂
AB482
򫒃
AB483
򫒄
AB484
򫒅
AB485
򫒆
AB486
򫒇
AB487
򫒈
AB488
򫒉
AB489
򫒊
AB48A
򫒋
AB48B
򫒌
AB48C
򫒍
AB48D
򫒎
AB48E
򫒏
AB48F
80
90
򫒐
AB490
򫒑
AB491
򫒒
AB492
򫒓
AB493
򫒔
AB494
򫒕
AB495
򫒖
AB496
򫒗
AB497
򫒘
AB498
򫒙
AB499
򫒚
AB49A
򫒛
AB49B
򫒜
AB49C
򫒝
AB49D
򫒞
AB49E
򫒟
AB49F
90
A0
򫒠
AB4A0
򫒡
AB4A1
򫒢
AB4A2
򫒣
AB4A3
򫒤
AB4A4
򫒥
AB4A5
򫒦
AB4A6
򫒧
AB4A7
򫒨
AB4A8
򫒩
AB4A9
򫒪
AB4AA
򫒫
AB4AB
򫒬
AB4AC
򫒭
AB4AD
򫒮
AB4AE
򫒯
AB4AF
A0
B0
򫒰
AB4B0
򫒱
AB4B1
򫒲
AB4B2
򫒳
AB4B3
򫒴
AB4B4
򫒵
AB4B5
򫒶
AB4B6
򫒷
AB4B7
򫒸
AB4B8
򫒹
AB4B9
򫒺
AB4BA
򫒻
AB4BB
򫒼
AB4BC
򫒽
AB4BD
򫒾
AB4BE
򫒿
AB4BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]