International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
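
As a rough illustration of how an alias list like the one above might be used, the sketch below maps each listed alias to the internal converter name. The `resolve_alias` helper and its plain case-insensitive matching are assumptions made for this example; ICU's own alias-matching rules are not reproduced here.

```python
# Aliases copied from the list above; all of them name the same converter.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Hypothetical lookup table keyed by the lowercased alias.
_ALIAS_TO_INTERNAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def resolve_alias(name):
    """Return the internal converter name for a listed alias, or None."""
    return _ALIAS_TO_INTERNAL.get(name.strip().lower())


print(resolve_alias("cp1208"))
print(resolve_alias("WINDOWS-65001"))
print(resolve_alias("latin-1"))
```

Names that are not in the list, such as `latin-1` here, resolve to `None` rather than raising.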


Codepage Layout

Currently showing the codepage starting with the bytes F2AD92

Each cell gives the code point (hex) of the character encoded by the bytes F2 AD 92 followed by the byte at that row and column. Rows 00-70 and C0-F0 are empty.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     AD480 AD481 AD482 AD483 AD484 AD485 AD486 AD487 AD488 AD489 AD48A AD48B AD48C AD48D AD48E AD48F
90     AD490 AD491 AD492 AD493 AD494 AD495 AD496 AD497 AD498 AD499 AD49A AD49B AD49C AD49D AD49E AD49F
A0     AD4A0 AD4A1 AD4A2 AD4A3 AD4A4 AD4A5 AD4A6 AD4A7 AD4A8 AD4A9 AD4AA AD4AB AD4AC AD4AD AD4AE AD4AF
B0     AD4B0 AD4B1 AD4B2 AD4B3 AD4B4 AD4B5 AD4B6 AD4B7 AD4B8 AD4B9 AD4BA AD4BB AD4BC AD4BD AD4BE AD4BF
C0-F0  (empty)
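
The code points in the layout follow from the standard UTF-8 bit layout for 4-byte sequences. The sketch below recomputes the first and last populated cells from the lead bytes F2 AD 92 and shows why only trailing bytes 80 through BF produce a character. The helper names `decode_utf8_4byte` and `is_valid_final_byte` are made up for this illustration.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point."""
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte 11110xxx
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def is_valid_final_byte(lead, b3):
    """True if lead + b3 decodes as UTF-8 according to Python's decoder."""
    try:
        bytes([*lead, b3]).decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


lead = (0xF2, 0xAD, 0x92)
first = decode_utf8_4byte(*lead, 0x80)
last = decode_utf8_4byte(*lead, 0xBF)
print(f"{first:X} .. {last:X}")  # AD480 .. AD4BF

# Cross-check the manual bit arithmetic against the built-in decoder.
assert ord(bytes([*lead, 0x80]).decode("utf-8")) == first

# Only continuation bytes (10xxxxxx, i.e. 80-BF) can complete the sequence.
valid = [b for b in range(256) if is_valid_final_byte(lead, b)]
print(f"{valid[0]:02X} .. {valid[-1]:02X}, {len(valid)} values")
```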

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
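
A small check of the byte counts and the substitution character reported above, using Python's built-in codecs rather than ICU. It assumes that a UChar is one 16-bit UTF-16 code unit, so a supplementary character counts as two UChars. The helper `bytes_and_uchars` is a name invented for this sketch.

```python
def bytes_and_uchars(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    n_bytes = len(ch.encode("utf-8"))
    n_uchars = len(ch.encode("utf-16-le")) // 2
    return n_bytes, n_uchars


# ASCII, 2-byte, 3-byte, and 4-byte (supplementary) examples.
for ch in ("A", "\u00e9", "\u20ac", "\U000AD480"):
    n_bytes, n_uchars = bytes_and_uchars(ch)
    print(f"U+{ord(ch):04X}: {n_bytes} bytes / {n_uchars} UChar(s)")

# The substitution bytes are the UTF-8 encoding of U+FFFD.
substitution = b"\xEF\xBF\xBD"
print(substitution.decode("utf-8") == "\uFFFD")

# Printable ASCII is encoded as the identical byte values.
ascii_text = "".join(chr(c) for c in range(0x20, 0x7F))
print(ascii_text.encode("utf-8") == ascii_text.encode("ascii"))
```

The 4-byte case spans two UChars, which works out to two bytes per UChar. This is consistent with the reported maximum of 3, which is reached by 3-byte characters in the Basic Multilingual Plane.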

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
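
The set above excludes only the surrogate range. As a hedged illustration, the sketch below compares a direct range test against what Python's own UTF-8 encoder accepts. Both function names are hypothetical, and Python's codec stands in for the ICU converter here.

```python
def in_representable_set(cp):
    """True if cp is a Unicode code point outside U+D800..U+DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary cases on either side of the surrogate range, plus the extremes.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xAD480, 0x10FFFF):
    assert in_representable_set(cp) == python_can_encode(cp)
    print(f"U+{cp:04X}: {in_representable_set(cp)}")
```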