International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name    All Aliases
UTF-8                      UTF-8
                           ibm-1208
                           ibm-1209
                           ibm-5304
                           ibm-5305
                           ibm-13496
                           ibm-13497
                           ibm-17592
                           ibm-17593
                           windows-65001
                           cp1208
                           x-UTF_8J
                           unicode-1-1-utf-8
                           unicode-2-0-utf-8
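
The alias list can be read as a many-to-one mapping from alias to internal converter name. The following is a minimal sketch of such a lookup. The alias strings are copied from the list above; the helper name `resolve_alias` and the plain case-insensitive matching rule are assumptions made for illustration and are not claimed to reproduce ICU's own name-matching logic.

```python
from typing import Optional

# Internal converter name and aliases, copied from the list above.
CANONICAL_NAME = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption: aliases are compared case-insensitively and nothing more.
_ALIAS_INDEX = {alias.lower(): CANONICAL_NAME for alias in ALIASES}


def resolve_alias(name: str) -> Optional[str]:
    """Return the internal converter name for a listed alias, else None."""
    return _ALIAS_INDEX.get(name.strip().lower())


if __name__ == "__main__":
    for candidate in ("IBM-1208", "cp1208", "latin-1"):
        print(candidate, "->", resolve_alias(candidate))
```

A name that is not in the list, such as `latin-1`, resolves to `None` in this sketch rather than raising an error.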


Codepage Layout

Currently showing the codepage starting with the bytes F29B92

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  9B480 9B481 9B482 9B483 9B484 9B485 9B486 9B487 9B488 9B489 9B48A 9B48B 9B48C 9B48D 9B48E 9B48F
90  9B490 9B491 9B492 9B493 9B494 9B495 9B496 9B497 9B498 9B499 9B49A 9B49B 9B49C 9B49D 9B49E 9B49F
A0  9B4A0 9B4A1 9B4A2 9B4A3 9B4A4 9B4A5 9B4A6 9B4A7 9B4A8 9B4A9 9B4AA 9B4AB 9B4AC 9B4AD 9B4AE 9B4AF
B0  9B4B0 9B4B1 9B4B2 9B4B3 9B4B4 9B4B5 9B4B6 9B4B7 9B4B8 9B4B9 9B4BA 9B4BB 9B4BC 9B4BD 9B4BE 9B4BF
C0
D0
E0
F0
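
Each populated cell in the layout follows from the UTF-8 bit layout: the three leading bytes F2 9B 92 fix the upper bits of the code point, and the fourth byte (80 through BF) supplies the low six bits. The sketch below recomputes the cells with plain bit arithmetic and cross-checks one of them against Python's built-in UTF-8 codec. The function names are illustrative, and the decoder deliberately skips the overlong and range checks a production decoder would need.

```python
from typing import List

# The three leading bytes named in the layout heading.
LEAD_BYTES = bytes.fromhex("F29B92")


def decode_four_byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence by bit arithmetic.

    Simplified: checks only the byte patterns, not overlong forms
    or the U+10FFFF upper limit.
    """
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected a 4-byte sequence with an 11110xxx lead byte")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("continuation bytes must be in the range 80..BF")
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


def layout_row(row_start: int) -> List[str]:
    """Return the 16 code points (hex) for final bytes row_start..row_start+0x0F."""
    return [
        format(decode_four_byte(LEAD_BYTES + bytes([row_start + col])), "X")
        for col in range(16)
    ]


if __name__ == "__main__":
    for row_start in (0x80, 0x90, 0xA0, 0xB0):
        print(format(row_start, "02X"), " ".join(layout_row(row_start)))
    # Cross-check the first cell against the standard library codec.
    assert ord(bytes.fromhex("F29B9280").decode("utf-8")) == 0x9B480
```

Rows 00 through 70 and C0 through F0 are empty because a byte outside 80 through BF cannot be a continuation byte, so no four-byte sequence starting with F2 9B 92 ends there.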

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
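
The minimum and maximum byte counts are stated per UChar. Assuming a UChar is one 16-bit UTF-16 code unit, as in ICU4C, a four-byte UTF-8 sequence corresponds to two UChars (a surrogate pair), which would explain why the maximum is 3 rather than 4. The sketch below checks that reading with Python's standard codecs and also confirms that the substitution bytes EF BF BD encode U+FFFD. The helper name is hypothetical.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """Ratio of UTF-8 bytes to UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# One sample per UTF-8 length class: 1, 2, 3, 3, and 4 bytes.
SAMPLES = ["A", "\u00e9", "\u20ac", "\uffff", "\U0009B480"]

if __name__ == "__main__":
    for ch in SAMPLES:
        print(f"U+{ord(ch):04X}: {utf8_bytes_per_utf16_unit(ch)} bytes per UTF-16 unit")

    # The substitution character listed above is U+FFFD in UTF-8.
    assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"
    # Python's "replace" error handler substitutes the same character.
    assert b"\xff".decode("utf-8", errors="replace") == "\ufffd"
```

The four-byte sample comes out at 2 bytes per code unit, so three-byte characters in the Basic Multilingual Plane set the maximum.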

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
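
The set expression above excludes only the surrogate range U+D800 through U+DFFF. As a minimal sketch, the membership test can be written directly and compared with the behavior of Python's strict UTF-8 encoder, which also rejects lone surrogates. The function names are illustrative and not part of any ICU API.

```python
def is_representable(code_point: int) -> bool:
    """True if the code point is in the set [^\\uD800-\\uDFFF]."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("not a Unicode code point")
    return not (0xD800 <= code_point <= 0xDFFF)


def encodes_cleanly(code_point: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Boundary values around the surrogate block, plus the range endpoints.
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x9B480, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp), encodes_cleanly(cp))
```

For every value printed, the two functions agree.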