International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
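
The alias list above can be treated as a lookup table from any listed name to the internal converter name. The following is a minimal sketch, not ICU's implementation: the helper names `normalize` and `canonical_name` are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for illustration.

```python
import re

# Alias list copied from the table above; the internal converter name is "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Assumed loose matching: lowercase, then drop '-', '_' and spaces."""
    return re.sub(r"[-_ ]", "", name.lower())


# Map every normalized alias to the internal converter name.
_LOOKUP = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return "UTF-8" if `name` matches a listed alias, otherwise None."""
    return _LOOKUP.get(normalize(name))


print(canonical_name("IBM_1208"))       # matches the alias ibm-1208
print(canonical_name("windows 65001"))  # matches the alias windows-65001
print(canonical_name("latin-1"))        # not in the list above
```

In ICU4C itself, the alias table is queried through functions such as `ucnv_countAliases` and `ucnv_getAlias` rather than a hand-built dictionary.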


Codepage Layout

Currently showing the codepage starting with the bytes F2 8B 92

Rows give the high nibble of the final byte and columns give the low nibble. Each cell is the Unicode code point (hexadecimal) for that byte sequence. Rows 00-70 and C0-F0 are empty.

     00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80   8B480 8B481 8B482 8B483 8B484 8B485 8B486 8B487 8B488 8B489 8B48A 8B48B 8B48C 8B48D 8B48E 8B48F
90   8B490 8B491 8B492 8B493 8B494 8B495 8B496 8B497 8B498 8B499 8B49A 8B49B 8B49C 8B49D 8B49E 8B49F
A0   8B4A0 8B4A1 8B4A2 8B4A3 8B4A4 8B4A5 8B4A6 8B4A7 8B4A8 8B4A9 8B4AA 8B4AB 8B4AC 8B4AD 8B4AE 8B4AF
B0   8B4B0 8B4B1 8B4B2 8B4B3 8B4B4 8B4B5 8B4B6 8B4B7 8B4B8 8B4B9 8B4BA 8B4BB 8B4BC 8B4BD 8B4BE 8B4BF
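
The code points in the layout above follow from the standard UTF-8 bit layout for 4-byte sequences: 3 payload bits from the lead byte and 6 from each of the three following bytes. As a minimal sketch (the function name `decode_4byte` is hypothetical and the function does no validation), the grid can be recomputed from the lead bytes F2 8B 92 and cross-checked against Python's built-in decoder:

```python
def decode_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point (no validation)."""
    return (
        ((b0 & 0x07) << 18)    # 11110xxx: 3 payload bits
        | ((b1 & 0x3F) << 12)  # 10xxxxxx: 6 payload bits
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


LEAD = (0xF2, 0x8B, 0x92)  # the three bytes named in the layout heading
rows = {}

# Only 0x80..0xBF are valid as the final byte, so only those rows are filled.
for trail in range(0x80, 0xC0):
    cp = decode_4byte(*LEAD, trail)
    # Cross-check the manual decoding against Python's UTF-8 codec.
    assert bytes([*LEAD, trail]).decode("utf-8") == chr(cp)
    rows.setdefault(trail & 0xF0, []).append(f"{cp:05X}")

for high, cells in rows.items():
    print(f"{high:02X}   " + " ".join(cells))
```

Final bytes outside 80-BF cannot continue a UTF-8 sequence, which is why the other rows of the layout have no entries.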

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
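
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. A supplementary character takes 4 bytes in UTF-8 but also occupies two UChars, so the per-UChar maximum stays at 3. The sketch below illustrates this with Python's standard codecs; the helper name `bytes_per_uchar` and the sample characters are chosen for illustration only.

```python
def bytes_per_uchar(ch: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


# ASCII, Latin-1 range, BMP, and a supplementary code point from the layout.
for ch in ["A", "\u00e9", "\u20ac", "\U0008B480"]:
    n8, n16 = bytes_per_uchar(ch)
    print(f"U+{ord(ch):04X}: {n8} bytes, {n16} UChar(s), "
          f"{n8 / n16:g} bytes per UChar")

# The substitution character bytes \xEF\xBF\xBD are the UTF-8 form of U+FFFD.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"substitution character: U+{ord(substitution):04X}")
```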

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
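
The set above is every Unicode code point except the surrogate range D800-DFFF. As a minimal sketch, the same boundary can be observed with Python's own UTF-8 codec, which rejects lone surrogates by default; the helper names `is_representable` and `encodes_in_utf8` are hypothetical.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a code point outside the surrogate range D800..DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_in_utf8(cp: int) -> bool:
    """True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe both edges of the surrogate range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: set={is_representable(cp)} codec={encodes_in_utf8(cp)}")
```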