International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name   All Aliases
UTF-8                     UTF-8
                          ibm-1208
                          ibm-1209
                          ibm-5304
                          ibm-5305
                          ibm-13496
                          ibm-13497
                          ibm-17592
                          ibm-17593
                          windows-65001
                          cp1208
                          x-UTF_8J
                          unicode-1-1-utf-8
                          unicode-2-0-utf-8
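
The alias list above maps many external names onto one internal converter. A minimal sketch of such a lookup follows, using only the names listed on this page. The helper `resolve_converter` and the plain case-insensitive matching are assumptions made for illustration, not ICU's own API or matching rules.

```python
from typing import Optional

# Aliases copied from the table above; the lookup helper itself is hypothetical.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Map each lower-cased alias to the internal converter name.
ALIAS_TO_INTERNAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def resolve_converter(name: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_INTERNAL.get(name.strip().lower())


print(resolve_converter("IBM-1208"))   # UTF-8
print(resolve_converter("latin-1"))    # None
```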


Codepage Layout

Currently showing the codepage starting with the bytes F09B92

Rows give the high nibble and columns the low nibble of the fourth byte. Each cell shows the character followed by its code point.

       00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
00-70  (empty)
80     𛒀 1B480  𛒁 1B481  𛒂 1B482  𛒃 1B483  𛒄 1B484  𛒅 1B485  𛒆 1B486  𛒇 1B487  𛒈 1B488  𛒉 1B489  𛒊 1B48A  𛒋 1B48B  𛒌 1B48C  𛒍 1B48D  𛒎 1B48E  𛒏 1B48F
90     𛒐 1B490  𛒑 1B491  𛒒 1B492  𛒓 1B493  𛒔 1B494  𛒕 1B495  𛒖 1B496  𛒗 1B497  𛒘 1B498  𛒙 1B499  𛒚 1B49A  𛒛 1B49B  𛒜 1B49C  𛒝 1B49D  𛒞 1B49E  𛒟 1B49F
A0     𛒠 1B4A0  𛒡 1B4A1  𛒢 1B4A2  𛒣 1B4A3  𛒤 1B4A4  𛒥 1B4A5  𛒦 1B4A6  𛒧 1B4A7  𛒨 1B4A8  𛒩 1B4A9  𛒪 1B4AA  𛒫 1B4AB  𛒬 1B4AC  𛒭 1B4AD  𛒮 1B4AE  𛒯 1B4AF
B0     𛒰 1B4B0  𛒱 1B4B1  𛒲 1B4B2  𛒳 1B4B3  𛒴 1B4B4  𛒵 1B4B5  𛒶 1B4B6  𛒷 1B4B7  𛒸 1B4B8  𛒹 1B4B9  𛒺 1B4BA  𛒻 1B4BB  𛒼 1B4BC  𛒽 1B4BD  𛒾 1B4BE  𛒿 1B4BF
C0-F0  (empty)
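
The grid can be reproduced by appending one trailing byte to the lead bytes F0 9B 92 and decoding the result. The sketch below does this with Python's built-in UTF-8 codec rather than ICU, so it illustrates the byte layout only. The helper name `cell` is hypothetical.

```python
LEAD_BYTES = bytes.fromhex("F09B92")  # prefix shown on the page


def cell(trail_byte: int) -> str:
    """Decode the four-byte sequence for one cell of the grid."""
    return (LEAD_BYTES + bytes([trail_byte])).decode("utf-8")


for trail in (0x80, 0x8F, 0xBF):
    ch = cell(trail)
    print(f"{trail:02X} -> U+{ord(ch):04X}")
# 80 -> U+1B480
# 8F -> U+1B48F
# BF -> U+1B4BF

# Trail bytes outside 0x80..0xBF are not valid continuation bytes,
# which is consistent with the empty rows 00-70 and C0-F0.
try:
    cell(0x41)
except UnicodeDecodeError:
    print("41 -> invalid")
```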

Information About This Converter

Type of converter:                      UCNV_UTF8
Minimum number of bytes per UChar:      1
Maximum number of bytes per UChar:      3
Substitution character:                 \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?        TRUE
Is ASCII [\u0020-\u007E] ambiguous?     FALSE
Contains ambiguous aliases?             FALSE
Always generates Unicode NFC?           UNKNOWN
Contains BiDi characters?               TRUE
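
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit. One way to read the maximum of 3 is that a four-byte UTF-8 sequence corresponds to two UChars (a surrogate pair), so it never exceeds 2 bytes per UChar. The sketch below checks that reading with Python's standard codecs. The helper `bytes_per_uchar` is hypothetical and not part of ICU.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


samples = ["A", "\u00e9", "\u20ac", "\U0001B480"]
for c in samples:
    print(f"U+{ord(c):04X}: {bytes_per_uchar(c)}")
# U+0041: 1.0
# U+00E9: 2.0
# U+20AC: 3.0
# U+1B480: 2.0

# The substitution character bytes EF BF BD are the UTF-8 form of U+FFFD.
SUBSTITUTION = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(SUBSTITUTION):04X}")  # U+FFFD
```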

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
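
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check, Python's strict UTF-8 codec applies the same rule, although it is a separate implementation from ICU's converter. The helper `representable` is hypothetical.

```python
def representable(code_point: int) -> bool:
    """True if the code point can be encoded as strict UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


print(representable(0xD7FF))    # True, last code point before the surrogates
print(representable(0xD800))    # False, first surrogate
print(representable(0xDFFF))    # False, last surrogate
print(representable(0xE000))    # True, first code point after the surrogates
print(representable(0x1B480))   # True, the first cell of the grid shown earlier
```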