International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09E92

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𞒀
1E480
𞒁
1E481
𞒂
1E482
𞒃
1E483
𞒄
1E484
𞒅
1E485
𞒆
1E486
𞒇
1E487
𞒈
1E488
𞒉
1E489
𞒊
1E48A
𞒋
1E48B
𞒌
1E48C
𞒍
1E48D
𞒎
1E48E
𞒏
1E48F
80
90
𞒐
1E490
𞒑
1E491
𞒒
1E492
𞒓
1E493
𞒔
1E494
𞒕
1E495
𞒖
1E496
𞒗
1E497
𞒘
1E498
𞒙
1E499
𞒚
1E49A
𞒛
1E49B
𞒜
1E49C
𞒝
1E49D
𞒞
1E49E
𞒟
1E49F
90
A0
𞒠
1E4A0
𞒡
1E4A1
𞒢
1E4A2
𞒣
1E4A3
𞒤
1E4A4
𞒥
1E4A5
𞒦
1E4A6
𞒧
1E4A7
𞒨
1E4A8
𞒩
1E4A9
𞒪
1E4AA
𞒫
1E4AB
𞒬
1E4AC
𞒭
1E4AD
𞒮
1E4AE
𞒯
1E4AF
A0
B0
𞒰
1E4B0
𞒱
1E4B1
𞒲
1E4B2
𞒳
1E4B3
𞒴
1E4B4
𞒵
1E4B5
𞒶
1E4B6
𞒷
1E4B7
𞒸
1E4B8
𞒹
1E4B9
𞒺
1E4BA
𞒻
1E4BB
𞒼
1E4BC
𞒽
1E4BD
𞒾
1E4BE
𞒿
1E4BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]