International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
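
The alias list above can be treated as a lookup table from any alias to the internal converter name. The following is a minimal sketch in Python, not ICU code: the helper name `canonical_converter_name` is hypothetical, and plain case-insensitive matching is a simplifying assumption (ICU's own name matching may be more lenient).

```python
# Aliases copied from the "List of Converter Aliases" table above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Map every alias (lower-cased) to the internal converter name.
ALIAS_TO_CANONICAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_converter_name(name):
    """Hypothetical helper: return the internal name for an alias, or None."""
    return ALIAS_TO_CANONICAL.get(name.strip().lower())
```

For example, `canonical_converter_name("CP1208")` resolves to `"UTF-8"`, while a name that is not in the table yields `None`.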


Codepage Layout

Currently showing the codepage starting with the bytes F09A92

Rows give the high nibble of the final byte and columns give the low nibble. Each cell is the code point (hex) decoded from F0 9A 92 followed by that byte.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80  1A480 1A481 1A482 1A483 1A484 1A485 1A486 1A487 1A488 1A489 1A48A 1A48B 1A48C 1A48D 1A48E 1A48F
90  1A490 1A491 1A492 1A493 1A494 1A495 1A496 1A497 1A498 1A499 1A49A 1A49B 1A49C 1A49D 1A49E 1A49F
A0  1A4A0 1A4A1 1A4A2 1A4A3 1A4A4 1A4A5 1A4A6 1A4A7 1A4A8 1A4A9 1A4AA 1A4AB 1A4AC 1A4AD 1A4AE 1A4AF
B0  1A4B0 1A4B1 1A4B2 1A4B3 1A4B4 1A4B5 1A4B6 1A4B7 1A4B8 1A4B9 1A4BA 1A4BB 1A4BC 1A4BD 1A4BE 1A4BF
C0-F0  (empty)
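
The layout for the lead bytes F0 9A 92 can be reproduced by appending each possible final byte and attempting a strict UTF-8 decode. This is a sketch using Python's built-in codec rather than ICU, and the function name `layout_row` is an assumption made for illustration. Final bytes outside 80-BF are not valid trail bytes, which is why those rows come out empty.

```python
# Lead bytes of the codepage shown above.
LEAD_BYTES = bytes.fromhex("F09A92")


def layout_row(high_nibble):
    """Decode the 16 sequences whose final byte is (high_nibble << 4) | 0..15.

    Returns a list of hex code point strings, with None where the
    four-byte sequence is not valid UTF-8.
    """
    row = []
    for low in range(16):
        last = (high_nibble << 4) | low
        try:
            ch = (LEAD_BYTES + bytes([last])).decode("utf-8")
            row.append(f"{ord(ch):04X}")
        except UnicodeDecodeError:
            row.append(None)  # final byte is not a valid trail byte
    return row


for high in range(16):
    cells = layout_row(high)
    print(f"{high:X}0", " ".join(c if c else "." * 5 for c in cells))
```

Only the rows for 80, 90, A0 and B0 contain code points; every other row prints as placeholders.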

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
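
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though UTF-8 sequences can be four bytes long: a four-byte sequence corresponds to two UTF-16 code units. The sketch below checks this with Python's built-in codecs (not ICU) and also confirms that the substitution bytes are the UTF-8 form of U+FFFD. The function name and the sample characters are illustrative choices.

```python
def bytes_per_utf16_unit(ch):
    """UTF-8 byte count divided by the UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, two-byte, three-byte (BMP), and four-byte (supplementary) samples.
samples = ["A", "\u00e9", "\u20ac", "\U0001A480"]
ratios = [bytes_per_utf16_unit(c) for c in samples]

# The substitution character listed above is U+FFFD encoded as UTF-8.
substitution = "\ufffd".encode("utf-8")

print(ratios)        # bytes per UTF-16 code unit for each sample
print(substitution)  # the three substitution bytes
```

The supplementary character takes four bytes but two code units, so its ratio is 2, and the three-byte BMP character sets the maximum of 3.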

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
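
The set above says that every code point except the surrogates U+D800 through U+DFFF is representable. A quick way to see this, sketched here with Python's strict UTF-8 codec rather than ICU (the function name `utf8_representable` is hypothetical), is to try encoding code points on either side of the surrogate range.

```python
def utf8_representable(code_point):
    """Return True if the code point can be encoded as strict UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False  # lone surrogates are rejected by the strict codec


# Boundaries of the surrogate range and a few points outside it.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x1A480, 0x10FFFF):
    print(f"U+{cp:04X}", utf8_representable(cp))
```

Only U+D800 and U+DFFF report as not representable in this sample, matching the excluded range.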