International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
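
The alias list above can be turned into a small lookup table. The following is a minimal sketch, not ICU's implementation: the helper names `normalize` and `canonical_name` are hypothetical, and the matching rule (ignore case, hyphens, underscores and spaces) is a simplified assumption loosely modeled on lenient converter-name comparison.

```python
# Aliases copied from the list above; all of them map to the internal name "UTF-8".
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name: str) -> str:
    """Lower-case the name and drop '-', '_' and ' ' (assumed matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")

# Map every normalized alias to the internal converter name.
LOOKUP = {normalize(alias): "UTF-8" for alias in ALIASES}

def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return LOOKUP.get(normalize(name))

print(canonical_name("CP1208"))
print(canonical_name("x-utf_8j"))
```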


Codepage Layout

Currently showing the codepage starting with the bytes F1AA92

Rows give the high nibble of the fourth byte, columns the low nibble. Each cell gives the code point in hexadecimal. Rows 00 to 70 and C0 to F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  6A480 6A481 6A482 6A483 6A484 6A485 6A486 6A487 6A488 6A489 6A48A 6A48B 6A48C 6A48D 6A48E 6A48F
90  6A490 6A491 6A492 6A493 6A494 6A495 6A496 6A497 6A498 6A499 6A49A 6A49B 6A49C 6A49D 6A49E 6A49F
A0  6A4A0 6A4A1 6A4A2 6A4A3 6A4A4 6A4A5 6A4A6 6A4A7 6A4A8 6A4A9 6A4AA 6A4AB 6A4AC 6A4AD 6A4AE 6A4AF
B0  6A4B0 6A4B1 6A4B2 6A4B3 6A4B4 6A4B5 6A4B6 6A4B7 6A4B8 6A4B9 6A4BA 6A4BB 6A4BC 6A4BD 6A4BE 6A4BF
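
The layout can be checked by hand: the lead bytes F1 AA 92 fix the upper bits of the code point, and the fourth byte supplies the low six bits, so only continuation bytes 80 to BF yield a character. The sketch below uses Python's built-in UTF-8 codec rather than ICU, and the helper name `cell` is purely illustrative.

```python
PREFIX = bytes.fromhex("F1AA92")  # lead bytes named in the layout heading

def cell(trail: int):
    """Return the code point for PREFIX + trail byte, or None if the sequence is invalid."""
    try:
        text = (PREFIX + bytes([trail])).decode("utf-8")
    except UnicodeDecodeError:
        return None  # not a valid continuation byte, so the cell stays empty
    return ord(text)

# Print only the rows that contain at least one valid cell.
for row in range(0x00, 0x100, 0x10):
    cells = [cell(row + col) for col in range(16)]
    if any(c is not None for c in cells):
        line = " ".join(f"{c:05X}" if c is not None else "     " for c in cells)
        print(f"{row:02X}  {line}")
```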

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
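
The minimum and maximum of 1 and 3 bytes are counted per UChar, which in ICU4C is one 16-bit UTF-16 code unit. A supplementary character takes 4 bytes in UTF-8 but also two UChars, so it averages 2 bytes per UChar and does not raise the maximum. The sketch below illustrates this with Python's built-in codecs; `bytes_per_uchar` is a hypothetical helper, not an ICU function.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / utf16_units

# One sample each for 1-, 2-, 3- and 4-byte UTF-8 sequences.
for ch in ["A", "\u00e9", "\u20ac", "\U0006A480"]:
    print(f"U+{ord(ch):04X}", bytes_per_uchar(ch))

# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
print(b"\xEF\xBF\xBD".decode("utf-8") == "\uFFFD")
```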


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
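
The set above says that every code point except the surrogate range U+D800 to U+DFFF is representable. A quick way to see the boundary is to try encoding code points on either side of it. This is a sketch using Python's strict UTF-8 codec as a stand-in for the ICU converter; `representable` is an illustrative name.

```python
def representable(cp: int) -> bool:
    """True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(cp).encode("utf-8")  # strict mode rejects lone surrogates
    except UnicodeEncodeError:
        return False
    return True

# Probe the edges of the surrogate range and the top of the code space.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", representable(cp))
```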