International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3BF8B

Rows 00-70 and C0-F0 are empty for this lead sequence. Rows 80-B0 list the code point (hex) for each final byte; the final byte is the row value plus the column value.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  FF2C0 FF2C1 FF2C2 FF2C3 FF2C4 FF2C5 FF2C6 FF2C7 FF2C8 FF2C9 FF2CA FF2CB FF2CC FF2CD FF2CE FF2CF
90  FF2D0 FF2D1 FF2D2 FF2D3 FF2D4 FF2D5 FF2D6 FF2D7 FF2D8 FF2D9 FF2DA FF2DB FF2DC FF2DD FF2DE FF2DF
A0  FF2E0 FF2E1 FF2E2 FF2E3 FF2E4 FF2E5 FF2E6 FF2E7 FF2E8 FF2E9 FF2EA FF2EB FF2EC FF2ED FF2EE FF2EF
B0  FF2F0 FF2F1 FF2F2 FF2F3 FF2F4 FF2F5 FF2F6 FF2F7 FF2F8 FF2F9 FF2FA FF2FB FF2FC FF2FD FF2FE FF2FF
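
The code points on this page follow directly from UTF-8's bit layout for four-byte sequences. As a rough sketch, the snippet below rebuilds the page by appending each trail byte 80-BF to the lead bytes F3 BF 8B, once with Python's built-in UTF-8 codec and once by hand. The names `LEAD`, `decode_manually`, and `page_code_points` are introduced here for illustration only.

```python
# Lead bytes of the page shown above.
LEAD = bytes([0xF3, 0xBF, 0x8B])


def decode_manually(seq):
    """Decode one 4-byte UTF-8 sequence by masking and shifting.

    The first byte contributes 3 payload bits, each following byte 6 bits.
    No validation is done; the input is assumed to be well formed.
    """
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def page_code_points():
    """Code points for LEAD followed by each trail byte 0x80..0xBF."""
    return [
        ord((LEAD + bytes([trail])).decode("utf-8"))
        for trail in range(0x80, 0xC0)
    ]


if __name__ == "__main__":
    cps = page_code_points()
    # First and last cells of the page: FF2C0 and FF2FF.
    print(format(cps[0], "X"), format(cps[-1], "X"))
    # The hand-rolled decoder agrees with the built-in codec.
    print(decode_manually(LEAD + b"\x80") == cps[0])
```

Only trail bytes 80-BF are valid continuation bytes, which is why the other rows of the grid are empty.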

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
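
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though UTF-8 sequences can be 4 bytes long: a 4-byte sequence corresponds to two UChars (a surrogate pair). The sketch below checks this with Python's standard codecs rather than ICU; `lengths` is a hypothetical helper, and the `errors="replace"` behavior shown is Python's, used only to illustrate that the substitution bytes are the UTF-8 form of U+FFFD.

```python
def lengths(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


if __name__ == "__main__":
    # ASCII: 1 byte for 1 UChar (the minimum).
    print(lengths("A"))
    # A BMP character beyond U+07FF: 3 bytes for 1 UChar (the maximum).
    print(lengths("\u20ac"))
    # A supplementary character: 4 bytes for 2 UChars, so 2 bytes per UChar.
    print(lengths("\U000FF2C0"))

    # The substitution bytes EF BF BD are U+FFFD encoded as UTF-8.
    print("\ufffd".encode("utf-8") == b"\xef\xbf\xbd")
    # Python's own decoder replaces an invalid byte with the same character.
    print(b"\xff".decode("utf-8", errors="replace") == "\ufffd")
```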

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
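
The set above says that every code point except the surrogates U+D800 to U+DFFF can be represented. A small sketch of the same rule using Python's strict UTF-8 codec follows (not ICU itself); `is_representable` is a hypothetical helper name.

```python
def is_representable(code_point):
    """True if the code point can be encoded by a strict UTF-8 encoder."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 codec rejects lone surrogates.
        return False


if __name__ == "__main__":
    for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print(format(cp, "04X"), is_representable(cp))
```

The boundary values U+D7FF and U+E000 encode normally; only the range between them is excluded.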