International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38C8B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󌋀
CC2C0
󌋁
CC2C1
󌋂
CC2C2
󌋃
CC2C3
󌋄
CC2C4
󌋅
CC2C5
󌋆
CC2C6
󌋇
CC2C7
󌋈
CC2C8
󌋉
CC2C9
󌋊
CC2CA
󌋋
CC2CB
󌋌
CC2CC
󌋍
CC2CD
󌋎
CC2CE
󌋏
CC2CF
80
90
󌋐
CC2D0
󌋑
CC2D1
󌋒
CC2D2
󌋓
CC2D3
󌋔
CC2D4
󌋕
CC2D5
󌋖
CC2D6
󌋗
CC2D7
󌋘
CC2D8
󌋙
CC2D9
󌋚
CC2DA
󌋛
CC2DB
󌋜
CC2DC
󌋝
CC2DD
󌋞
CC2DE
󌋟
CC2DF
90
A0
󌋠
CC2E0
󌋡
CC2E1
󌋢
CC2E2
󌋣
CC2E3
󌋤
CC2E4
󌋥
CC2E5
󌋦
CC2E6
󌋧
CC2E7
󌋨
CC2E8
󌋩
CC2E9
󌋪
CC2EA
󌋫
CC2EB
󌋬
CC2EC
󌋭
CC2ED
󌋮
CC2EE
󌋯
CC2EF
A0
B0
󌋰
CC2F0
󌋱
CC2F1
󌋲
CC2F2
󌋳
CC2F3
󌋴
CC2F4
󌋵
CC2F5
󌋶
CC2F6
󌋷
CC2F7
󌋸
CC2F8
󌋹
CC2F9
󌋺
CC2FA
󌋻
CC2FB
󌋼
CC2FC
󌋽
CC2FD
󌋾
CC2FE
󌋿
CC2FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]