International Components for Unicode


UTF-8


List of Converter Aliases

Internal converter name: UTF-8

All aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
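
The alias list above can be treated as a lookup table from any alias to the internal converter name. The following is a minimal sketch in Python, not ICU's own implementation. The names `normalize` and `canonical_name` are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplified assumption about how alias names are compared.

```python
from typing import Optional

# Aliases copied from the table above; all map to the internal name "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and ' ' (assumed loose matching)."""
    return "".join(c for c in name.lower() if c not in "-_ ")


# Hypothetical lookup table: normalized alias -> internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))
```

For example, `canonical_name("IBM_1208")` resolves to `"UTF-8"` under this rule, while a name that is not in the list returns `None`.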


Codepage Layout

Currently showing the codepage starting with the bytes F3 85 8B

Final byte   Code points
00-7F        (empty)
80-8F        U+C52C0 to U+C52CF
90-9F        U+C52D0 to U+C52DF
A0-AF        U+C52E0 to U+C52EF
B0-BF        U+C52F0 to U+C52FF
C0-FF        (empty)
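
The mapping in the layout above follows from the standard UTF-8 bit layout for four-byte sequences. As a rough sketch (the function name `decode_four_byte` is made up for illustration), the code point for each final byte can be computed by hand and cross-checked against Python's built-in decoder:

```python
def decode_four_byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point by bit arithmetic."""
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trailing bytes must have the form 10xxxxxx")
    # 3 payload bits from the lead byte, 6 from each trailing byte.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


prefix = bytes.fromhex("F3858B")                 # the three bytes shown on this page
first = decode_four_byte(prefix + b"\x80")       # lowest valid final byte
last = decode_four_byte(prefix + b"\xBF")        # highest valid final byte
builtin = ord((prefix + b"\x80").decode("utf-8"))  # cross-check with the codec

print(f"U+{first:04X} .. U+{last:04X}")
```

Only final bytes 80 through BF have the `10xxxxxx` form, which is why the other rows of the layout are empty.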

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
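
Several of the properties above can be checked without ICU. The sketch below assumes that a UChar is a 16-bit UTF-16 code unit, so a supplementary character (4 bytes in UTF-8, 2 code units in UTF-16) averages 2 bytes per UChar and the maximum of 3 comes from BMP characters. The helper name `utf8_bytes_per_utf16_unit` is illustrative only.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


ratios = {
    "ascii": utf8_bytes_per_utf16_unit("A"),                   # 1 byte, 1 unit
    "bmp": utf8_bytes_per_utf16_unit("\u20ac"),                # 3 bytes, 1 unit
    "supplementary": utf8_bytes_per_utf16_unit("\U000C52C0"),  # 4 bytes, 2 units
}

# The substitution bytes EF BF BD are the UTF-8 encoding of U+FFFD.
substitution = bytes.fromhex("EFBFBD").decode("utf-8")

# ASCII compatibility: every byte in 20..7E encodes to itself.
ascii_compatible = all(
    chr(b).encode("utf-8") == bytes([b]) for b in range(0x20, 0x7F)
)
```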

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
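
The set above excludes only the surrogate range U+D800 to U+DFFF. Python's strict UTF-8 codec applies the same restriction, so it can serve as a quick check; this is a sketch using Python's codec rather than the ICU converter, and `representable` is a hypothetical helper name.

```python
def representable(code_point: int) -> bool:
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary checks around the excluded surrogate range.
checks = {
    0xD7FF: representable(0xD7FF),      # just below the surrogates
    0xD800: representable(0xD800),      # first surrogate
    0xDFFF: representable(0xDFFF),      # last surrogate
    0xE000: representable(0xE000),      # just above the surrogates
    0x10FFFF: representable(0x10FFFF),  # highest Unicode code point
}
```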