International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3AE93

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󮓀
EE4C0
󮓁
EE4C1
󮓂
EE4C2
󮓃
EE4C3
󮓄
EE4C4
󮓅
EE4C5
󮓆
EE4C6
󮓇
EE4C7
󮓈
EE4C8
󮓉
EE4C9
󮓊
EE4CA
󮓋
EE4CB
󮓌
EE4CC
󮓍
EE4CD
󮓎
EE4CE
󮓏
EE4CF
80
90
󮓐
EE4D0
󮓑
EE4D1
󮓒
EE4D2
󮓓
EE4D3
󮓔
EE4D4
󮓕
EE4D5
󮓖
EE4D6
󮓗
EE4D7
󮓘
EE4D8
󮓙
EE4D9
󮓚
EE4DA
󮓛
EE4DB
󮓜
EE4DC
󮓝
EE4DD
󮓞
EE4DE
󮓟
EE4DF
90
A0
󮓠
EE4E0
󮓡
EE4E1
󮓢
EE4E2
󮓣
EE4E3
󮓤
EE4E4
󮓥
EE4E5
󮓦
EE4E6
󮓧
EE4E7
󮓨
EE4E8
󮓩
EE4E9
󮓪
EE4EA
󮓫
EE4EB
󮓬
EE4EC
󮓭
EE4ED
󮓮
EE4EE
󮓯
EE4EF
A0
B0
󮓰
EE4F0
󮓱
EE4F1
󮓲
EE4F2
󮓳
EE4F3
󮓴
EE4F4
󮓵
EE4F5
󮓶
EE4F6
󮓷
EE4F7
󮓸
EE4F8
󮓹
EE4F9
󮓺
EE4FA
󮓻
EE4FB
󮓼
EE4FC
󮓽
EE4FD
󮓾
EE4FE
󮓿
EE4FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]