International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38C93

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󌓀
CC4C0
󌓁
CC4C1
󌓂
CC4C2
󌓃
CC4C3
󌓄
CC4C4
󌓅
CC4C5
󌓆
CC4C6
󌓇
CC4C7
󌓈
CC4C8
󌓉
CC4C9
󌓊
CC4CA
󌓋
CC4CB
󌓌
CC4CC
󌓍
CC4CD
󌓎
CC4CE
󌓏
CC4CF
80
90
󌓐
CC4D0
󌓑
CC4D1
󌓒
CC4D2
󌓓
CC4D3
󌓔
CC4D4
󌓕
CC4D5
󌓖
CC4D6
󌓗
CC4D7
󌓘
CC4D8
󌓙
CC4D9
󌓚
CC4DA
󌓛
CC4DB
󌓜
CC4DC
󌓝
CC4DD
󌓞
CC4DE
󌓟
CC4DF
90
A0
󌓠
CC4E0
󌓡
CC4E1
󌓢
CC4E2
󌓣
CC4E3
󌓤
CC4E4
󌓥
CC4E5
󌓦
CC4E6
󌓧
CC4E7
󌓨
CC4E8
󌓩
CC4E9
󌓪
CC4EA
󌓫
CC4EB
󌓬
CC4EC
󌓭
CC4ED
󌓮
CC4EE
󌓯
CC4EF
A0
B0
󌓰
CC4F0
󌓱
CC4F1
󌓲
CC4F2
󌓳
CC4F3
󌓴
CC4F4
󌓵
CC4F5
󌓶
CC4F6
󌓷
CC4F7
󌓸
CC4F8
󌓹
CC4F9
󌓺
CC4FA
󌓻
CC4FB
󌓼
CC4FC
󌓽
CC4FD
󌓾
CC4FE
󌓿
CC4FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]