International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
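
The alias list above can be treated as a simple lookup table. The following is a minimal sketch, not ICU's implementation: the names `UTF8_ALIASES` and `canonical_name` are hypothetical, and the matching rule used here (case-insensitive, surrounding whitespace ignored) is an assumption made for illustration. ICU's own alias matching may be more lenient.

```python
# Aliases copied from the table above; all of them resolve to the
# internal converter name "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption: compare names case-insensitively. This is a simplification
# chosen for the sketch, not a statement of ICU's matching rules.
_LOOKUP = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return _LOOKUP.get(name.strip().lower())


print(canonical_name("IBM-1208"))       # UTF-8
print(canonical_name("windows-65001"))  # UTF-8
print(canonical_name("latin1"))         # None
```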


Codepage Layout

Currently showing the codepage starting with the bytes F38E81

Rows 00-70 and C0-F0 contain no characters. Each cell lists the code point in hexadecimal.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  CE040 CE041 CE042 CE043 CE044 CE045 CE046 CE047 CE048 CE049 CE04A CE04B CE04C CE04D CE04E CE04F
90  CE050 CE051 CE052 CE053 CE054 CE055 CE056 CE057 CE058 CE059 CE05A CE05B CE05C CE05D CE05E CE05F
A0  CE060 CE061 CE062 CE063 CE064 CE065 CE066 CE067 CE068 CE069 CE06A CE06B CE06C CE06D CE06E CE06F
B0  CE070 CE071 CE072 CE073 CE074 CE075 CE076 CE077 CE078 CE079 CE07A CE07B CE07C CE07D CE07E CE07F
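
The code points in the layout follow directly from the UTF-8 bit layout: the lead byte of a four-byte sequence carries 3 payload bits and each continuation byte carries 6. The sketch below recomputes the 64 populated cells from the prefix F3 8E 81 and cross-checks them against Python's built-in UTF-8 codec. The helper name `decode_utf8_4byte` is hypothetical, and Python's codec is used here only as an independent check; it is not the ICU converter.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point."""
    # Lead byte 11110xxx carries 3 bits; each continuation byte 10xxxxxx carries 6.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


LEAD = (0xF3, 0x8E, 0x81)  # the three-byte prefix shown on this page

# Only 0x80..0xBF are valid continuation bytes, which is why the other
# rows of the layout are empty.
table = {b3: decode_utf8_4byte(*LEAD, b3) for b3 in range(0x80, 0xC0)}

for b3, cp in table.items():
    # Cross-check each cell with Python's own UTF-8 decoder.
    assert ord(bytes([*LEAD, b3]).decode("utf-8")) == cp

print(f"{table[0x80]:X} .. {table[0xBF]:X}")  # CE040 .. CE07F
```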

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
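
The minimum and maximum of 1 and 3 bytes are counted per UChar, that is, per 16-bit UTF-16 code unit, not per code point. A supplementary character takes four UTF-8 bytes but also two UTF-16 code units, so it averages two bytes per unit. The sketch below illustrates this with a handful of sample characters and shows that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD. The helper name `bytes_per_uchar` and the sample characters are choices made for this illustration; Python's codecs stand in for the ICU converter.

```python
def bytes_per_uchar(ch):
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# ASCII, 2-byte, 3-byte, and 4-byte (supplementary) examples.
samples = ["A", "\u00e9", "\u20ac", "\U000CE040"]
ratios = [bytes_per_uchar(c) for c in samples]
print(ratios)                    # [1.0, 2.0, 3.0, 2.0]
print(min(ratios), max(ratios))  # 1.0 3.0

# The substitution character listed above is U+FFFD encoded as UTF-8.
substitution = "\ufffd".encode("utf-8")
print(substitution)              # b'\xef\xbf\xbd'

# A decoder that substitutes on malformed input yields U+FFFD.
print(b"\xff".decode("utf-8", errors="replace") == "\ufffd")  # True
```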


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
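
The set above says that every code point except the surrogates U+D800 through U+DFFF is representable. As a rough check, the sketch below probes the boundaries of that range with Python's strict UTF-8 encoder, which also rejects lone surrogates. The function name `is_representable` is hypothetical, and Python's encoder is assumed to agree with the converter on this point.

```python
def is_representable(cp):
    """True if the code point can be encoded by a strict UTF-8 encoder."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates U+D800..U+DFFF.
        return False


for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xCE040, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp))

# Size of the set: all code points minus the 2048 surrogates.
print(0x110000 - (0xDFFF - 0xD800 + 1))  # 1112064
```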