International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A393

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򣓀
A34C0
򣓁
A34C1
򣓂
A34C2
򣓃
A34C3
򣓄
A34C4
򣓅
A34C5
򣓆
A34C6
򣓇
A34C7
򣓈
A34C8
򣓉
A34C9
򣓊
A34CA
򣓋
A34CB
򣓌
A34CC
򣓍
A34CD
򣓎
A34CE
򣓏
A34CF
80
90
򣓐
A34D0
򣓑
A34D1
򣓒
A34D2
򣓓
A34D3
򣓔
A34D4
򣓕
A34D5
򣓖
A34D6
򣓗
A34D7
򣓘
A34D8
򣓙
A34D9
򣓚
A34DA
򣓛
A34DB
򣓜
A34DC
򣓝
A34DD
򣓞
A34DE
򣓟
A34DF
90
A0
򣓠
A34E0
򣓡
A34E1
򣓢
A34E2
򣓣
A34E3
򣓤
A34E4
򣓥
A34E5
򣓦
A34E6
򣓧
A34E7
򣓨
A34E8
򣓩
A34E9
򣓪
A34EA
򣓫
A34EB
򣓬
A34EC
򣓭
A34ED
򣓮
A34EE
򣓯
A34EF
A0
B0
򣓰
A34F0
򣓱
A34F1
򣓲
A34F2
򣓳
A34F3
򣓴
A34F4
򣓵
A34F5
򣓶
A34F6
򣓷
A34F7
򣓸
A34F8
򣓹
A34F9
򣓺
A34FA
򣓻
A34FB
򣓼
A34FC
򣓽
A34FD
򣓾
A34FE
򣓿
A34FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]