International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F29493

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򔓀
944C0
򔓁
944C1
򔓂
944C2
򔓃
944C3
򔓄
944C4
򔓅
944C5
򔓆
944C6
򔓇
944C7
򔓈
944C8
򔓉
944C9
򔓊
944CA
򔓋
944CB
򔓌
944CC
򔓍
944CD
򔓎
944CE
򔓏
944CF
80
90
򔓐
944D0
򔓑
944D1
򔓒
944D2
򔓓
944D3
򔓔
944D4
򔓕
944D5
򔓖
944D6
򔓗
944D7
򔓘
944D8
򔓙
944D9
򔓚
944DA
򔓛
944DB
򔓜
944DC
򔓝
944DD
򔓞
944DE
򔓟
944DF
90
A0
򔓠
944E0
򔓡
944E1
򔓢
944E2
򔓣
944E3
򔓤
944E4
򔓥
944E5
򔓦
944E6
򔓧
944E7
򔓨
944E8
򔓩
944E9
򔓪
944EA
򔓫
944EB
򔓬
944EC
򔓭
944ED
򔓮
944EE
򔓯
944EF
A0
B0
򔓰
944F0
򔓱
944F1
򔓲
944F2
򔓳
944F3
򔓴
944F4
򔓵
944F5
򔓶
944F6
򔓷
944F7
򔓸
944F8
򔓹
944F9
򔓺
944FA
򔓻
944FB
򔓼
944FC
򔓽
944FD
򔓾
944FE
򔓿
944FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]