International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
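
The alias list above can be treated as a lookup table that maps any listed name back to the internal converter name. The following is a minimal sketch, not ICU's implementation: the helper names `normalize` and `canonical_name` are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for illustration, since this page does not state the exact comparison rules.

```python
# Aliases copied from the list above; all of them name the same converter.
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name):
    """Assumed loose matching: lowercase, drop '-', '_' and spaces."""
    return "".join(c for c in name.lower() if c not in "-_ ")

# Map every normalized alias to the internal converter name.
CANONICAL = {normalize(alias): "UTF-8" for alias in ALIASES}

def canonical_name(name):
    """Return the internal converter name, or None if the alias is unknown."""
    return CANONICAL.get(normalize(name))

print(canonical_name("IBM_1208"))
print(canonical_name("latin-1"))
```

Under this assumed rule, spellings such as `utf8` or `Windows 65001` resolve to the same converter, while names that are not in the list return `None`.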


Codepage Layout

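Currently showing the codepage for four-byte sequences starting with the bytes F2 A7 99. Row and column labels give the high and low nibble of the final byte, and each cell shows the resulting Unicode code point in hexadecimal.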

     00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80   A7640 A7641 A7642 A7643 A7644 A7645 A7646 A7647 A7648 A7649 A764A A764B A764C A764D A764E A764F
90   A7650 A7651 A7652 A7653 A7654 A7655 A7656 A7657 A7658 A7659 A765A A765B A765C A765D A765E A765F
A0   A7660 A7661 A7662 A7663 A7664 A7665 A7666 A7667 A7668 A7669 A766A A766B A766C A766D A766E A766F
B0   A7670 A7671 A7672 A7673 A7674 A7675 A7676 A7677 A7678 A7679 A767A A767B A767C A767D A767E A767F
C0-F0  (empty)
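
The layout above follows directly from the UTF-8 bit layout for four-byte sequences: the lead byte contributes 3 bits and each trailing byte contributes 6. As a minimal sketch (the function names `decode_utf8_4byte` and `page_row` are hypothetical, and the decoder checks only the byte markers, not the full set of well-formedness rules), the table can be recomputed like this:

```python
def decode_utf8_4byte(b):
    """Decode one 4-byte UTF-8 sequence into a code point.

    Only the lead/trail byte markers are checked; overlong forms and
    out-of-range values are not rejected in this sketch.
    """
    if len(b) != 4 or b[0] & 0xF8 != 0xF0 or any(x & 0xC0 != 0x80 for x in b[1:]):
        raise ValueError("not a 4-byte UTF-8 sequence")
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )

# The three leading bytes shown on this page.
PREFIX = bytes.fromhex("F2A799")

def page_row(high_nibble):
    """Code points for one table row, e.g. high_nibble=0x8 for row 80."""
    return [
        decode_utf8_4byte(PREFIX + bytes([(high_nibble << 4) | low]))
        for low in range(16)
    ]

for row in (0x8, 0x9, 0xA, 0xB):
    print(f"{row << 4:02X}", " ".join(f"{cp:X}" for cp in page_row(row)))
```

Only final bytes 80 through BF are valid trailing bytes, which is why rows 00 through 70 and C0 through F0 are empty.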

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
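
The byte counts above are per UChar, which is a 16-bit UTF-16 code unit rather than a whole code point. That is why the maximum is 3 even though a supplementary character takes 4 bytes in UTF-8: it also takes 2 code units. The sketch below uses Python's built-in codecs rather than ICU to illustrate the ratio, the substitution bytes, and the ASCII-compatibility property; `bytes_per_uchar` is a hypothetical helper name.

```python
def bytes_per_uchar(ch):
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units

# The substitution bytes listed above are the UTF-8 form of U+FFFD.
SUBSTITUTION = b"\xEF\xBF\xBD"

print(bytes_per_uchar("A"))           # ASCII: 1 byte over 1 code unit
print(bytes_per_uchar("\u20AC"))      # BMP: 3 bytes over 1 code unit
print(bytes_per_uchar(chr(0xA7640)))  # supplementary: 4 bytes over 2 code units

# ASCII compatibility: each printable ASCII character encodes to its own byte.
ascii_compatible = all(
    chr(c).encode("utf-8") == bytes([c]) for c in range(0x20, 0x7F)
)
print(ascii_compatible)
```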

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
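
The set pattern above reads as "every code point except the surrogate range U+D800 through U+DFFF". A minimal sketch of that membership test follows, cross-checked against Python's own strict UTF-8 codec rather than ICU; `is_representable` and `codec_accepts` are hypothetical names.

```python
MAX_CODE_POINT = 0x10FFFF

def is_representable(cp):
    """True if cp is a valid code point outside the surrogate range."""
    return 0 <= cp <= MAX_CODE_POINT and not (0xD800 <= cp <= 0xDFFF)

def codec_accepts(cp):
    """Cross-check: does Python's strict UTF-8 encoder accept this code point?"""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# Check the boundaries of the excluded range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xA7640):
    print(f"U+{cp:04X}", is_representable(cp), codec_accepts(cp))

# Size of the set: all code points minus the 2048 surrogates.
print((MAX_CODE_POINT + 1) - (0xDFFF - 0xD800 + 1))
```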