International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
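
The alias list above can be read as a many-to-one mapping onto the internal converter name. The following is a minimal sketch of such a lookup, not ICU's own implementation: `ALIASES`, `normalize`, and `canonical_name` are hypothetical names introduced here, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption.

```python
# Aliases copied from the list above; all of them name the same converter.
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]
CANONICAL = "UTF-8"


def normalize(name: str) -> str:
    """Hypothetical loose key: lowercase, with '-', '_' and ' ' removed."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Index every alias under its normalized key.
_INDEX = {normalize(alias): CANONICAL for alias in ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return _INDEX.get(normalize(name))


print(canonical_name("IBM_1208"))
print(canonical_name("latin-1"))
```

With this table, `IBM_1208` and `windows-65001` both resolve to `UTF-8`, while a name that is not in the list resolves to `None`.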


Codepage Layout

Currently showing the codepage starting with the bytes F3 81 93.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  C14C0 C14C1 C14C2 C14C3 C14C4 C14C5 C14C6 C14C7 C14C8 C14C9 C14CA C14CB C14CC C14CD C14CE C14CF
90  C14D0 C14D1 C14D2 C14D3 C14D4 C14D5 C14D6 C14D7 C14D8 C14D9 C14DA C14DB C14DC C14DD C14DE C14DF
A0  C14E0 C14E1 C14E2 C14E3 C14E4 C14E5 C14E6 C14E7 C14E8 C14E9 C14EA C14EB C14EC C14ED C14EE C14EF
B0  C14F0 C14F1 C14F2 C14F3 C14F4 C14F5 C14F6 C14F7 C14F8 C14F9 C14FA C14FB C14FC C14FD C14FE C14FF
C0
D0
E0
F0
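
The layout above can be reproduced with a short sketch: appending each possible fourth byte to the lead bytes F3 81 93 and decoding the result as UTF-8. The example uses Python's built-in codec rather than ICU, and the helper names (`decode_with_python`, `decode_by_hand`, `is_valid_trail`) are illustrative, not part of any ICU API.

```python
# Lead bytes taken from the page: F3 81 93.
LEAD = bytes.fromhex("F38193")


def decode_with_python(trail: int) -> int:
    """Decode LEAD + trail with Python's built-in UTF-8 codec."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def decode_by_hand(trail: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence manually."""
    b0, b1, b2 = LEAD
    return (
        ((b0 & 0x07) << 18)      # 3 payload bits from the first byte
        | ((b1 & 0x3F) << 12)    # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )


def is_valid_trail(trail: int) -> bool:
    """True if LEAD + trail is a well-formed UTF-8 sequence."""
    try:
        (LEAD + bytes([trail])).decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# One code point per trailing byte 0x80..0xBF (the populated rows 80 to B0).
code_points = [decode_with_python(t) for t in range(0x80, 0xC0)]

print(f"U+{code_points[0]:X} .. U+{code_points[-1]:X}")  # U+C14C0 .. U+C14FF
```

Only trailing bytes in the range 80 to BF are valid continuation bytes, which is why rows 00 to 70 and C0 to F0 of the grid are empty.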

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
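
One way to read the byte counts above is as UTF-8 bytes per UTF-16 code unit, assuming a UChar is a 16-bit UTF-16 code unit. The sketch below checks that reading with Python's codecs rather than ICU; `bytes_per_uchar` is a hypothetical helper, and the sample characters are chosen only for illustration.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


samples = [
    "A",           # U+0041: 1 byte, 1 code unit
    "\u00E9",      # U+00E9: 2 bytes, 1 code unit
    "\u20AC",      # U+20AC: 3 bytes, 1 code unit
    "\U000C14C0",  # U+C14C0: 4 bytes, 2 code units (a surrogate pair)
]
ratios = [bytes_per_uchar(ch) for ch in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The substitution character bytes decode to U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")  # U+FFFD
```

Under this reading, a supplementary character takes four bytes but spans two code units, so the per-unit maximum stays at 3.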

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
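
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check, the sketch below compares that rule with the behaviour of Python's strict UTF-8 encoder, which stands in for the ICU converter here; `in_set` and `python_can_encode` are hypothetical helper names.

```python
def in_set(cp: int) -> bool:
    """Membership test for [^\\uD800-\\uDFFF] over the Unicode code space."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Spot checks around the boundaries of the excluded range.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xC14C0, 0x10FFFF):
    print(f"U+{cp:04X}", in_set(cp), python_can_encode(cp))

# Total number of representable code points: everything minus 2048 surrogates.
set_size = 0x110000 - (0xDFFF - 0xD800 + 1)
print(set_size)  # 1112064
```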