International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3809C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󀜀
C0700
󀜁
C0701
󀜂
C0702
󀜃
C0703
󀜄
C0704
󀜅
C0705
󀜆
C0706
󀜇
C0707
󀜈
C0708
󀜉
C0709
󀜊
C070A
󀜋
C070B
󀜌
C070C
󀜍
C070D
󀜎
C070E
󀜏
C070F
80
90
󀜐
C0710
󀜑
C0711
󀜒
C0712
󀜓
C0713
󀜔
C0714
󀜕
C0715
󀜖
C0716
󀜗
C0717
󀜘
C0718
󀜙
C0719
󀜚
C071A
󀜛
C071B
󀜜
C071C
󀜝
C071D
󀜞
C071E
󀜟
C071F
90
A0
󀜠
C0720
󀜡
C0721
󀜢
C0722
󀜣
C0723
󀜤
C0724
󀜥
C0725
󀜦
C0726
󀜧
C0727
󀜨
C0728
󀜩
C0729
󀜪
C072A
󀜫
C072B
󀜬
C072C
󀜭
C072D
󀜮
C072E
󀜯
C072F
A0
B0
󀜰
C0730
󀜱
C0731
󀜲
C0732
󀜳
C0733
󀜴
C0734
󀜵
C0735
󀜶
C0736
󀜷
C0737
󀜸
C0738
󀜹
C0739
󀜺
C073A
󀜻
C073B
󀜼
C073C
󀜽
C073D
󀜾
C073E
󀜿
C073F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]