International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
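
The alias list above can be treated as a lookup table that maps every alias to the internal converter name. The following is a minimal sketch in Python, not ICU's implementation. The names `ALIASES`, `normalize`, and `canonical_name` are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption simplified from how ICU is generally described as comparing converter names.

```python
# Aliases copied from the list above; all of them map to the internal name "UTF-8".
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Assumed loose matching: lowercase and drop '-', '_' and spaces."""
    return "".join(c for c in name.lower() if c not in "-_ ")


# Hypothetical lookup table keyed by the normalized alias.
LOOKUP = {normalize(alias): "UTF-8" for alias in ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return LOOKUP.get(normalize(name))


if __name__ == "__main__":
    for candidate in ("IBM_1208", "windows-65001", "latin1"):
        print(candidate, "->", canonical_name(candidate))
```

Under this assumed rule, spellings such as `IBM_1208` resolve to the same entry as `ibm-1208`, while a name that is not in the list returns `None`.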


Codepage Layout

Currently showing the codepage starting with the bytes F2939C

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  93700 93701 93702 93703 93704 93705 93706 93707 93708 93709 9370A 9370B 9370C 9370D 9370E 9370F
90  93710 93711 93712 93713 93714 93715 93716 93717 93718 93719 9371A 9371B 9371C 9371D 9371E 9371F
A0  93720 93721 93722 93723 93724 93725 93726 93727 93728 93729 9372A 9372B 9372C 9372D 9372E 9372F
B0  93730 93731 93732 93733 93734 93735 93736 93737 93738 93739 9373A 9373B 9373C 9373D 9373E 9373F
C0
D0
E0
F0
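
The code points in the layout follow from the UTF-8 bit layout: a 4-byte sequence carries 3 payload bits in the lead byte and 6 in each continuation byte. The sketch below recomputes one row of the table from the prefix F2 93 9C. The function names `decode_utf8_4byte` and `layout_row` are hypothetical, and the decoder is deliberately minimal (it does not check for overlong forms or out-of-range values).

```python
PREFIX = bytes.fromhex("F2939C")  # the three leading bytes shown on this page


def decode_utf8_4byte(data):
    """Decode one 4-byte UTF-8 sequence into a code point (minimal checks only)."""
    if len(data) != 4 or data[0] >> 3 != 0b11110:
        raise ValueError("expected a 4-byte UTF-8 sequence")
    code_point = data[0] & 0x07              # 3 payload bits from the lead byte
    for byte in data[1:]:
        if byte >> 6 != 0b10:                # continuation bytes are 0x80..0xBF
            raise ValueError("invalid continuation byte")
        code_point = (code_point << 6) | (byte & 0x3F)  # 6 payload bits each
    return code_point


def layout_row(row_start):
    """Return the 16 code points (as hex strings) for final bytes row_start..row_start+0x0F."""
    return [
        format(decode_utf8_4byte(PREFIX + bytes([row_start + column])), "X")
        for column in range(16)
    ]


if __name__ == "__main__":
    for row in (0x80, 0x90, 0xA0, 0xB0):
        print(format(row, "02X"), " ".join(layout_row(row)))
```

Only final bytes 80 through BF are valid continuation bytes, which is consistent with the other rows of the layout being empty; calling `layout_row(0x70)` in this sketch raises `ValueError`.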

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
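
One way to read the byte counts above: a UChar in ICU is a 16-bit UTF-16 code unit, so a supplementary character occupies 4 UTF-8 bytes but two UChars, which keeps the per-UChar maximum at 3. The sketch below checks this with Python's built-in codecs rather than ICU, and also confirms that the substitution bytes are the UTF-8 form of U+FFFD. The helper name `utf8_bytes_per_utf16_unit` is hypothetical.

```python
def utf8_bytes_per_utf16_unit(character):
    """UTF-8 byte count divided by the number of UTF-16 code units for one character."""
    utf8_length = len(character.encode("utf-8"))
    utf16_units = len(character.encode("utf-16-le")) // 2
    return utf8_length / utf16_units


# The substitution character listed above is U+FFFD REPLACEMENT CHARACTER.
SUBSTITUTION = "\ufffd".encode("utf-8")

if __name__ == "__main__":
    for sample in ("A", "\u00e9", "\u20ac", "\U00093700"):
        print(f"U+{ord(sample):04X}", utf8_bytes_per_utf16_unit(sample))
    print(SUBSTITUTION)
    # Python's decoder uses the same replacement character for malformed input.
    print(b"\xff".decode("utf-8", errors="replace") == "\ufffd")
```

ASCII gives the minimum of 1 byte per code unit, characters from U+0800 to U+FFFF give the maximum of 3, and supplementary characters average 2.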

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
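
The set pattern above excludes only the surrogate range, so every other code point up to U+10FFFF is representable. As a small illustration, the sketch below expresses the pattern as a range test and compares it with the behavior of Python's own UTF-8 encoder, which is used here as a stand-in and is not ICU. The function names `in_pattern_set` and `encodes_as_utf8` are hypothetical.

```python
def in_pattern_set(code_point):
    """Membership test for [^\\uD800-\\uDFFF] over the Unicode code point range."""
    return 0 <= code_point <= 0x10FFFF and not (0xD800 <= code_point <= 0xDFFF)


def encodes_as_utf8(code_point):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Boundary values around the surrogate range, plus the page's sample and the last code point.
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x93700, 0x10FFFF):
        print(f"U+{cp:04X}", in_pattern_set(cp), encodes_as_utf8(cp))
```

The two functions agree on each boundary value: lone surrogates are rejected and everything else is accepted.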