International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B39C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𳜀
33700
𳜁
33701
𳜂
33702
𳜃
33703
𳜄
33704
𳜅
33705
𳜆
33706
𳜇
33707
𳜈
33708
𳜉
33709
𳜊
3370A
𳜋
3370B
𳜌
3370C
𳜍
3370D
𳜎
3370E
𳜏
3370F
80
90
𳜐
33710
𳜑
33711
𳜒
33712
𳜓
33713
𳜔
33714
𳜕
33715
𳜖
33716
𳜗
33717
𳜘
33718
𳜙
33719
𳜚
3371A
𳜛
3371B
𳜜
3371C
𳜝
3371D
𳜞
3371E
𳜟
3371F
90
A0
𳜠
33720
𳜡
33721
𳜢
33722
𳜣
33723
𳜤
33724
𳜥
33725
𳜦
33726
𳜧
33727
𳜨
33728
𳜩
33729
𳜪
3372A
𳜫
3372B
𳜬
3372C
𳜭
3372D
𳜮
3372E
𳜯
3372F
A0
B0
𳜰
33730
𳜱
33731
𳜲
33732
𳜳
33733
𳜴
33734
𳜵
33735
𳜶
33736
𳜷
33737
𳜸
33738
𳜹
33739
𳜺
3373A
𳜻
3373B
𳜼
3373C
𳜽
3373D
𳜾
3373E
𳜿
3373F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]