International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F481A2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􁢀
101880
􁢁
101881
􁢂
101882
􁢃
101883
􁢄
101884
􁢅
101885
􁢆
101886
􁢇
101887
􁢈
101888
􁢉
101889
􁢊
10188A
􁢋
10188B
􁢌
10188C
􁢍
10188D
􁢎
10188E
􁢏
10188F
80
90
􁢐
101890
􁢑
101891
􁢒
101892
􁢓
101893
􁢔
101894
􁢕
101895
􁢖
101896
􁢗
101897
􁢘
101898
􁢙
101899
􁢚
10189A
􁢛
10189B
􁢜
10189C
􁢝
10189D
􁢞
10189E
􁢟
10189F
90
A0
􁢠
1018A0
􁢡
1018A1
􁢢
1018A2
􁢣
1018A3
􁢤
1018A4
􁢥
1018A5
􁢦
1018A6
􁢧
1018A7
􁢨
1018A8
􁢩
1018A9
􁢪
1018AA
􁢫
1018AB
􁢬
1018AC
􁢭
1018AD
􁢮
1018AE
􁢯
1018AF
A0
B0
􁢰
1018B0
􁢱
1018B1
􁢲
1018B2
􁢳
1018B3
􁢴
1018B4
􁢵
1018B5
􁢶
1018B6
􁢷
1018B7
􁢸
1018B8
􁢹
1018B9
􁢺
1018BA
􁢻
1018BB
􁢼
1018BC
􁢽
1018BD
􁢾
1018BE
􁢿
1018BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]