International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F486A2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􆢀
106880
􆢁
106881
􆢂
106882
􆢃
106883
􆢄
106884
􆢅
106885
􆢆
106886
􆢇
106887
􆢈
106888
􆢉
106889
􆢊
10688A
􆢋
10688B
􆢌
10688C
􆢍
10688D
􆢎
10688E
􆢏
10688F
80
90
􆢐
106890
􆢑
106891
􆢒
106892
􆢓
106893
􆢔
106894
􆢕
106895
􆢖
106896
􆢗
106897
􆢘
106898
􆢙
106899
􆢚
10689A
􆢛
10689B
􆢜
10689C
􆢝
10689D
􆢞
10689E
􆢟
10689F
90
A0
􆢠
1068A0
􆢡
1068A1
􆢢
1068A2
􆢣
1068A3
􆢤
1068A4
􆢥
1068A5
􆢦
1068A6
􆢧
1068A7
􆢨
1068A8
􆢩
1068A9
􆢪
1068AA
􆢫
1068AB
􆢬
1068AC
􆢭
1068AD
􆢮
1068AE
􆢯
1068AF
A0
B0
􆢰
1068B0
􆢱
1068B1
􆢲
1068B2
􆢳
1068B3
􆢴
1068B4
􆢵
1068B5
􆢶
1068B6
􆢷
1068B7
􆢸
1068B8
􆢹
1068B9
􆢺
1068BA
􆢻
1068BB
􆢼
1068BC
􆢽
1068BD
􆢾
1068BE
􆢿
1068BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]