International Components for Unicode

UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3 B5 A2

Each cell is the code point (hexadecimal) encoded by the three lead bytes followed by the byte at that row and column.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  F5880 F5881 F5882 F5883 F5884 F5885 F5886 F5887 F5888 F5889 F588A F588B F588C F588D F588E F588F
90  F5890 F5891 F5892 F5893 F5894 F5895 F5896 F5897 F5898 F5899 F589A F589B F589C F589D F589E F589F
A0  F58A0 F58A1 F58A2 F58A3 F58A4 F58A5 F58A6 F58A7 F58A8 F58A9 F58AA F58AB F58AC F58AD F58AE F58AF
B0  F58B0 F58B1 F58B2 F58B3 F58B4 F58B5 F58B6 F58B7 F58B8 F58B9 F58BA F58BB F58BC F58BD F58BE F58BF

Rows 00-70 and C0-F0 are empty. All 64 code points shown are private-use characters (Supplementary Private Use Area-A) and have no standard glyph.
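
The layout above can be reproduced outside the demo. The following is a minimal sketch, assuming Python's built-in `utf-8` codec as a stand-in for the ICU converter; the names `LEAD`, `decode_cell`, and `decode_manually` are illustrative and not part of ICU.

```python
# Lead bytes taken from the page: "starting with the bytes F3B5A2".
LEAD = bytes.fromhex("F3B5A2")


def decode_cell(trail: int) -> int:
    """Return the code point for the 4-byte sequence LEAD + trail."""
    if not 0x80 <= trail <= 0xBF:
        # Only continuation bytes complete the sequence, which is why
        # rows 00-70 and C0-F0 of the layout are empty.
        raise ValueError("trail byte must be in 0x80-0xBF")
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def decode_manually(seq: bytes) -> int:
    """Decode one well-formed 4-byte UTF-8 sequence by bit arithmetic."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


if __name__ == "__main__":
    # Print the four populated rows of the layout.
    for row in (0x80, 0x90, 0xA0, 0xB0):
        cells = " ".join(f"{decode_cell(row + col):05X}" for col in range(16))
        print(f"{row:02X}  {cells}")
```

`decode_manually` skips validation on purpose; it only illustrates how the payload bits of F3 B5 A2 80 combine into U+F5880.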

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
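
The byte counts and the substitution character listed above can be checked with a short sketch. It assumes that a UChar is one UTF-16 code unit and uses Python's built-in codecs rather than ICU; `SUBSTITUTION` and `bytes_per_utf16_unit` are illustrative names only.

```python
# U+FFFD REPLACEMENT CHARACTER encoded as UTF-8 gives the listed
# substitution bytes \xEF\xBF\xBD.
SUBSTITUTION = "\uFFFD".encode("utf-8")


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 bytes used per UTF-16 code unit for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


if __name__ == "__main__":
    print(SUBSTITUTION.hex(" ").upper())
    # ASCII: 1 byte for 1 code unit (the minimum).
    print(bytes_per_utf16_unit("A"))
    # A BMP character such as U+20AC: 3 bytes for 1 code unit (the maximum).
    print(bytes_per_utf16_unit("\u20AC"))
    # A supplementary character: 4 bytes spread over 2 code units.
    print(bytes_per_utf16_unit("\U000F5880"))
```

Under that assumption, a supplementary character needs four bytes but occupies two UTF-16 code units, so the per-UChar maximum stays at 3.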

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
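
The set above covers every code point except the surrogate range U+D800 to U+DFFF. As a rough check, the sketch below assumes Python's strict `utf-8` codec behaves like the converter in this respect; `in_set` and `is_representable` are hypothetical helper names.

```python
def in_set(cp: int) -> bool:
    """Membership test for the set: everything except U+D800..U+DFFF."""
    return not 0xD800 <= cp <= 0xDFFF


def is_representable(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
    except UnicodeEncodeError:
        # Lone surrogates are rejected by the strict encoder.
        return False
    return True


if __name__ == "__main__":
    # Sample the boundaries of the surrogate range and a few other points.
    for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xF5880, 0x10FFFF):
        print(f"U+{cp:04X}", in_set(cp), is_representable(cp))
```

For each sampled code point, the two columns should agree.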