International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A6A2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󦢀
E6880
󦢁
E6881
󦢂
E6882
󦢃
E6883
󦢄
E6884
󦢅
E6885
󦢆
E6886
󦢇
E6887
󦢈
E6888
󦢉
E6889
󦢊
E688A
󦢋
E688B
󦢌
E688C
󦢍
E688D
󦢎
E688E
󦢏
E688F
80
90
󦢐
E6890
󦢑
E6891
󦢒
E6892
󦢓
E6893
󦢔
E6894
󦢕
E6895
󦢖
E6896
󦢗
E6897
󦢘
E6898
󦢙
E6899
󦢚
E689A
󦢛
E689B
󦢜
E689C
󦢝
E689D
󦢞
E689E
󦢟
E689F
90
A0
󦢠
E68A0
󦢡
E68A1
󦢢
E68A2
󦢣
E68A3
󦢤
E68A4
󦢥
E68A5
󦢦
E68A6
󦢧
E68A7
󦢨
E68A8
󦢩
E68A9
󦢪
E68AA
󦢫
E68AB
󦢬
E68AC
󦢭
E68AD
󦢮
E68AE
󦢯
E68AF
A0
B0
󦢰
E68B0
󦢱
E68B1
󦢲
E68B2
󦢳
E68B3
󦢴
E68B4
󦢵
E68B5
󦢶
E68B6
󦢷
E68B7
󦢸
E68B8
󦢹
E68B9
󦢺
E68BA
󦢻
E68BB
󦢼
E68BC
󦢽
E68BD
󦢾
E68BE
󦢿
E68BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]