International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A7A2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󧢀
E7880
󧢁
E7881
󧢂
E7882
󧢃
E7883
󧢄
E7884
󧢅
E7885
󧢆
E7886
󧢇
E7887
󧢈
E7888
󧢉
E7889
󧢊
E788A
󧢋
E788B
󧢌
E788C
󧢍
E788D
󧢎
E788E
󧢏
E788F
80
90
󧢐
E7890
󧢑
E7891
󧢒
E7892
󧢓
E7893
󧢔
E7894
󧢕
E7895
󧢖
E7896
󧢗
E7897
󧢘
E7898
󧢙
E7899
󧢚
E789A
󧢛
E789B
󧢜
E789C
󧢝
E789D
󧢞
E789E
󧢟
E789F
90
A0
󧢠
E78A0
󧢡
E78A1
󧢢
E78A2
󧢣
E78A3
󧢤
E78A4
󧢥
E78A5
󧢦
E78A6
󧢧
E78A7
󧢨
E78A8
󧢩
E78A9
󧢪
E78AA
󧢫
E78AB
󧢬
E78AC
󧢭
E78AD
󧢮
E78AE
󧢯
E78AF
A0
B0
󧢰
E78B0
󧢱
E78B1
󧢲
E78B2
󧢳
E78B3
󧢴
E78B4
󧢵
E78B5
󧢶
E78B6
󧢷
E78B7
󧢸
E78B8
󧢹
E78B9
󧢺
E78BA
󧢻
E78BB
󧢼
E78BC
󧢽
E78BD
󧢾
E78BE
󧢿
E78BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]