International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B9A2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󹢀
F9880
󹢁
F9881
󹢂
F9882
󹢃
F9883
󹢄
F9884
󹢅
F9885
󹢆
F9886
󹢇
F9887
󹢈
F9888
󹢉
F9889
󹢊
F988A
󹢋
F988B
󹢌
F988C
󹢍
F988D
󹢎
F988E
󹢏
F988F
80
90
󹢐
F9890
󹢑
F9891
󹢒
F9892
󹢓
F9893
󹢔
F9894
󹢕
F9895
󹢖
F9896
󹢗
F9897
󹢘
F9898
󹢙
F9899
󹢚
F989A
󹢛
F989B
󹢜
F989C
󹢝
F989D
󹢞
F989E
󹢟
F989F
90
A0
󹢠
F98A0
󹢡
F98A1
󹢢
F98A2
󹢣
F98A3
󹢤
F98A4
󹢥
F98A5
󹢦
F98A6
󹢧
F98A7
󹢨
F98A8
󹢩
F98A9
󹢪
F98AA
󹢫
F98AB
󹢬
F98AC
󹢭
F98AD
󹢮
F98AE
󹢯
F98AF
A0
B0
󹢰
F98B0
󹢱
F98B1
󹢲
F98B2
󹢳
F98B3
󹢴
F98B4
󹢵
F98B5
󹢶
F98B6
󹢷
F98B7
󹢸
F98B8
󹢹
F98B9
󹢺
F98BA
󹢻
F98BB
󹢼
F98BC
󹢽
F98BD
󹢾
F98BE
󹢿
F98BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]