International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3898B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󉋀
C92C0
󉋁
C92C1
󉋂
C92C2
󉋃
C92C3
󉋄
C92C4
󉋅
C92C5
󉋆
C92C6
󉋇
C92C7
󉋈
C92C8
󉋉
C92C9
󉋊
C92CA
󉋋
C92CB
󉋌
C92CC
󉋍
C92CD
󉋎
C92CE
󉋏
C92CF
80
90
󉋐
C92D0
󉋑
C92D1
󉋒
C92D2
󉋓
C92D3
󉋔
C92D4
󉋕
C92D5
󉋖
C92D6
󉋗
C92D7
󉋘
C92D8
󉋙
C92D9
󉋚
C92DA
󉋛
C92DB
󉋜
C92DC
󉋝
C92DD
󉋞
C92DE
󉋟
C92DF
90
A0
󉋠
C92E0
󉋡
C92E1
󉋢
C92E2
󉋣
C92E3
󉋤
C92E4
󉋥
C92E5
󉋦
C92E6
󉋧
C92E7
󉋨
C92E8
󉋩
C92E9
󉋪
C92EA
󉋫
C92EB
󉋬
C92EC
󉋭
C92ED
󉋮
C92EE
󉋯
C92EF
A0
B0
󉋰
C92F0
󉋱
C92F1
󉋲
C92F2
󉋳
C92F3
󉋴
C92F4
󉋵
C92F5
󉋶
C92F6
󉋷
C92F7
󉋸
C92F8
󉋹
C92F9
󉋺
C92FA
󉋻
C92FB
󉋼
C92FC
󉋽
C92FD
󉋾
C92FE
󉋿
C92FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]