International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4888F

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􈏀
1083C0
􈏁
1083C1
􈏂
1083C2
􈏃
1083C3
􈏄
1083C4
􈏅
1083C5
􈏆
1083C6
􈏇
1083C7
􈏈
1083C8
􈏉
1083C9
􈏊
1083CA
􈏋
1083CB
􈏌
1083CC
􈏍
1083CD
􈏎
1083CE
􈏏
1083CF
80
90
􈏐
1083D0
􈏑
1083D1
􈏒
1083D2
􈏓
1083D3
􈏔
1083D4
􈏕
1083D5
􈏖
1083D6
􈏗
1083D7
􈏘
1083D8
􈏙
1083D9
􈏚
1083DA
􈏛
1083DB
􈏜
1083DC
􈏝
1083DD
􈏞
1083DE
􈏟
1083DF
90
A0
􈏠
1083E0
􈏡
1083E1
􈏢
1083E2
􈏣
1083E3
􈏤
1083E4
􈏥
1083E5
􈏦
1083E6
􈏧
1083E7
􈏨
1083E8
􈏩
1083E9
􈏪
1083EA
􈏫
1083EB
􈏬
1083EC
􈏭
1083ED
􈏮
1083EE
􈏯
1083EF
A0
B0
􈏰
1083F0
􈏱
1083F1
􈏲
1083F2
􈏳
1083F3
􈏴
1083F4
􈏵
1083F5
􈏶
1083F6
􈏷
1083F7
􈏸
1083F8
􈏹
1083F9
􈏺
1083FA
􈏻
1083FB
􈏼
1083FC
􈏽
1083FD
􈏾
1083FE
􈏿
1083FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]