International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

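Currently showing the codepage for 4-byte sequences starting with the bytes F3 86 83. The row label plus the column label gives the final byte; each cell is the resulting Unicode code point in hexadecimal.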

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  C60C0 C60C1 C60C2 C60C3 C60C4 C60C5 C60C6 C60C7 C60C8 C60C9 C60CA C60CB C60CC C60CD C60CE C60CF
90  C60D0 C60D1 C60D2 C60D3 C60D4 C60D5 C60D6 C60D7 C60D8 C60D9 C60DA C60DB C60DC C60DD C60DE C60DF
A0  C60E0 C60E1 C60E2 C60E3 C60E4 C60E5 C60E6 C60E7 C60E8 C60E9 C60EA C60EB C60EC C60ED C60EE C60EF
B0  C60F0 C60F1 C60F2 C60F3 C60F4 C60F5 C60F6 C60F7 C60F8 C60F9 C60FA C60FB C60FC C60FD C60FE C60FF
C0
D0
E0
F0
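
The mapping in this grid follows from the standard UTF-8 bit layout for 4-byte sequences. As a minimal sketch in Python (this is not ICU code, and the helper name `decode_four_byte` is hypothetical), the lead bytes F3 86 83 plus a final byte in the range 80..BF can be decoded by hand and cross-checked against Python's built-in UTF-8 codec:

```python
def decode_four_byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point (hypothetical helper).

    Sketch only: it checks the byte patterns but does not reject overlong
    forms or values above U+10FFFF.
    """
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected a 4-byte sequence with a lead byte of the form 11110xxx")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    # 3 payload bits come from the lead byte, 6 from each trail byte.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


LEAD = bytes.fromhex("F38683")  # the prefix shown on this page

for last in (0x80, 0x8F, 0xBF):
    seq = LEAD + bytes([last])
    cp = decode_four_byte(seq)
    # Cross-check the manual arithmetic against the built-in codec.
    assert seq.decode("utf-8") == chr(cp)
    print(f"{seq.hex(' ').upper()} -> U+{cp:05X}")
```

With the final bytes 80, 8F and BF this yields U+C60C0, U+C60CF and U+C60FF, matching the first cell, the end of the first populated row, and the last cell of the grid.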

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
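
Two of these properties can be checked outside ICU. The sketch below uses Python's standard codecs rather than the ICU converter, and the helper name `bytes_per_uchar` is hypothetical. It relies on a UChar being one UTF-16 code unit: the substitution bytes are the UTF-8 form of U+FFFD, and the maximum is 3 rather than 4 because a 4-byte UTF-8 sequence corresponds to two UTF-16 code units.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for one character (hypothetical helper)."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# The substitution character listed above is U+FFFD REPLACEMENT CHARACTER.
assert "\uFFFD".encode("utf-8") == b"\xEF\xBF\xBD"

# One sample character for each UTF-8 sequence length (1 to 4 bytes).
for ch in ("A", "\u00E9", "\u20AC", "\U0001F600"):
    print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UTF-16 code unit")
```

The supplementary character U+1F600 takes 4 bytes in UTF-8 but 2 code units in UTF-16, so its ratio is 2; the worst case of 3 comes from characters in the range U+0800..U+FFFF.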

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
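
This set excludes only the surrogate range U+D800..U+DFFF. As a rough illustration using Python's strict UTF-8 codec (assumed here to agree with the ICU converter on this point; `is_representable` is a hypothetical helper), encoding a lone surrogate fails while code points on either side of the range succeed:

```python
def is_representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as strict UTF-8 (hypothetical helper)."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 codec refuses lone surrogates.
        return False


# Boundary values around the excluded range, plus the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```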