International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3839B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󃛀
C36C0
󃛁
C36C1
󃛂
C36C2
󃛃
C36C3
󃛄
C36C4
󃛅
C36C5
󃛆
C36C6
󃛇
C36C7
󃛈
C36C8
󃛉
C36C9
󃛊
C36CA
󃛋
C36CB
󃛌
C36CC
󃛍
C36CD
󃛎
C36CE
󃛏
C36CF
80
90
󃛐
C36D0
󃛑
C36D1
󃛒
C36D2
󃛓
C36D3
󃛔
C36D4
󃛕
C36D5
󃛖
C36D6
󃛗
C36D7
󃛘
C36D8
󃛙
C36D9
󃛚
C36DA
󃛛
C36DB
󃛜
C36DC
󃛝
C36DD
󃛞
C36DE
󃛟
C36DF
90
A0
󃛠
C36E0
󃛡
C36E1
󃛢
C36E2
󃛣
C36E3
󃛤
C36E4
󃛥
C36E5
󃛦
C36E6
󃛧
C36E7
󃛨
C36E8
󃛩
C36E9
󃛪
C36EA
󃛫
C36EB
󃛬
C36EC
󃛭
C36ED
󃛮
C36EE
󃛯
C36EF
A0
B0
󃛰
C36F0
󃛱
C36F1
󃛲
C36F2
󃛳
C36F3
󃛴
C36F4
󃛵
C36F5
󃛶
C36F6
󃛷
C36F7
󃛸
C36F8
󃛹
C36F9
󃛺
C36FA
󃛻
C36FB
󃛼
C36FC
󃛽
C36FD
󃛾
C36FE
󃛿
C36FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]