International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
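
The alias list above can be read as a many-to-one lookup: every alias names the same internal converter. The following is a minimal sketch of such a lookup, not ICU's own implementation. The helper names `normalize` and `resolve` are hypothetical, and the loose matching rule (ignore case, hyphens, underscores and spaces) is an assumption made for illustration.

```python
import re

# Aliases copied from the list above; all of them name the same converter.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical loose key: lowercase, then drop '-', '_' and spaces."""
    return re.sub(r"[-_ ]", "", name).lower()


# Map every normalized alias to the canonical converter name.
ALIAS_TABLE = {normalize(alias): CANONICAL for alias in ALIASES}


def resolve(name: str):
    """Return the canonical converter name, or None for an unknown alias."""
    return ALIAS_TABLE.get(normalize(name))


print(resolve("IBM_1208"))  # UTF-8
print(resolve("latin-1"))   # None
```

With this rule, spellings such as `ibm-1208`, `IBM_1208` and `ibm 1208` all resolve to the same entry, while names outside the list return `None`.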


Codepage Layout

Currently showing the codepage starting with the bytes F0 B5 B8.

Rows give the high nibble of the final byte and columns give the low nibble. Each cell shows the code point, with the character on the line below. Rows 00-70 and C0-F0 are empty.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80    35E00 35E01 35E02 35E03 35E04 35E05 35E06 35E07 35E08 35E09 35E0A 35E0B 35E0C 35E0D 35E0E 35E0F
      𵸀    𵸁    𵸂    𵸃    𵸄    𵸅    𵸆    𵸇    𵸈    𵸉    𵸊    𵸋    𵸌    𵸍    𵸎    𵸏
90    35E10 35E11 35E12 35E13 35E14 35E15 35E16 35E17 35E18 35E19 35E1A 35E1B 35E1C 35E1D 35E1E 35E1F
      𵸐    𵸑    𵸒    𵸓    𵸔    𵸕    𵸖    𵸗    𵸘    𵸙    𵸚    𵸛    𵸜    𵸝    𵸞    𵸟
A0    35E20 35E21 35E22 35E23 35E24 35E25 35E26 35E27 35E28 35E29 35E2A 35E2B 35E2C 35E2D 35E2E 35E2F
      𵸠    𵸡    𵸢    𵸣    𵸤    𵸥    𵸦    𵸧    𵸨    𵸩    𵸪    𵸫    𵸬    𵸭    𵸮    𵸯
B0    35E30 35E31 35E32 35E33 35E34 35E35 35E36 35E37 35E38 35E39 35E3A 35E3B 35E3C 35E3D 35E3E 35E3F
      𵸰    𵸱    𵸲    𵸳    𵸴    𵸵    𵸶    𵸷    𵸸    𵸹    𵸺    𵸻    𵸼    𵸽    𵸾    𵸿
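
The code points on this page follow from the standard UTF-8 bit layout for four-byte sequences (11110xxx 10xxxxxx 10xxxxxx 10xxxxxx). As a rough sketch, the arithmetic can be reproduced by hand and cross-checked against Python's built-in encoder. The function names `decode_four_byte` and `page_code_points` are made up for this example and are not part of ICU.

```python
LEAD = bytes([0xF0, 0xB5, 0xB8])  # the three lead bytes shown for this page


def decode_four_byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point by hand."""
    if len(seq) != 4:
        raise ValueError("expected exactly four bytes")
    b0, b1, b2, b3 = seq
    if b0 & 0xF8 != 0xF0:
        raise ValueError("first byte must look like 11110xxx")
    if any(b & 0xC0 != 0x80 for b in (b1, b2, b3)):
        raise ValueError("continuation bytes must look like 10xxxxxx")
    # Concatenate the payload bits: 3 + 6 + 6 + 6 = 21 bits.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


def page_code_points():
    """Code points for every valid final byte 0x80..0xBF on this page."""
    return [decode_four_byte(LEAD + bytes([last])) for last in range(0x80, 0xC0)]


points = page_code_points()
print(f"{points[0]:X}..{points[-1]:X}")  # 35E00..35E3F

# Cross-check each value against Python's own UTF-8 encoder.
for offset, cp in enumerate(points):
    assert chr(cp).encode("utf-8") == LEAD + bytes([0x80 + offset])
```

Only final bytes 80 through BF are valid continuation bytes, which matches the four populated rows of the layout.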

Information About This Converter

Type of converter:                       UCNV_UTF8
Minimum number of bytes per UChar:       1
Maximum number of bytes per UChar:       3
Substitution character:                  \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?         TRUE
Is ASCII [\u0020-\u007E] ambiguous?      FALSE
Contains ambiguous aliases?              FALSE
Always generates Unicode NFC?            UNKNOWN
Contains BiDi characters?                TRUE
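
The byte counts and the substitution character listed above can be checked with a short sketch. It assumes that a UChar is one 16-bit UTF-16 code unit, so a four-byte UTF-8 sequence corresponds to two UChars (a surrogate pair), which is one plausible reading of why the reported maximum is 3 rather than 4. The helper `bytes_and_uchars` is a hypothetical name, and the `errors="replace"` behavior shown is Python's, not necessarily what an ICU converter does.

```python
def bytes_and_uchars(ch: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


# One sample per UTF-8 sequence length.
for ch in ["A", "\u00e9", "\uffff", "\U00035E00"]:
    n_bytes, n_units = bytes_and_uchars(ch)
    print(f"U+{ord(ch):04X}: {n_bytes} bytes, {n_units} UChar(s)")

# The substitution character bytes EF BF BD are U+FFFD encoded in UTF-8.
assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"

# Python's decoder substitutes the same character for an invalid byte.
assert b"\xff".decode("utf-8", errors="replace") == "\ufffd"
```

Under that assumption, the ratio of bytes to UChars ranges from 1 (ASCII) to 3 (for example U+FFFF), and a supplementary character averages 2 bytes per UChar.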

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
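
The set expression above excludes only the surrogate range U+D800 to U+DFFF. A minimal sketch of the same membership test follows, compared against Python's strict UTF-8 encoder, which also rejects lone surrogates. The names `is_representable` and `python_can_encode` are introduced here purely for illustration.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a code point outside the surrogate range D800..DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the boundaries of the excluded range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x35E00, 0x10FFFF):
    assert is_representable(cp) == python_can_encode(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```

The two predicates agree on every boundary value tested, including the code points from the layout shown earlier in this page.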