International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09AB0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𚰀
1AC00
𚰁
1AC01
𚰂
1AC02
𚰃
1AC03
𚰄
1AC04
𚰅
1AC05
𚰆
1AC06
𚰇
1AC07
𚰈
1AC08
𚰉
1AC09
𚰊
1AC0A
𚰋
1AC0B
𚰌
1AC0C
𚰍
1AC0D
𚰎
1AC0E
𚰏
1AC0F
80
90
𚰐
1AC10
𚰑
1AC11
𚰒
1AC12
𚰓
1AC13
𚰔
1AC14
𚰕
1AC15
𚰖
1AC16
𚰗
1AC17
𚰘
1AC18
𚰙
1AC19
𚰚
1AC1A
𚰛
1AC1B
𚰜
1AC1C
𚰝
1AC1D
𚰞
1AC1E
𚰟
1AC1F
90
A0
𚰠
1AC20
𚰡
1AC21
𚰢
1AC22
𚰣
1AC23
𚰤
1AC24
𚰥
1AC25
𚰦
1AC26
𚰧
1AC27
𚰨
1AC28
𚰩
1AC29
𚰪
1AC2A
𚰫
1AC2B
𚰬
1AC2C
𚰭
1AC2D
𚰮
1AC2E
𚰯
1AC2F
A0
B0
𚰰
1AC30
𚰱
1AC31
𚰲
1AC32
𚰳
1AC33
𚰴
1AC34
𚰵
1AC35
𚰶
1AC36
𚰷
1AC37
𚰸
1AC38
𚰹
1AC39
𚰺
1AC3A
𚰻
1AC3B
𚰼
1AC3C
𚰽
1AC3D
𚰾
1AC3E
𚰿
1AC3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]