International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
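
Every alias listed above selects the same converter. A minimal sketch of such a lookup follows, assuming loose matching that ignores case, hyphens, underscores and spaces. This is a simplification for illustration: the `ALIASES` table and the `canonical_name` helper are hypothetical and are not part of ICU's API.

```python
from typing import Optional

# Aliases copied from the list above; the matching rule is an assumption.
ALIASES = {
    "UTF-8": [
        "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
        "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
        "windows-65001", "cp1208", "x-UTF_8J",
        "unicode-1-1-utf-8", "unicode-2-0-utf-8",
    ],
}


def _fold(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces so spelling variants compare equal."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Folded alias -> canonical converter name.
_LOOKUP = {
    _fold(alias): canonical
    for canonical, names in ALIASES.items()
    for alias in names
}


def canonical_name(name: str) -> Optional[str]:
    """Return the canonical converter name for an alias, or None if unknown."""
    return _LOOKUP.get(_fold(name))
```

With this table, `canonical_name("IBM_1208")` and `canonical_name("cp1208")` both resolve to `"UTF-8"`, while an unlisted name returns `None`.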


Codepage Layout

Currently showing the codepage starting with the bytes F3B0B6

Rows give the high nibble and columns the low nibble of the final byte. Each cell is the code point (hex) of the character encoded by F3 B0 B6 followed by that byte. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  F0D80 F0D81 F0D82 F0D83 F0D84 F0D85 F0D86 F0D87 F0D88 F0D89 F0D8A F0D8B F0D8C F0D8D F0D8E F0D8F
90  F0D90 F0D91 F0D92 F0D93 F0D94 F0D95 F0D96 F0D97 F0D98 F0D99 F0D9A F0D9B F0D9C F0D9D F0D9E F0D9F
A0  F0DA0 F0DA1 F0DA2 F0DA3 F0DA4 F0DA5 F0DA6 F0DA7 F0DA8 F0DA9 F0DAA F0DAB F0DAC F0DAD F0DAE F0DAF
B0  F0DB0 F0DB1 F0DB2 F0DB3 F0DB4 F0DB5 F0DB6 F0DB7 F0DB8 F0DB9 F0DBA F0DBB F0DBC F0DBD F0DBE F0DBF
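
The layout can be reproduced by appending each trail byte from 80 to BF to the lead bytes F3 B0 B6 and decoding the result. The sketch below uses Python's built-in UTF-8 codec rather than ICU. The helper names `page` and `decode4` are made up for illustration.

```python
LEAD = bytes.fromhex("F3B0B6")  # the three lead bytes shown on this page


def page(lead: bytes = LEAD) -> dict:
    """Map each valid final byte (0x80-0xBF) to the code point it completes."""
    cells = {}
    for last in range(0x80, 0xC0):
        cells[last] = ord((lead + bytes([last])).decode("utf-8"))
    return cells


def decode4(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence by hand: 3 + 6 + 6 + 6 payload bits."""
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


if __name__ == "__main__":
    for last, cp in page().items():
        print(f"F3 B0 B6 {last:02X} -> U+{cp:05X}")
```

The hand-written `decode4` shows where the code points come from: the payload bits of F3, B0, B6 and 80 combine to 0xF0D80, which matches the first populated cell.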

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
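
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. The following rough check uses Python's codecs instead of ICU, and `bytes_per_uchar` is an illustrative helper, not an ICU function.

```python
def bytes_per_uchar(ch: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len, uchars


if __name__ == "__main__":
    for ch in ("A", "\uFFFD", "\U000F0D80"):
        utf8_len, uchars = bytes_per_uchar(ch)
        print(f"U+{ord(ch):04X}: {utf8_len} bytes, {uchars} UChar(s)")
```

A character from this page, such as U+F0D80, needs 4 bytes in UTF-8 but occupies 2 UChars, so the per-UChar figure does not exceed 3. The substitution character \xEF\xBF\xBD is the 3-byte encoding of U+FFFD.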

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
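
The set above excludes only the surrogate range U+D800 to U+DFFF. Python's strict UTF-8 codec applies the same restriction, so it can serve as a quick check. The `representable` helper is hypothetical and does not call ICU.

```python
def representable(cp: int) -> bool:
    """True if the code point can be encoded by a strict UTF-8 encoder."""
    try:
        chr(cp).encode("utf-8")
    except UnicodeEncodeError:
        # Lone surrogates (U+D800-U+DFFF) are rejected.
        return False
    return True


if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xF0D80, 0x10FFFF):
        print(f"U+{cp:04X}: {representable(cp)}")
```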