International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
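
As a rough illustration of how an alias list like this one can be used, the sketch below maps any listed alias back to the internal converter name. The names `UTF8_ALIASES` and `canonical_name` are hypothetical, and the case-insensitive comparison is a simplifying assumption; it is not meant to reproduce ICU's own name-matching rules.

```python
# Aliases copied from the list on this page. The matching rule (plain
# case-insensitive comparison) is an assumption made for illustration.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def canonical_name(label):
    """Return "UTF-8" if label matches a listed alias, otherwise None."""
    wanted = label.strip().lower()
    for alias in UTF8_ALIASES:
        if alias.lower() == wanted:
            return "UTF-8"
    return None


print(canonical_name("IBM-1208"))  # UTF-8
print(canonical_name("latin-1"))   # None
```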


Codepage Layout

Currently showing the codepage starting with the bytes F3 95 B6. Each cell gives the Unicode code point (hexadecimal) decoded from those lead bytes followed by the trail byte formed by the row and column labels. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  D5D80 D5D81 D5D82 D5D83 D5D84 D5D85 D5D86 D5D87 D5D88 D5D89 D5D8A D5D8B D5D8C D5D8D D5D8E D5D8F
90  D5D90 D5D91 D5D92 D5D93 D5D94 D5D95 D5D96 D5D97 D5D98 D5D99 D5D9A D5D9B D5D9C D5D9D D5D9E D5D9F
A0  D5DA0 D5DA1 D5DA2 D5DA3 D5DA4 D5DA5 D5DA6 D5DA7 D5DA8 D5DA9 D5DAA D5DAB D5DAC D5DAD D5DAE D5DAF
B0  D5DB0 D5DB1 D5DB2 D5DB3 D5DB4 D5DB5 D5DB6 D5DB7 D5DB8 D5DB9 D5DBA D5DBB D5DBC D5DBD D5DBE D5DBF
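
The layout can be reproduced with a short sketch that appends each possible trail byte to the lead bytes F3 95 B6 and attempts a UTF-8 decode. This uses Python's built-in UTF-8 codec rather than ICU, and the names `LEAD` and `cell` are hypothetical. Only trail bytes 80 through BF form a valid four-byte sequence, which is why the other rows stay empty.

```python
LEAD = bytes.fromhex("F395B6")  # lead bytes shown for this codepage view


def cell(trail):
    """Decode LEAD plus one trail byte; return the code point or None."""
    try:
        text = (LEAD + bytes([trail])).decode("utf-8")
    except UnicodeDecodeError:
        return None  # not a valid UTF-8 sequence, so the cell is empty
    return ord(text)


# Print one line per row; empty cells are shown as dots.
for row in range(0x00, 0x100, 0x10):
    cells = [cell(row + col) for col in range(16)]
    print(f"{row:02X}", " ".join("....." if c is None else f"{c:05X}" for c in cells))
```

Working the first cell by hand gives the same result: F3 contributes the bits 011, 95 contributes 010101, B6 contributes 110110, and 80 contributes 000000, which together form 0xD5D80.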

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
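
The byte counts are stated per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. The sketch below, using Python's built-in codecs and a hypothetical helper `bytes_per_uchar`, shows why the maximum is 3 and not 4: a four-byte UTF-8 sequence corresponds to two UTF-16 code units. It also checks that the substitution bytes decode to U+FFFD.

```python
def bytes_per_uchar(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


print(bytes_per_uchar("A"))           # 1.0 (1 byte, 1 code unit)
print(bytes_per_uchar("\u20ac"))      # 3.0 (3 bytes, 1 code unit)
print(bytes_per_uchar("\U000D5D80"))  # 2.0 (4 bytes, 2 code units)

# The substitution character \xEF\xBF\xBD is the UTF-8 form of U+FFFD.
print(b"\xEF\xBF\xBD".decode("utf-8") == "\ufffd")  # True
```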

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
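
The set notation says that every code point except the surrogate range U+D800 to U+DFFF is representable. A minimal check, again using Python's built-in UTF-8 codec as a stand-in for the ICU converter (the helper name `representable` is hypothetical):

```python
def representable(code_point):
    """Return True if the code point can be encoded as UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False  # lone surrogates are rejected by the strict encoder


print(representable(0xD7FF))    # True  (just below the surrogate range)
print(representable(0xD800))    # False (first surrogate)
print(representable(0xDFFF))    # False (last surrogate)
print(representable(0xE000))    # True  (just above the surrogate range)
```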