International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name   All Aliases
UTF-8                     UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305,
                          ibm-13496, ibm-13497, ibm-17592, ibm-17593,
                          windows-65001, cp1208, x-UTF_8J,
                          unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F291B6

Rows give the high nibble and columns the low nibble of the final byte. Each cell is the Unicode code point in hexadecimal.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  91D80 91D81 91D82 91D83 91D84 91D85 91D86 91D87 91D88 91D89 91D8A 91D8B 91D8C 91D8D 91D8E 91D8F
90  91D90 91D91 91D92 91D93 91D94 91D95 91D96 91D97 91D98 91D99 91D9A 91D9B 91D9C 91D9D 91D9E 91D9F
A0  91DA0 91DA1 91DA2 91DA3 91DA4 91DA5 91DA6 91DA7 91DA8 91DA9 91DAA 91DAB 91DAC 91DAD 91DAE 91DAF
B0  91DB0 91DB1 91DB2 91DB3 91DB4 91DB5 91DB6 91DB7 91DB8 91DB9 91DBA 91DBB 91DBC 91DBD 91DBE 91DBF

Rows 00-70 and C0-F0 contain no characters.
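
The layout above can be cross-checked by decoding each four-byte sequence directly. What follows is a minimal sketch that uses Python's built-in UTF-8 codec rather than ICU, so it only illustrates the byte arithmetic. The helper name `codepage_row` is made up for this example; the lead bytes F2 91 B6 come from the page.

```python
# Lead bytes shown on the page: F2 91 B6 (assumed to be followed by one trail byte).
LEAD = bytes.fromhex("F291B6")


def codepage_row(high_nibble):
    """Return the 16 code points (uppercase hex strings) for one grid row.

    high_nibble is the row label, for example 0x80. Rows whose final byte is
    not a valid UTF-8 trail byte (outside 0x80-0xBF) yield an empty list.
    """
    row = []
    for low in range(16):
        seq = LEAD + bytes([high_nibble + low])
        try:
            ch = seq.decode("utf-8")
        except UnicodeDecodeError:
            # The whole row shares the same validity, so stop at the first failure.
            return []
        row.append(f"{ord(ch):X}")
    return row


# Print every row label followed by whatever code points it contains.
for label in range(0x00, 0x100, 0x10):
    print(f"{label:02X}", " ".join(codepage_row(label)))
```

Only rows 80 through B0 produce output, because a UTF-8 trail byte must lie in the range 0x80-0xBF.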

Information About This Converter
Type of converter                      UCNV_UTF8
Minimum number of bytes per UChar      1
Maximum number of bytes per UChar      3
Substitution character                 \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?       TRUE
Is ASCII [\u0020-\u007E] ambiguous?    FALSE
Contains ambiguous aliases?            FALSE
Always generates Unicode NFC?          UNKNOWN
Contains BiDi characters?              TRUE
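
Some of these properties can be reproduced with a short sketch. It uses Python's own UTF-8 codec, not ICU, and assumes that a UChar is one UTF-16 code unit, which is why the maximum is 3 rather than 4: a four-byte sequence maps to two UChars. The helper name `bytes_per_uchar` is hypothetical.

```python
def bytes_per_uchar(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, a two-byte character, the top of the BMP, and a supplementary
# character that needs four UTF-8 bytes but two UTF-16 code units.
samples = ["A", "\u00e9", "\uffff", "\U00091D80"]
ratios = [bytes_per_uchar(c) for c in samples]

# The substitution character \xEF\xBF\xBD is U+FFFD encoded in UTF-8.
substitution = "\ufffd".encode("utf-8")

# ASCII compatibility: bytes 0x20-0x7E decode to the same characters either way.
ascii_bytes = bytes(range(0x20, 0x7F))
ascii_compatible = ascii_bytes.decode("utf-8") == ascii_bytes.decode("ascii")

print(ratios, substitution, ascii_compatible)
```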


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
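
The set above excludes only the surrogate range. As a rough check, the sketch below asks Python's strict UTF-8 encoder (an independent implementation, not ICU) which code points it refuses to encode. The function name `is_representable` is an assumption made for this example.

```python
def is_representable(code_point):
    """Return True if a strict UTF-8 encoder accepts this code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Scan the full Unicode range U+0000 to U+10FFFF and collect the rejects.
unrepresentable = [cp for cp in range(0x110000) if not is_representable(cp)]

print(f"{unrepresentable[0]:04X}-{unrepresentable[-1]:04X}", len(unrepresentable))
```

The rejected code points form one contiguous block, matching the complement in the set expression.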