International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F197B6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񗶀
57D80
񗶁
57D81
񗶂
57D82
񗶃
57D83
񗶄
57D84
񗶅
57D85
񗶆
57D86
񗶇
57D87
񗶈
57D88
񗶉
57D89
񗶊
57D8A
񗶋
57D8B
񗶌
57D8C
񗶍
57D8D
񗶎
57D8E
񗶏
57D8F
80
90
񗶐
57D90
񗶑
57D91
񗶒
57D92
񗶓
57D93
񗶔
57D94
񗶕
57D95
񗶖
57D96
񗶗
57D97
񗶘
57D98
񗶙
57D99
񗶚
57D9A
񗶛
57D9B
񗶜
57D9C
񗶝
57D9D
񗶞
57D9E
񗶟
57D9F
90
A0
񗶠
57DA0
񗶡
57DA1
񗶢
57DA2
񗶣
57DA3
񗶤
57DA4
񗶥
57DA5
񗶦
57DA6
񗶧
57DA7
񗶨
57DA8
񗶩
57DA9
񗶪
57DAA
񗶫
57DAB
񗶬
57DAC
񗶭
57DAD
񗶮
57DAE
񗶯
57DAF
A0
B0
񗶰
57DB0
񗶱
57DB1
񗶲
57DB2
񗶳
57DB3
񗶴
57DB4
񗶵
57DB5
񗶶
57DB6
񗶷
57DB7
񗶸
57DB8
񗶹
57DB9
񗶺
57DBA
񗶻
57DBB
񗶼
57DBC
񗶽
57DBD
񗶾
57DBE
񗶿
57DBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]