International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0BDB4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𽴀
3DD00
𽴁
3DD01
𽴂
3DD02
𽴃
3DD03
𽴄
3DD04
𽴅
3DD05
𽴆
3DD06
𽴇
3DD07
𽴈
3DD08
𽴉
3DD09
𽴊
3DD0A
𽴋
3DD0B
𽴌
3DD0C
𽴍
3DD0D
𽴎
3DD0E
𽴏
3DD0F
80
90
𽴐
3DD10
𽴑
3DD11
𽴒
3DD12
𽴓
3DD13
𽴔
3DD14
𽴕
3DD15
𽴖
3DD16
𽴗
3DD17
𽴘
3DD18
𽴙
3DD19
𽴚
3DD1A
𽴛
3DD1B
𽴜
3DD1C
𽴝
3DD1D
𽴞
3DD1E
𽴟
3DD1F
90
A0
𽴠
3DD20
𽴡
3DD21
𽴢
3DD22
𽴣
3DD23
𽴤
3DD24
𽴥
3DD25
𽴦
3DD26
𽴧
3DD27
𽴨
3DD28
𽴩
3DD29
𽴪
3DD2A
𽴫
3DD2B
𽴬
3DD2C
𽴭
3DD2D
𽴮
3DD2E
𽴯
3DD2F
A0
B0
𽴰
3DD30
𽴱
3DD31
𽴲
3DD32
𽴳
3DD33
𽴴
3DD34
𽴵
3DD35
𽴶
3DD36
𽴷
3DD37
𽴸
3DD38
𽴹
3DD39
𽴺
3DD3A
𽴻
3DD3B
𽴼
3DD3C
𽴽
3DD3D
𽴾
3DD3E
𽴿
3DD3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]