International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2BAB4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򺴀
BAD00
򺴁
BAD01
򺴂
BAD02
򺴃
BAD03
򺴄
BAD04
򺴅
BAD05
򺴆
BAD06
򺴇
BAD07
򺴈
BAD08
򺴉
BAD09
򺴊
BAD0A
򺴋
BAD0B
򺴌
BAD0C
򺴍
BAD0D
򺴎
BAD0E
򺴏
BAD0F
80
90
򺴐
BAD10
򺴑
BAD11
򺴒
BAD12
򺴓
BAD13
򺴔
BAD14
򺴕
BAD15
򺴖
BAD16
򺴗
BAD17
򺴘
BAD18
򺴙
BAD19
򺴚
BAD1A
򺴛
BAD1B
򺴜
BAD1C
򺴝
BAD1D
򺴞
BAD1E
򺴟
BAD1F
90
A0
򺴠
BAD20
򺴡
BAD21
򺴢
BAD22
򺴣
BAD23
򺴤
BAD24
򺴥
BAD25
򺴦
BAD26
򺴧
BAD27
򺴨
BAD28
򺴩
BAD29
򺴪
BAD2A
򺴫
BAD2B
򺴬
BAD2C
򺴭
BAD2D
򺴮
BAD2E
򺴯
BAD2F
A0
B0
򺴰
BAD30
򺴱
BAD31
򺴲
BAD32
򺴳
BAD33
򺴴
BAD34
򺴵
BAD35
򺴶
BAD36
򺴷
BAD37
򺴸
BAD38
򺴹
BAD39
򺴺
BAD3A
򺴻
BAD3B
򺴼
BAD3C
򺴽
BAD3D
򺴾
BAD3E
򺴿
BAD3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]