International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F382B9

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  C2E40 C2E41 C2E42 C2E43 C2E44 C2E45 C2E46 C2E47 C2E48 C2E49 C2E4A C2E4B C2E4C C2E4D C2E4E C2E4F
90  C2E50 C2E51 C2E52 C2E53 C2E54 C2E55 C2E56 C2E57 C2E58 C2E59 C2E5A C2E5B C2E5C C2E5D C2E5E C2E5F
A0  C2E60 C2E61 C2E62 C2E63 C2E64 C2E65 C2E66 C2E67 C2E68 C2E69 C2E6A C2E6B C2E6C C2E6D C2E6E C2E6F
B0  C2E70 C2E71 C2E72 C2E73 C2E74 C2E75 C2E76 C2E77 C2E78 C2E79 C2E7A C2E7B C2E7C C2E7D C2E7E C2E7F
C0
D0
E0
F0
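
The layout above can be reproduced with a short sketch. The snippet below is an illustration only: it uses Python's built-in UTF-8 codec rather than ICU, and the helper name `codepage_rows` is hypothetical. It appends each trail byte from 80 to BF to the lead bytes F3 82 B9 and decodes the resulting four-byte sequence to a code point.

```python
# Lead bytes shown on this page; the fourth (trail) byte selects the cell.
LEAD = bytes.fromhex("F382B9")

def codepage_rows(lead=LEAD):
    """Map each row label (0x80, 0x90, 0xA0, 0xB0) to its 16 decoded code points."""
    rows = {}
    for high in range(0x80, 0xC0, 0x10):
        row = []
        for low in range(0x10):
            seq = lead + bytes([high | low])      # e.g. F3 82 B9 80
            row.append(ord(seq.decode("utf-8")))  # a single code point
        rows[high] = row
    return rows

rows = codepage_rows()
for high, cps in rows.items():
    print(f"{high:02X}  " + " ".join(f"{cp:05X}" for cp in cps))
```

Only trail bytes in the range 80 to BF form a valid sequence after these lead bytes, which is why the other rows of the table are empty.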

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
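
The byte counts above are stated per UChar, which in ICU is a 16-bit UTF-16 code unit. The following sketch, which uses Python's standard codecs rather than ICU (the helper names `utf8_len` and `utf16_units` are hypothetical), compares UTF-8 byte length with UTF-16 code unit count for a few sample characters and shows the UTF-8 encoding of U+FFFD.

```python
def utf8_len(ch):
    """Number of bytes the character occupies in UTF-8."""
    return len(ch.encode("utf-8"))

def utf16_units(ch):
    """Number of 16-bit code units the character occupies in UTF-16."""
    return len(ch.encode("utf-16-le")) // 2

# Sample characters chosen for illustration: 1-, 2-, 3- and 4-byte UTF-8 forms.
samples = ["A", "\u00E9", "\u20AC", "\U000C2E40"]
for ch in samples:
    print(f"U+{ord(ch):04X}: {utf8_len(ch)} UTF-8 byte(s), "
          f"{utf16_units(ch)} UTF-16 unit(s)")

# U+FFFD REPLACEMENT CHARACTER encodes to the bytes EF BF BD.
substitution = "\uFFFD".encode("utf-8")
print(substitution.hex(" ").upper())
```

A character outside the Basic Multilingual Plane takes four UTF-8 bytes but two UTF-16 code units, so the per-unit maximum stays at three bytes.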

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
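
The set above excludes only the surrogate code points U+D800 to U+DFFF. As a rough check, the sketch below compares that rule against Python's own UTF-8 encoder (not ICU); the function names `is_representable` and `encodes_as_utf8` are hypothetical.

```python
def is_representable(cp):
    # Mirrors the set shown above: every code point except U+D800..U+DFFF.
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)

def encodes_as_utf8(cp):
    # Python's strict UTF-8 encoder rejects lone surrogates.
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# Boundary code points around the surrogate range, plus the extremes.
boundaries = (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF)
for cp in boundaries:
    print(f"U+{cp:04X}: in set={is_representable(cp)}, "
          f"encodes={encodes_as_utf8(cp)}")
```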