International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F382B5

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󂵀
C2D40
󂵁
C2D41
󂵂
C2D42
󂵃
C2D43
󂵄
C2D44
󂵅
C2D45
󂵆
C2D46
󂵇
C2D47
󂵈
C2D48
󂵉
C2D49
󂵊
C2D4A
󂵋
C2D4B
󂵌
C2D4C
󂵍
C2D4D
󂵎
C2D4E
󂵏
C2D4F
80
90
󂵐
C2D50
󂵑
C2D51
󂵒
C2D52
󂵓
C2D53
󂵔
C2D54
󂵕
C2D55
󂵖
C2D56
󂵗
C2D57
󂵘
C2D58
󂵙
C2D59
󂵚
C2D5A
󂵛
C2D5B
󂵜
C2D5C
󂵝
C2D5D
󂵞
C2D5E
󂵟
C2D5F
90
A0
󂵠
C2D60
󂵡
C2D61
󂵢
C2D62
󂵣
C2D63
󂵤
C2D64
󂵥
C2D65
󂵦
C2D66
󂵧
C2D67
󂵨
C2D68
󂵩
C2D69
󂵪
C2D6A
󂵫
C2D6B
󂵬
C2D6C
󂵭
C2D6D
󂵮
C2D6E
󂵯
C2D6F
A0
B0
󂵰
C2D70
󂵱
C2D71
󂵲
C2D72
󂵳
C2D73
󂵴
C2D74
󂵵
C2D75
󂵶
C2D76
󂵷
C2D77
󂵸
C2D78
󂵹
C2D79
󂵺
C2D7A
󂵻
C2D7B
󂵼
C2D7C
󂵽
C2D7D
󂵾
C2D7E
󂵿
C2D7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]