International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A4A9

Rows give the high nibble and columns the low nibble of the final byte; each cell is the Unicode code point (hex) that the full byte sequence maps to.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80    E4A40 E4A41 E4A42 E4A43 E4A44 E4A45 E4A46 E4A47 E4A48 E4A49 E4A4A E4A4B E4A4C E4A4D E4A4E E4A4F
90    E4A50 E4A51 E4A52 E4A53 E4A54 E4A55 E4A56 E4A57 E4A58 E4A59 E4A5A E4A5B E4A5C E4A5D E4A5E E4A5F
A0    E4A60 E4A61 E4A62 E4A63 E4A64 E4A65 E4A66 E4A67 E4A68 E4A69 E4A6A E4A6B E4A6C E4A6D E4A6E E4A6F
B0    E4A70 E4A71 E4A72 E4A73 E4A74 E4A75 E4A76 E4A77 E4A78 E4A79 E4A7A E4A7B E4A7C E4A7D E4A7E E4A7F
C0-F0  (empty)
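
The layout above can be reproduced outside the demo. The following is a minimal sketch using Python's built-in UTF-8 codec rather than ICU itself; the helper names (`decode_cell`, `manual_decode`, `is_valid_trail`) are made up for illustration. It decodes the lead bytes F3 A4 A9 followed by one trailing byte, and also repeats the decoding by hand to show where the code point bits come from.

```python
# Lead bytes taken from the page: the layout shows sequences starting F3 A4 A9.
LEAD = bytes([0xF3, 0xA4, 0xA9])


def decode_cell(trail: int) -> int:
    """Code point for LEAD + one trailing byte, using Python's UTF-8 codec."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def manual_decode(seq: bytes) -> int:
    """Decode a 4-byte UTF-8 sequence by hand.

    The lead byte (11110xxx) contributes 3 payload bits; each
    continuation byte (10xxxxxx) contributes 6 more.
    """
    cp = seq[0] & 0x07
    for b in seq[1:]:
        cp = (cp << 6) | (b & 0x3F)
    return cp


def is_valid_trail(trail: int) -> bool:
    """True if LEAD + trail is a well-formed UTF-8 sequence."""
    try:
        (LEAD + bytes([trail])).decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


first = decode_cell(0x80)
last = decode_cell(0xBF)
print(f"{first:X}..{last:X}")
```

Only trailing bytes 80 through BF are valid continuation bytes, which is consistent with rows 00-70 and C0-F0 of the layout being empty.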

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
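
Two of the values above can be checked with a short sketch. It uses Python's standard codecs, not ICU, and assumes that UChar means a 16-bit UTF-16 code unit, as in ICU's C API; `bytes_per_utf16_unit` is a hypothetical helper. A character outside the BMP takes 4 UTF-8 bytes but 2 UChars, so the per-UChar maximum stays at 3.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by the UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")

for ch in ("A", "\u00e9", "\uffff", "\U00010000"):
    print(f"U+{ord(ch):04X}: {bytes_per_utf16_unit(ch)} bytes per UTF-16 unit")
```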

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
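
The set above excludes only the surrogate range U+D800 to U+DFFF. As an illustrative sketch, Python's strict UTF-8 encoder (not ICU) applies the same rule; `representable_in_utf8` is a hypothetical helper name.

```python
def representable_in_utf8(cp: int) -> bool:
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates (U+D800..U+DFFF).
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xE4A40, 0x10FFFF):
    print(f"U+{cp:04X}: {representable_in_utf8(cp)}")
```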