International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F482B1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􂱀
102C40
􂱁
102C41
􂱂
102C42
􂱃
102C43
􂱄
102C44
􂱅
102C45
􂱆
102C46
􂱇
102C47
􂱈
102C48
􂱉
102C49
􂱊
102C4A
􂱋
102C4B
􂱌
102C4C
􂱍
102C4D
􂱎
102C4E
􂱏
102C4F
80
90
􂱐
102C50
􂱑
102C51
􂱒
102C52
􂱓
102C53
􂱔
102C54
􂱕
102C55
􂱖
102C56
􂱗
102C57
􂱘
102C58
􂱙
102C59
􂱚
102C5A
􂱛
102C5B
􂱜
102C5C
􂱝
102C5D
􂱞
102C5E
􂱟
102C5F
90
A0
􂱠
102C60
􂱡
102C61
􂱢
102C62
􂱣
102C63
􂱤
102C64
􂱥
102C65
􂱦
102C66
􂱧
102C67
􂱨
102C68
􂱩
102C69
􂱪
102C6A
􂱫
102C6B
􂱬
102C6C
􂱭
102C6D
􂱮
102C6E
􂱯
102C6F
A0
B0
􂱰
102C70
􂱱
102C71
􂱲
102C72
􂱳
102C73
􂱴
102C74
􂱵
102C75
􂱶
102C76
􂱷
102C77
􂱸
102C78
􂱹
102C79
􂱺
102C7A
􂱻
102C7B
􂱼
102C7C
􂱽
102C7D
􂱾
102C7E
􂱿
102C7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]