International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B4B1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𴱀
34C40
𴱁
34C41
𴱂
34C42
𴱃
34C43
𴱄
34C44
𴱅
34C45
𴱆
34C46
𴱇
34C47
𴱈
34C48
𴱉
34C49
𴱊
34C4A
𴱋
34C4B
𴱌
34C4C
𴱍
34C4D
𴱎
34C4E
𴱏
34C4F
80
90
𴱐
34C50
𴱑
34C51
𴱒
34C52
𴱓
34C53
𴱔
34C54
𴱕
34C55
𴱖
34C56
𴱗
34C57
𴱘
34C58
𴱙
34C59
𴱚
34C5A
𴱛
34C5B
𴱜
34C5C
𴱝
34C5D
𴱞
34C5E
𴱟
34C5F
90
A0
𴱠
34C60
𴱡
34C61
𴱢
34C62
𴱣
34C63
𴱤
34C64
𴱥
34C65
𴱦
34C66
𴱧
34C67
𴱨
34C68
𴱩
34C69
𴱪
34C6A
𴱫
34C6B
𴱬
34C6C
𴱭
34C6D
𴱮
34C6E
𴱯
34C6F
A0
B0
𴱰
34C70
𴱱
34C71
𴱲
34C72
𴱳
34C73
𴱴
34C74
𴱵
34C75
𴱶
34C76
𴱷
34C77
𴱸
34C78
𴱹
34C79
𴱺
34C7A
𴱻
34C7B
𴱼
34C7C
𴱽
34C7D
𴱾
34C7E
𴱿
34C7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]