International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B8B1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𸱀
38C40
𸱁
38C41
𸱂
38C42
𸱃
38C43
𸱄
38C44
𸱅
38C45
𸱆
38C46
𸱇
38C47
𸱈
38C48
𸱉
38C49
𸱊
38C4A
𸱋
38C4B
𸱌
38C4C
𸱍
38C4D
𸱎
38C4E
𸱏
38C4F
80
90
𸱐
38C50
𸱑
38C51
𸱒
38C52
𸱓
38C53
𸱔
38C54
𸱕
38C55
𸱖
38C56
𸱗
38C57
𸱘
38C58
𸱙
38C59
𸱚
38C5A
𸱛
38C5B
𸱜
38C5C
𸱝
38C5D
𸱞
38C5E
𸱟
38C5F
90
A0
𸱠
38C60
𸱡
38C61
𸱢
38C62
𸱣
38C63
𸱤
38C64
𸱥
38C65
𸱦
38C66
𸱧
38C67
𸱨
38C68
𸱩
38C69
𸱪
38C6A
𸱫
38C6B
𸱬
38C6C
𸱭
38C6D
𸱮
38C6E
𸱯
38C6F
A0
B0
𸱰
38C70
𸱱
38C71
𸱲
38C72
𸱳
38C73
𸱴
38C74
𸱵
38C75
𸱶
38C76
𸱷
38C77
𸱸
38C78
𸱹
38C79
𸱺
38C7A
𸱻
38C7B
𸱼
38C7C
𸱽
38C7D
𸱾
38C7E
𸱿
38C7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]