International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
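
The alias list behaves like a many-to-one lookup table: every alias resolves to the single internal name UTF-8. The following is a minimal Python sketch of that idea, not ICU's own implementation. `ALIASES`, `normalize`, and `canonical_name` are hypothetical names, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for illustration.

```python
# Aliases copied from the list above; each maps to the internal name "UTF-8".
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (assumed loose matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Hypothetical lookup table: normalized alias -> internal converter name.
_LOOKUP = {normalize(alias): "UTF-8" for alias in ALIASES}


def canonical_name(name: str):
    """Return the internal converter name, or None if the alias is unknown."""
    return _LOOKUP.get(normalize(name))
```

Under these assumptions, `canonical_name("IBM-1208")` and `canonical_name("cp1208")` both resolve to `"UTF-8"`, while a name outside the list resolves to `None`.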


Codepage Layout

Currently showing the codepage starting with the bytes F1A4B1

Each cell is the Unicode code point (hex) produced by the lead bytes F1 A4 B1 followed by the trail byte given by row + column. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  64C40 64C41 64C42 64C43 64C44 64C45 64C46 64C47 64C48 64C49 64C4A 64C4B 64C4C 64C4D 64C4E 64C4F
90  64C50 64C51 64C52 64C53 64C54 64C55 64C56 64C57 64C58 64C59 64C5A 64C5B 64C5C 64C5D 64C5E 64C5F
A0  64C60 64C61 64C62 64C63 64C64 64C65 64C66 64C67 64C68 64C69 64C6A 64C6B 64C6C 64C6D 64C6E 64C6F
B0  64C70 64C71 64C72 64C73 64C74 64C75 64C76 64C77 64C78 64C79 64C7A 64C7B 64C7C 64C7D 64C7E 64C7F
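
The layout for the lead bytes F1 A4 B1 can be reproduced by appending each possible fourth byte and decoding the result. This is a sketch using Python's built-in UTF-8 codec rather than an ICU converter; `LEAD` and `decode_page` are hypothetical names introduced here.

```python
# Lead bytes of the page shown above.
LEAD = bytes([0xF1, 0xA4, 0xB1])


def decode_page(lead: bytes = LEAD) -> dict:
    """Map each valid trail byte to the code point it completes."""
    page = {}
    for trail in range(0x100):
        try:
            text = (lead + bytes([trail])).decode("utf-8")
        except UnicodeDecodeError:
            # Not a valid continuation byte: an empty cell in the layout.
            continue
        page[trail] = ord(text)
    return page


if __name__ == "__main__":
    for trail, code_point in sorted(decode_page().items()):
        print(f"{trail:02X} -> U+{code_point:05X}")
```

Only the 64 trail bytes 80 through BF decode successfully, which matches the four populated rows of the layout.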

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
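
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 rather than 4: a 4-byte UTF-8 sequence corresponds to two UChars. The sketch below checks this with Python's standard codecs (not ICU); `bytes_per_utf16_unit` is a hypothetical helper, and Python's `errors="replace"` is used only as an analogue of a substitution character.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units for ch."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# The substitution character listed above is U+FFFD encoded as UTF-8.
SUBSTITUTION = "\ufffd".encode("utf-8")

if __name__ == "__main__":
    print(bytes_per_utf16_unit("A"))           # ASCII: 1 byte, 1 code unit
    print(bytes_per_utf16_unit("\u20ac"))      # BMP: 3 bytes, 1 code unit
    print(bytes_per_utf16_unit("\U00064C40"))  # supplementary: 4 bytes, 2 code units
    print(SUBSTITUTION)
    # Python's decoder substitutes U+FFFD for an invalid byte.
    print(b"\xff".decode("utf-8", errors="replace") == "\ufffd")
```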

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
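
The set `[^\uD800-\uDFFF]` covers every code point except the surrogates. A quick way to see the same boundary is to try encoding individual code points; this sketch uses Python's strict UTF-8 encoder, not ICU, and `representable_in_utf8` is a hypothetical helper.

```python
def representable_in_utf8(code_point: int) -> bool:
    """Return True if the code point can be encoded by the strict UTF-8 codec."""
    try:
        chr(code_point).encode("utf-8")
    except UnicodeEncodeError:
        # Surrogate code points (U+D800-U+DFFF) are rejected.
        return False
    return True


if __name__ == "__main__":
    for cp in (0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x64C40, 0x10FFFF):
        print(f"U+{cp:04X}: {representable_in_utf8(cp)}")
```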