International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F383B1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󃱀
C3C40
󃱁
C3C41
󃱂
C3C42
󃱃
C3C43
󃱄
C3C44
󃱅
C3C45
󃱆
C3C46
󃱇
C3C47
󃱈
C3C48
󃱉
C3C49
󃱊
C3C4A
󃱋
C3C4B
󃱌
C3C4C
󃱍
C3C4D
󃱎
C3C4E
󃱏
C3C4F
80
90
󃱐
C3C50
󃱑
C3C51
󃱒
C3C52
󃱓
C3C53
󃱔
C3C54
󃱕
C3C55
󃱖
C3C56
󃱗
C3C57
󃱘
C3C58
󃱙
C3C59
󃱚
C3C5A
󃱛
C3C5B
󃱜
C3C5C
󃱝
C3C5D
󃱞
C3C5E
󃱟
C3C5F
90
A0
󃱠
C3C60
󃱡
C3C61
󃱢
C3C62
󃱣
C3C63
󃱤
C3C64
󃱥
C3C65
󃱦
C3C66
󃱧
C3C67
󃱨
C3C68
󃱩
C3C69
󃱪
C3C6A
󃱫
C3C6B
󃱬
C3C6C
󃱭
C3C6D
󃱮
C3C6E
󃱯
C3C6F
A0
B0
󃱰
C3C70
󃱱
C3C71
󃱲
C3C72
󃱳
C3C73
󃱴
C3C74
󃱵
C3C75
󃱶
C3C76
󃱷
C3C77
󃱸
C3C78
󃱹
C3C79
󃱺
C3C7A
󃱻
C3C7B
󃱼
C3C7C
󃱽
C3C7D
󃱾
C3C7E
󃱿
C3C7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]