International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39087

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󐇀
D01C0
󐇁
D01C1
󐇂
D01C2
󐇃
D01C3
󐇄
D01C4
󐇅
D01C5
󐇆
D01C6
󐇇
D01C7
󐇈
D01C8
󐇉
D01C9
󐇊
D01CA
󐇋
D01CB
󐇌
D01CC
󐇍
D01CD
󐇎
D01CE
󐇏
D01CF
80
90
󐇐
D01D0
󐇑
D01D1
󐇒
D01D2
󐇓
D01D3
󐇔
D01D4
󐇕
D01D5
󐇖
D01D6
󐇗
D01D7
󐇘
D01D8
󐇙
D01D9
󐇚
D01DA
󐇛
D01DB
󐇜
D01DC
󐇝
D01DD
󐇞
D01DE
󐇟
D01DF
90
A0
󐇠
D01E0
󐇡
D01E1
󐇢
D01E2
󐇣
D01E3
󐇤
D01E4
󐇥
D01E5
󐇦
D01E6
󐇧
D01E7
󐇨
D01E8
󐇩
D01E9
󐇪
D01EA
󐇫
D01EB
󐇬
D01EC
󐇭
D01ED
󐇮
D01EE
󐇯
D01EF
A0
B0
󐇰
D01F0
󐇱
D01F1
󐇲
D01F2
󐇳
D01F3
󐇴
D01F4
󐇵
D01F5
󐇶
D01F6
󐇷
D01F7
󐇸
D01F8
󐇹
D01F9
󐇺
D01FA
󐇻
D01FB
󐇼
D01FC
󐇽
D01FD
󐇾
D01FE
󐇿
D01FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]