International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A387

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򣇀
A31C0
򣇁
A31C1
򣇂
A31C2
򣇃
A31C3
򣇄
A31C4
򣇅
A31C5
򣇆
A31C6
򣇇
A31C7
򣇈
A31C8
򣇉
A31C9
򣇊
A31CA
򣇋
A31CB
򣇌
A31CC
򣇍
A31CD
򣇎
A31CE
򣇏
A31CF
80
90
򣇐
A31D0
򣇑
A31D1
򣇒
A31D2
򣇓
A31D3
򣇔
A31D4
򣇕
A31D5
򣇖
A31D6
򣇗
A31D7
򣇘
A31D8
򣇙
A31D9
򣇚
A31DA
򣇛
A31DB
򣇜
A31DC
򣇝
A31DD
򣇞
A31DE
򣇟
A31DF
90
A0
򣇠
A31E0
򣇡
A31E1
򣇢
A31E2
򣇣
A31E3
򣇤
A31E4
򣇥
A31E5
򣇦
A31E6
򣇧
A31E7
򣇨
A31E8
򣇩
A31E9
򣇪
A31EA
򣇫
A31EB
򣇬
A31EC
򣇭
A31ED
򣇮
A31EE
򣇯
A31EF
A0
B0
򣇰
A31F0
򣇱
A31F1
򣇲
A31F2
򣇳
A31F3
򣇴
A31F4
򣇵
A31F5
򣇶
A31F6
򣇷
A31F7
򣇸
A31F8
򣇹
A31F9
򣇺
A31FA
򣇻
A31FB
򣇼
A31FC
򣇽
A31FD
򣇾
A31FE
򣇿
A31FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]