International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F29987

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򙇀
991C0
򙇁
991C1
򙇂
991C2
򙇃
991C3
򙇄
991C4
򙇅
991C5
򙇆
991C6
򙇇
991C7
򙇈
991C8
򙇉
991C9
򙇊
991CA
򙇋
991CB
򙇌
991CC
򙇍
991CD
򙇎
991CE
򙇏
991CF
80
90
򙇐
991D0
򙇑
991D1
򙇒
991D2
򙇓
991D3
򙇔
991D4
򙇕
991D5
򙇖
991D6
򙇗
991D7
򙇘
991D8
򙇙
991D9
򙇚
991DA
򙇛
991DB
򙇜
991DC
򙇝
991DD
򙇞
991DE
򙇟
991DF
90
A0
򙇠
991E0
򙇡
991E1
򙇢
991E2
򙇣
991E3
򙇤
991E4
򙇥
991E5
򙇦
991E6
򙇧
991E7
򙇨
991E8
򙇩
991E9
򙇪
991EA
򙇫
991EB
򙇬
991EC
򙇭
991ED
򙇮
991EE
򙇯
991EF
A0
B0
򙇰
991F0
򙇱
991F1
򙇲
991F2
򙇳
991F3
򙇴
991F4
򙇵
991F5
򙇶
991F6
򙇷
991F7
򙇸
991F8
򙇹
991F9
򙇺
991FA
򙇻
991FB
򙇼
991FC
򙇽
991FD
򙇾
991FE
򙇿
991FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]