International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0998B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𙋀
192C0
𙋁
192C1
𙋂
192C2
𙋃
192C3
𙋄
192C4
𙋅
192C5
𙋆
192C6
𙋇
192C7
𙋈
192C8
𙋉
192C9
𙋊
192CA
𙋋
192CB
𙋌
192CC
𙋍
192CD
𙋎
192CE
𙋏
192CF
80
90
𙋐
192D0
𙋑
192D1
𙋒
192D2
𙋓
192D3
𙋔
192D4
𙋕
192D5
𙋖
192D6
𙋗
192D7
𙋘
192D8
𙋙
192D9
𙋚
192DA
𙋛
192DB
𙋜
192DC
𙋝
192DD
𙋞
192DE
𙋟
192DF
90
A0
𙋠
192E0
𙋡
192E1
𙋢
192E2
𙋣
192E3
𙋤
192E4
𙋥
192E5
𙋦
192E6
𙋧
192E7
𙋨
192E8
𙋩
192E9
𙋪
192EA
𙋫
192EB
𙋬
192EC
𙋭
192ED
𙋮
192EE
𙋯
192EF
A0
B0
𙋰
192F0
𙋱
192F1
𙋲
192F2
𙋳
192F3
𙋴
192F4
𙋵
192F5
𙋶
192F6
𙋷
192F7
𙋸
192F8
𙋹
192F9
𙋺
192FA
𙋻
192FB
𙋼
192FC
𙋽
192FD
𙋾
192FE
𙋿
192FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]