International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
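
As a rough illustration of what an alias table does, the sketch below maps each alias listed above to the internal converter name. The names `ALIASES` and `canonical_name` are hypothetical, and the lookup only folds case; it is not ICU's own alias resolution, which is exposed through the ICU converter API.

```python
# Aliases copied from the list above; the table itself is an illustrative assumption.
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

CANONICAL = "UTF-8"

# Case-folded lookup table: every alias points at the internal converter name.
_LOOKUP = {alias.lower(): CANONICAL for alias in ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for a known alias, else None."""
    return _LOOKUP.get(name.strip().lower())


if __name__ == "__main__":
    print(canonical_name("IBM-1208"))
    print(canonical_name("windows-65001"))
    print(canonical_name("latin-1"))
```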


Codepage Layout

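Currently showing the part of the codepage whose byte sequences start with the bytes F1 A4 9C. Each cell shows the character and its Unicode code point (hexadecimal).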

Rows give the high nibble of the final byte and columns give the low nibble. Rows 00-70 and C0-F0 contain no characters.

    00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
80  񤜀 64700  񤜁 64701  񤜂 64702  񤜃 64703  񤜄 64704  񤜅 64705  񤜆 64706  񤜇 64707  񤜈 64708  񤜉 64709  񤜊 6470A  񤜋 6470B  񤜌 6470C  񤜍 6470D  񤜎 6470E  񤜏 6470F
90  񤜐 64710  񤜑 64711  񤜒 64712  񤜓 64713  񤜔 64714  񤜕 64715  񤜖 64716  񤜗 64717  񤜘 64718  񤜙 64719  񤜚 6471A  񤜛 6471B  񤜜 6471C  񤜝 6471D  񤜞 6471E  񤜟 6471F
A0  񤜠 64720  񤜡 64721  񤜢 64722  񤜣 64723  񤜤 64724  񤜥 64725  񤜦 64726  񤜧 64727  񤜨 64728  񤜩 64729  񤜪 6472A  񤜫 6472B  񤜬 6472C  񤜭 6472D  񤜮 6472E  񤜯 6472F
B0  񤜰 64730  񤜱 64731  񤜲 64732  񤜳 64733  񤜴 64734  񤜵 64735  񤜶 64736  񤜷 64737  񤜸 64738  񤜹 64739  񤜺 6473A  񤜻 6473B  񤜼 6473C  񤜽 6473D  񤜾 6473E  񤜿 6473F
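
The mapping from the byte prefix F1 A4 9C plus a final byte to the code points shown in the layout can be checked by hand. The following is a minimal sketch that unpacks a 4-byte UTF-8 sequence with bit masks; `decode_utf8_4byte` is a hypothetical helper written for this illustration, not an ICU function, and it only checks the byte patterns rather than performing full UTF-8 validation.

```python
def decode_utf8_4byte(seq: bytes) -> int:
    """Unpack one 4-byte UTF-8 sequence into a code point.

    Only the lead/trail bit patterns are checked; overlong forms and
    out-of-range values are not rejected in this sketch.
    """
    if len(seq) != 4:
        raise ValueError("expected exactly 4 bytes")
    b0, b1, b2, b3 = seq
    # Lead byte must look like 11110xxx, trail bytes like 10xxxxxx.
    if b0 & 0xF8 != 0xF0 or any(b & 0xC0 != 0x80 for b in (b1, b2, b3)):
        raise ValueError("not a 4-byte UTF-8 pattern")
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


PREFIX = bytes.fromhex("F1A49C")

first = decode_utf8_4byte(PREFIX + b"\x80")  # top-left cell of row 80
last = decode_utf8_4byte(PREFIX + b"\xBF")   # bottom-right cell of row B0

if __name__ == "__main__":
    print(f"{first:X} {last:X}")  # prints "64700 6473F"
    # Cross-check against Python's built-in UTF-8 codec.
    assert (PREFIX + b"\x80").decode("utf-8") == chr(first)
```

The final byte contributes only its low six bits, which is why the 64 values 80 through BF cover exactly the code points 64700 through 6473F.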

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
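
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. The sketch below uses Python's standard codecs (not ICU) to show where the minimum of 1 and maximum of 3 come from, and that the substitution bytes are the UTF-8 encoding of U+FFFD. The helper name `bytes_per_uchar` is hypothetical.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, a 2-byte character, a 3-byte BMP character, and a supplementary
# character that needs 4 UTF-8 bytes but two UTF-16 code units.
SAMPLES = ["A", "\u00e9", "\u20ac", "\U00064700"]

ratios = [bytes_per_uchar(ch) for ch in SAMPLES]

# Python's "replace" error handler substitutes U+FFFD for malformed input.
substitution = b"\xff".decode("utf-8", errors="replace").encode("utf-8")

if __name__ == "__main__":
    for ch, ratio in zip(SAMPLES, ratios):
        print(f"U+{ord(ch):04X}: {ratio} bytes per UTF-16 code unit")
    print(substitution)
```

A supplementary character takes 4 bytes in UTF-8 but occupies two UChars, so it averages 2 bytes per UChar; the maximum of 3 is reached by BMP characters from U+0800 upward.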

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
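
The set above reads as "every code point except the surrogates D800 through DFFF". A minimal sketch of that membership test follows, cross-checked against Python's own UTF-8 encoder, which also refuses lone surrogates. Both function names are hypothetical and chosen for this illustration.

```python
def representable(cp: int) -> bool:
    """True if cp falls in the set [^\\uD800-\\uDFFF] within the Unicode range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values around the surrogate block, plus one cell from the layout.
CHECKS = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x64700, 0x10FFFF]

if __name__ == "__main__":
    for cp in CHECKS:
        print(f"U+{cp:04X}: {representable(cp)} {python_can_encode(cp)}")
```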