International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0A998

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𩘀
29600
𩘁
29601
𩘂
29602
𩘃
29603
𩘄
29604
𩘅
29605
𩘆
29606
𩘇
29607
𩘈
29608
𩘉
29609
𩘊
2960A
𩘋
2960B
𩘌
2960C
𩘍
2960D
𩘎
2960E
𩘏
2960F
80
90
𩘐
29610
𩘑
29611
𩘒
29612
𩘓
29613
𩘔
29614
𩘕
29615
𩘖
29616
𩘗
29617
𩘘
29618
𩘙
29619
𩘚
2961A
𩘛
2961B
𩘜
2961C
𩘝
2961D
𩘞
2961E
𩘟
2961F
90
A0
𩘠
29620
𩘡
29621
𩘢
29622
𩘣
29623
𩘤
29624
𩘥
29625
𩘦
29626
𩘧
29627
𩘨
29628
𩘩
29629
𩘪
2962A
𩘫
2962B
𩘬
2962C
𩘭
2962D
𩘮
2962E
𩘯
2962F
A0
B0
𩘰
29630
𩘱
29631
𩘲
29632
𩘳
29633
𩘴
29634
𩘵
29635
𩘶
29636
𩘷
29637
𩘸
29638
𩘹
29639
𩘺
2963A
𩘻
2963B
𩘼
2963C
𩘽
2963D
𩘾
2963E
𩘿
2963F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]