International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
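
As an illustration of how an alias list like this might be consulted, here is a minimal Python sketch. The names `UTF8_ALIASES`, `normalize`, and `is_utf8_alias` are hypothetical and not part of ICU. The loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for this sketch and may not match ICU's own comparison rules exactly.

```python
# Aliases copied from the list above. All names here are illustrative only.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (assumed loose-matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Precompute the normalized forms once so lookups are a set membership test.
_NORMALIZED = {normalize(alias) for alias in UTF8_ALIASES}


def is_utf8_alias(name: str) -> bool:
    """Return True if `name` loosely matches one of the listed aliases."""
    return normalize(name) in _NORMALIZED


print(is_utf8_alias("IBM_1208"))  # True
print(is_utf8_alias("latin-1"))   # False
```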


Codepage Layout

Currently showing the codepage starting with the bytes F2 96 98

Each cell gives the Unicode code point (hex) selected by the final byte, whose value is the row label plus the column label. Rows 00-70 and C0-F0 are empty.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70 (empty)
80 96600 96601 96602 96603 96604 96605 96606 96607 96608 96609 9660A 9660B 9660C 9660D 9660E 9660F
90 96610 96611 96612 96613 96614 96615 96616 96617 96618 96619 9661A 9661B 9661C 9661D 9661E 9661F
A0 96620 96621 96622 96623 96624 96625 96626 96627 96628 96629 9662A 9662B 9662C 9662D 9662E 9662F
B0 96630 96631 96632 96633 96634 96635 96636 96637 96638 96639 9663A 9663B 9663C 9663D 9663E 9663F
C0-F0 (empty)
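
The mapping in the layout above follows from the standard four-byte UTF-8 bit layout. The sketch below decodes the prefix F2 96 98 plus a final byte by hand and cross-checks the result against Python's built-in codec. The function name `decode_four_byte` is hypothetical, and it is deliberately not a full validator: it does not reject overlong forms or values above U+10FFFF.

```python
def decode_four_byte(seq: bytes) -> int:
    """Decode one four-byte UTF-8 sequence to a code point (no range checks)."""
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected four bytes with a lead byte in F0..F7")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in 80..BF")
    # 3 payload bits from the lead byte, 6 from each of the three trail bytes.
    return (
        (seq[0] & 0x07) << 18
        | (seq[1] & 0x3F) << 12
        | (seq[2] & 0x3F) << 6
        | (seq[3] & 0x3F)
    )


PREFIX = bytes.fromhex("F29698")

first = decode_four_byte(PREFIX + b"\x80")
last = decode_four_byte(PREFIX + b"\xBF")
print(f"{first:X} {last:X}")  # 96600 9663F

# Cross-check against Python's own UTF-8 decoder.
assert chr(first) == (PREFIX + b"\x80").decode("utf-8")
```

Only final bytes in the range 80..BF are valid continuation bytes, which is why the other rows of the grid are empty.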

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
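
Two of the values above can be checked with a short Python sketch. It assumes that a UChar is a 16-bit UTF-16 code unit, which would explain why the maximum is 3 rather than 4: a four-byte UTF-8 sequence corresponds to two UTF-16 code units, so it averages 2 bytes per unit. The helper name `bytes_per_utf16_unit` is hypothetical.

```python
# The substitution bytes listed above decode to U+FFFD REPLACEMENT CHARACTER.
SUBSTITUTION = b"\xEF\xBF\xBD"
print(hex(ord(SUBSTITUTION.decode("utf-8"))))  # 0xfffd


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code-unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# One sample from each UTF-8 length class: 1, 2, 3 and 4 bytes.
for cp in (0x41, 0x7FF, 0xFFFD, 0x96600):
    print(f"U+{cp:04X}: {bytes_per_utf16_unit(chr(cp))}")
```

Under that assumption the ratio ranges from 1.0 (ASCII) to 3.0 (three-byte characters in the Basic Multilingual Plane), consistent with the minimum and maximum reported.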

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
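
The set above reads as "every code point except the surrogates D800..DFFF". A minimal Python sketch of that membership test follows. The function names are hypothetical, and the comparison with Python's strict `utf-8` codec is only a cross-check of the same rule, not a statement about ICU's behavior.

```python
def is_representable(code_point: int) -> bool:
    """True if the code point lies outside the surrogate range D800..DFFF."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("not a Unicode code point")
    return not (0xD800 <= code_point <= 0xDFFF)


def encodes_in_python(code_point: int) -> bool:
    """Cross-check: does Python's strict UTF-8 codec accept this code point?"""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values on both sides of the surrogate range, plus a grid entry.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x96600):
    assert is_representable(cp) == encodes_in_python(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```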