International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48298

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􂘀
102600
􂘁
102601
􂘂
102602
􂘃
102603
􂘄
102604
􂘅
102605
􂘆
102606
􂘇
102607
􂘈
102608
􂘉
102609
􂘊
10260A
􂘋
10260B
􂘌
10260C
􂘍
10260D
􂘎
10260E
􂘏
10260F
80
90
􂘐
102610
􂘑
102611
􂘒
102612
􂘓
102613
􂘔
102614
􂘕
102615
􂘖
102616
􂘗
102617
􂘘
102618
􂘙
102619
􂘚
10261A
􂘛
10261B
􂘜
10261C
􂘝
10261D
􂘞
10261E
􂘟
10261F
90
A0
􂘠
102620
􂘡
102621
􂘢
102622
􂘣
102623
􂘤
102624
􂘥
102625
􂘦
102626
􂘧
102627
􂘨
102628
􂘩
102629
􂘪
10262A
􂘫
10262B
􂘬
10262C
􂘭
10262D
􂘮
10262E
􂘯
10262F
A0
B0
􂘰
102630
􂘱
102631
􂘲
102632
􂘳
102633
􂘴
102634
􂘵
102635
􂘶
102636
􂘷
102637
􂘸
102638
􂘹
102639
􂘺
10263A
􂘻
10263B
􂘼
10263C
􂘽
10263D
􂘾
10263E
􂘿
10263F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]