International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1B098

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񰘀
70600
񰘁
70601
񰘂
70602
񰘃
70603
񰘄
70604
񰘅
70605
񰘆
70606
񰘇
70607
񰘈
70608
񰘉
70609
񰘊
7060A
񰘋
7060B
񰘌
7060C
񰘍
7060D
񰘎
7060E
񰘏
7060F
80
90
񰘐
70610
񰘑
70611
񰘒
70612
񰘓
70613
񰘔
70614
񰘕
70615
񰘖
70616
񰘗
70617
񰘘
70618
񰘙
70619
񰘚
7061A
񰘛
7061B
񰘜
7061C
񰘝
7061D
񰘞
7061E
񰘟
7061F
90
A0
񰘠
70620
񰘡
70621
񰘢
70622
񰘣
70623
񰘤
70624
񰘥
70625
񰘦
70626
񰘧
70627
񰘨
70628
񰘩
70629
񰘪
7062A
񰘫
7062B
񰘬
7062C
񰘭
7062D
񰘮
7062E
񰘯
7062F
A0
B0
񰘰
70630
񰘱
70631
񰘲
70632
񰘳
70633
񰘴
70634
񰘵
70635
񰘶
70636
񰘷
70637
񰘸
70638
񰘹
70639
񰘺
7063A
񰘻
7063B
񰘼
7063C
񰘽
7063D
񰘾
7063E
񰘿
7063F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]