International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
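
As a rough illustration of how an alias list like this can be used, the following Python sketch resolves any listed alias to the canonical name "UTF-8". The `normalize` and `canonical_name` helpers are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption rather than a description of ICU's own lookup.

```python
import re

# Aliases copied from the list above; the canonical converter name is "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical normalization: lowercase and drop '-', '_' and spaces."""
    return re.sub(r"[-_ ]", "", name).lower()


# Map every normalized alias to the canonical name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the canonical converter name, or None if the alias is unknown."""
    return ALIAS_TABLE.get(normalize(name))


if __name__ == "__main__":
    for candidate in ("IBM_1208", "Windows-65001", "latin-1"):
        print(candidate, "->", canonical_name(candidate))
```

A dictionary keyed on the normalized form keeps the lookup to a single hash probe and makes spelling variants such as `ibm-1208` and `IBM_1208` resolve identically.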


Codepage Layout

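Currently showing the codepage layout for byte sequences starting with the bytes F2 80 98. Each cell gives the Unicode code point (hexadecimal) selected by the final byte.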

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  80600 80601 80602 80603 80604 80605 80606 80607 80608 80609 8060A 8060B 8060C 8060D 8060E 8060F
90  80610 80611 80612 80613 80614 80615 80616 80617 80618 80619 8061A 8061B 8061C 8061D 8061E 8061F
A0  80620 80621 80622 80623 80624 80625 80626 80627 80628 80629 8062A 8062B 8062C 8062D 8062E 8062F
B0  80630 80631 80632 80633 80634 80635 80636 80637 80638 80639 8063A 8063B 8063C 8063D 8063E 8063F
C0
D0
E0
F0
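
The mapping in this layout follows from the standard UTF-8 bit layout for four-byte sequences (11110xxx 10xxxxxx 10xxxxxx 10xxxxxx). The sketch below is a minimal illustration, not ICU code: `decode_4byte` is a hypothetical helper that checks only the bit patterns (it does not reject overlong or out-of-range sequences), and the result is cross-checked against Python's built-in decoder.

```python
def decode_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point.

    Only the lead/trail bit patterns are validated here.
    """
    if len(seq) != 4:
        raise ValueError("expected exactly four bytes")
    b0, b1, b2, b3 = seq
    if b0 & 0xF8 != 0xF0:
        raise ValueError("lead byte must match 11110xxx")
    if any(t & 0xC0 != 0x80 for t in (b1, b2, b3)):
        raise ValueError("trail bytes must match 10xxxxxx")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


prefix = bytes.fromhex("F28098")
first = decode_4byte(prefix + b"\x80")
last = decode_4byte(prefix + b"\xBF")
print(f"{first:X} {last:X}")  # prints "80600 8063F"

# Cross-check against the built-in UTF-8 decoder.
assert ord((prefix + b"\x80").decode("utf-8")) == first
```

Only final bytes 80 through BF are valid trail bytes, which is why the rows 00 to 70 and C0 to F0 are empty.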

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
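
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, so a four-byte UTF-8 sequence (two UTF-16 code units) still averages two bytes per UChar. The Python sketch below illustrates this with a few sample characters chosen for this example; `utf8_bytes` and `utf16_units` are hypothetical helpers, and the check is illustrative rather than a proof.

```python
def utf8_bytes(ch: str) -> int:
    """Number of bytes the character occupies in UTF-8."""
    return len(ch.encode("utf-8"))


def utf16_units(ch: str) -> int:
    """Number of 16-bit code units the character occupies in UTF-16."""
    return len(ch.encode("utf-16-le")) // 2


# Sample characters (assumed for illustration): ASCII, Latin-1, BMP, supplementary.
samples = ["A", "\u00E9", "\u20AC", "\U00080600"]

ratios = [utf8_bytes(ch) / utf16_units(ch) for ch in samples]
for ch, ratio in zip(samples, ratios):
    print(f"U+{ord(ch):04X}: {utf8_bytes(ch)} bytes / {utf16_units(ch)} units = {ratio}")

# The substitution character listed above is U+FFFD encoded in UTF-8.
substitution = "\uFFFD".encode("utf-8")
print(substitution.hex().upper())  # prints "EFBFBD"
```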

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
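
The set notation above excludes only the surrogate range U+D800 to U+DFFF. As a small sketch of what that means in practice, the hypothetical `is_representable` helper below mirrors the set, and `encodes_strictly` checks the same property against Python's strict UTF-8 encoder, which refuses lone surrogates.

```python
def is_representable(cp: int) -> bool:
    """True if the code point is in [^\\uD800-\\uDFFF] within U+0000..U+10FFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_strictly(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary code points around the surrogate range, plus the last code point.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp), encodes_strictly(cp))
```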