International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A398

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򣘀
A3600
򣘁
A3601
򣘂
A3602
򣘃
A3603
򣘄
A3604
򣘅
A3605
򣘆
A3606
򣘇
A3607
򣘈
A3608
򣘉
A3609
򣘊
A360A
򣘋
A360B
򣘌
A360C
򣘍
A360D
򣘎
A360E
򣘏
A360F
80
90
򣘐
A3610
򣘑
A3611
򣘒
A3612
򣘓
A3613
򣘔
A3614
򣘕
A3615
򣘖
A3616
򣘗
A3617
򣘘
A3618
򣘙
A3619
򣘚
A361A
򣘛
A361B
򣘜
A361C
򣘝
A361D
򣘞
A361E
򣘟
A361F
90
A0
򣘠
A3620
򣘡
A3621
򣘢
A3622
򣘣
A3623
򣘤
A3624
򣘥
A3625
򣘦
A3626
򣘧
A3627
򣘨
A3628
򣘩
A3629
򣘪
A362A
򣘫
A362B
򣘬
A362C
򣘭
A362D
򣘮
A362E
򣘯
A362F
A0
B0
򣘰
A3630
򣘱
A3631
򣘲
A3632
򣘳
A3633
򣘴
A3634
򣘵
A3635
򣘶
A3636
򣘷
A3637
򣘸
A3638
򣘹
A3639
򣘺
A363A
򣘻
A363B
򣘼
A363C
򣘽
A363D
򣘾
A363E
򣘿
A363F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]