International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
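
As a small illustration of how the alias list maps onto one converter, the sketch below resolves any listed alias to the internal converter name. The `resolve` helper and the `ALIASES` list are hypothetical names introduced here; the lookup is a plain case-insensitive match and does not reproduce ICU's own name matching, which is implemented inside the library.

```python
# Aliases copied from the list above; all of them name the same converter.
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]
CANONICAL = "UTF-8"  # the internal converter name shown above

# Hypothetical lookup table: lower-cased alias -> internal converter name.
_LOOKUP = {alias.lower(): CANONICAL for alias in ALIASES}


def resolve(name: str):
    """Return the internal converter name for a listed alias, or None."""
    return _LOOKUP.get(name.lower())


print(resolve("CP1208"))         # UTF-8
print(resolve("windows-65001"))  # UTF-8
print(resolve("latin-1"))        # None
```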


Codepage Layout

Currently showing the codepage starting with the bytes F2A480

Rows give the high nibble of the final byte and columns the low nibble. Each cell is the code point (hex) encoded by F2 A4 80 followed by that byte.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (no entries)
80     A4000 A4001 A4002 A4003 A4004 A4005 A4006 A4007 A4008 A4009 A400A A400B A400C A400D A400E A400F
90     A4010 A4011 A4012 A4013 A4014 A4015 A4016 A4017 A4018 A4019 A401A A401B A401C A401D A401E A401F
A0     A4020 A4021 A4022 A4023 A4024 A4025 A4026 A4027 A4028 A4029 A402A A402B A402C A402D A402E A402F
B0     A4030 A4031 A4032 A4033 A4034 A4035 A4036 A4037 A4038 A4039 A403A A403B A403C A403D A403E A403F
C0-F0  (no entries)
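
The mapping from the byte prefix F2A480 to the code points in the layout can be checked by hand. The following is a minimal sketch of decoding one 4-byte UTF-8 sequence, assuming well-formed input; `decode_utf8_4byte` is a hypothetical helper written for this page, not an ICU function, and it does not check for overlong forms or values above 10FFFF.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point."""
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


prefix = bytes.fromhex("F2A480")
first = decode_utf8_4byte(prefix + b"\x80")  # top-left cell of row 80
last = decode_utf8_4byte(prefix + b"\xBF")   # last cell of row B0
print(f"{first:X} {last:X}")  # A4000 A403F

# Cross-check against Python's built-in UTF-8 decoder.
assert (prefix + b"\x80").decode("utf-8") == chr(0xA4000)
```

Only final bytes 80 through BF yield a value, which matches the four populated rows of the layout.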

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
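
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though a UTF-8 sequence can be 4 bytes long. The sketch below illustrates this with Python's standard codecs rather than ICU; `bytes_per_uchar` is a hypothetical helper, and the sample characters are chosen only for illustration.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# U+0041, U+00E9, U+20AC, and the supplementary code point U+A4000.
samples = ["A", "\u00E9", "\u20AC", "\U000A4000"]
ratios = [bytes_per_uchar(c) for c in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The substitution bytes \xEF\xBF\xBD are the UTF-8 form of U+FFFD.
substitution = "\uFFFD".encode("utf-8")
print(substitution.hex().upper())  # EFBFBD
```

A supplementary character takes 4 bytes but occupies two UTF-16 code units, so it contributes 2 bytes per UChar; the maximum of 3 comes from the upper part of the Basic Multilingual Plane.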

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
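
The set above reads as "every code point except the surrogates D800 through DFFF". A minimal sketch of that membership test follows, assuming the Unicode code space of 0 through 10FFFF; `is_representable` is a hypothetical helper, and the encoder check uses Python's strict UTF-8 codec rather than ICU.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a code point outside the surrogate range D800..DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


print(is_representable(0xA4000))  # True
print(is_representable(0xD800))   # False
print(is_representable(0xDFFF))   # False
print(is_representable(0xE000))   # True

# Python's strict UTF-8 encoder also refuses a lone surrogate.
try:
    "\ud800".encode("utf-8")
    strict_rejects = False
except UnicodeEncodeError:
    strict_rejects = True
print(strict_rejects)  # True
```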