International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A089

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򠉀
A0240
򠉁
A0241
򠉂
A0242
򠉃
A0243
򠉄
A0244
򠉅
A0245
򠉆
A0246
򠉇
A0247
򠉈
A0248
򠉉
A0249
򠉊
A024A
򠉋
A024B
򠉌
A024C
򠉍
A024D
򠉎
A024E
򠉏
A024F
80
90
򠉐
A0250
򠉑
A0251
򠉒
A0252
򠉓
A0253
򠉔
A0254
򠉕
A0255
򠉖
A0256
򠉗
A0257
򠉘
A0258
򠉙
A0259
򠉚
A025A
򠉛
A025B
򠉜
A025C
򠉝
A025D
򠉞
A025E
򠉟
A025F
90
A0
򠉠
A0260
򠉡
A0261
򠉢
A0262
򠉣
A0263
򠉤
A0264
򠉥
A0265
򠉦
A0266
򠉧
A0267
򠉨
A0268
򠉩
A0269
򠉪
A026A
򠉫
A026B
򠉬
A026C
򠉭
A026D
򠉮
A026E
򠉯
A026F
A0
B0
򠉰
A0270
򠉱
A0271
򠉲
A0272
򠉳
A0273
򠉴
A0274
򠉵
A0275
򠉶
A0276
򠉷
A0277
򠉸
A0278
򠉹
A0279
򠉺
A027A
򠉻
A027B
򠉼
A027C
򠉽
A027D
򠉾
A027E
򠉿
A027F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]