International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39089

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󐉀
D0240
󐉁
D0241
󐉂
D0242
󐉃
D0243
󐉄
D0244
󐉅
D0245
󐉆
D0246
󐉇
D0247
󐉈
D0248
󐉉
D0249
󐉊
D024A
󐉋
D024B
󐉌
D024C
󐉍
D024D
󐉎
D024E
󐉏
D024F
80
90
󐉐
D0250
󐉑
D0251
󐉒
D0252
󐉓
D0253
󐉔
D0254
󐉕
D0255
󐉖
D0256
󐉗
D0257
󐉘
D0258
󐉙
D0259
󐉚
D025A
󐉛
D025B
󐉜
D025C
󐉝
D025D
󐉞
D025E
󐉟
D025F
90
A0
󐉠
D0260
󐉡
D0261
󐉢
D0262
󐉣
D0263
󐉤
D0264
󐉥
D0265
󐉦
D0266
󐉧
D0267
󐉨
D0268
󐉩
D0269
󐉪
D026A
󐉫
D026B
󐉬
D026C
󐉭
D026D
󐉮
D026E
󐉯
D026F
A0
B0
󐉰
D0270
󐉱
D0271
󐉲
D0272
󐉳
D0273
󐉴
D0274
󐉵
D0275
󐉶
D0276
󐉷
D0277
󐉸
D0278
󐉹
D0279
󐉺
D027A
󐉻
D027B
󐉼
D027C
󐉽
D027D
󐉾
D027E
󐉿
D027F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]