International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B48D

Rows give the high nibble of the final byte and columns give the low nibble. Each cell shows the character with its code point (hexadecimal) beneath it.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80     𴍀      𴍁      𴍂      𴍃      𴍄      𴍅      𴍆      𴍇      𴍈      𴍉      𴍊      𴍋      𴍌      𴍍      𴍎      𴍏
       34340  34341  34342  34343  34344  34345  34346  34347  34348  34349  3434A  3434B  3434C  3434D  3434E  3434F
90     𴍐      𴍑      𴍒      𴍓      𴍔      𴍕      𴍖      𴍗      𴍘      𴍙      𴍚      𴍛      𴍜      𴍝      𴍞      𴍟
       34350  34351  34352  34353  34354  34355  34356  34357  34358  34359  3435A  3435B  3435C  3435D  3435E  3435F
A0     𴍠      𴍡      𴍢      𴍣      𴍤      𴍥      𴍦      𴍧      𴍨      𴍩      𴍪      𴍫      𴍬      𴍭      𴍮      𴍯
       34360  34361  34362  34363  34364  34365  34366  34367  34368  34369  3436A  3436B  3436C  3436D  3436E  3436F
B0     𴍰      𴍱      𴍲      𴍳      𴍴      𴍵      𴍶      𴍷      𴍸      𴍹      𴍺      𴍻      𴍼      𴍽      𴍾      𴍿
       34370  34371  34372  34373  34374  34375  34376  34377  34378  34379  3437A  3437B  3437C  3437D  3437E  3437F
C0-F0  (empty)
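
The mapping in this layout follows from the standard four-byte UTF-8 bit layout: the lead byte contributes 3 bits and each continuation byte contributes 6. The following is a minimal sketch in Python rather than ICU4C; the helper name `decode_four_byte` and the constant `PREFIX` are illustrative assumptions, not part of ICU or of the demo page.

```python
def decode_four_byte(b: bytes) -> int:
    """Combine the payload bits of one 4-byte UTF-8 sequence into a code point.

    Only the length is checked here; a real decoder must also validate
    the lead byte and the continuation-byte ranges.
    """
    if len(b) != 4:
        raise ValueError("expected exactly four bytes")
    return (
        ((b[0] & 0x07) << 18)    # lead byte 11110xxx: 3 payload bits
        | ((b[1] & 0x3F) << 12)  # continuation 10xxxxxx: 6 payload bits
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


# The three bytes this page is showing (hypothetical constant name).
PREFIX = bytes.fromhex("F0B48D")

first = decode_four_byte(PREFIX + b"\x80")
last = decode_four_byte(PREFIX + b"\xBF")
print(f"{first:X}..{last:X}")  # prints 34340..3437F

# Cross-check against Python's built-in UTF-8 codec.
assert (PREFIX + b"\x80").decode("utf-8") == chr(first)
assert (PREFIX + b"\xBF").decode("utf-8") == chr(last)
```

Because a continuation byte must lie in 80 to BF, only rows 80 through B0 of the grid can be populated for any three-byte prefix.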

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
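
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, so a supplementary character (4 UTF-8 bytes, 2 UChars) still averages no more than 3 bytes per UChar. The sketch below checks this with Python's standard codecs instead of ICU; the helper name `utf8_bytes_and_uchars` is an illustrative assumption.

```python
def utf8_bytes_and_uchars(ch: str) -> tuple:
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len, utf16_units


for ch in ("A", "\u00E9", "\uFFFD", chr(0x34340)):
    n_bytes, n_uchars = utf8_bytes_and_uchars(ch)
    print(f"U+{ord(ch):04X}: {n_bytes} bytes / {n_uchars} UChar(s)")

# The substitution character listed above is U+FFFD encoded in UTF-8.
assert "\uFFFD".encode("utf-8") == b"\xEF\xBF\xBD"
```

The ratio peaks at 3 for BMP characters from U+0800 upward and bottoms out at 1 for ASCII, matching the minimum and maximum reported for this converter.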

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
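
The set above excludes only the surrogate range U+D800 to U+DFFF, which has no well-formed UTF-8 encoding. As a rough illustration using Python's strict UTF-8 codec (not ICU's `UnicodeSet`), the hypothetical helper `is_representable` below tests individual code points against that rule.

```python
def is_representable(cp: int) -> bool:
    """Return True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(cp).encode("utf-8")  # strict mode rejects lone surrogates
        return True
    except UnicodeEncodeError:
        return False


# Probe the boundaries of the excluded surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```

Every other code point from U+0000 to U+10FFFF is encodable, including unassigned ones such as those shown in the codepage layout.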