International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A38D

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򣍀
A3340
򣍁
A3341
򣍂
A3342
򣍃
A3343
򣍄
A3344
򣍅
A3345
򣍆
A3346
򣍇
A3347
򣍈
A3348
򣍉
A3349
򣍊
A334A
򣍋
A334B
򣍌
A334C
򣍍
A334D
򣍎
A334E
򣍏
A334F
80
90
򣍐
A3350
򣍑
A3351
򣍒
A3352
򣍓
A3353
򣍔
A3354
򣍕
A3355
򣍖
A3356
򣍗
A3357
򣍘
A3358
򣍙
A3359
򣍚
A335A
򣍛
A335B
򣍜
A335C
򣍝
A335D
򣍞
A335E
򣍟
A335F
90
A0
򣍠
A3360
򣍡
A3361
򣍢
A3362
򣍣
A3363
򣍤
A3364
򣍥
A3365
򣍦
A3366
򣍧
A3367
򣍨
A3368
򣍩
A3369
򣍪
A336A
򣍫
A336B
򣍬
A336C
򣍭
A336D
򣍮
A336E
򣍯
A336F
A0
B0
򣍰
A3370
򣍱
A3371
򣍲
A3372
򣍳
A3373
򣍴
A3374
򣍵
A3375
򣍶
A3376
򣍷
A3377
򣍸
A3378
򣍹
A3379
򣍺
A337A
򣍻
A337B
򣍼
A337C
򣍽
A337D
򣍾
A337E
򣍿
A337F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]