International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48A90

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􊐀
10A400
􊐁
10A401
􊐂
10A402
􊐃
10A403
􊐄
10A404
􊐅
10A405
􊐆
10A406
􊐇
10A407
􊐈
10A408
􊐉
10A409
􊐊
10A40A
􊐋
10A40B
􊐌
10A40C
􊐍
10A40D
􊐎
10A40E
􊐏
10A40F
80
90
􊐐
10A410
􊐑
10A411
􊐒
10A412
􊐓
10A413
􊐔
10A414
􊐕
10A415
􊐖
10A416
􊐗
10A417
􊐘
10A418
􊐙
10A419
􊐚
10A41A
􊐛
10A41B
􊐜
10A41C
􊐝
10A41D
􊐞
10A41E
􊐟
10A41F
90
A0
􊐠
10A420
􊐡
10A421
􊐢
10A422
􊐣
10A423
􊐤
10A424
􊐥
10A425
􊐦
10A426
􊐧
10A427
􊐨
10A428
􊐩
10A429
􊐪
10A42A
􊐫
10A42B
􊐬
10A42C
􊐭
10A42D
􊐮
10A42E
􊐯
10A42F
A0
B0
􊐰
10A430
􊐱
10A431
􊐲
10A432
􊐳
10A433
􊐴
10A434
􊐵
10A435
􊐶
10A436
􊐷
10A437
􊐸
10A438
􊐹
10A439
􊐺
10A43A
􊐻
10A43B
􊐼
10A43C
􊐽
10A43D
􊐾
10A43E
􊐿
10A43F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]