International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B790

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𷐀
37400
𷐁
37401
𷐂
37402
𷐃
37403
𷐄
37404
𷐅
37405
𷐆
37406
𷐇
37407
𷐈
37408
𷐉
37409
𷐊
3740A
𷐋
3740B
𷐌
3740C
𷐍
3740D
𷐎
3740E
𷐏
3740F
80
90
𷐐
37410
𷐑
37411
𷐒
37412
𷐓
37413
𷐔
37414
𷐕
37415
𷐖
37416
𷐗
37417
𷐘
37418
𷐙
37419
𷐚
3741A
𷐛
3741B
𷐜
3741C
𷐝
3741D
𷐞
3741E
𷐟
3741F
90
A0
𷐠
37420
𷐡
37421
𷐢
37422
𷐣
37423
𷐤
37424
𷐥
37425
𷐦
37426
𷐧
37427
𷐨
37428
𷐩
37429
𷐪
3742A
𷐫
3742B
𷐬
3742C
𷐭
3742D
𷐮
3742E
𷐯
3742F
A0
B0
𷐰
37430
𷐱
37431
𷐲
37432
𷐳
37433
𷐴
37434
𷐵
37435
𷐶
37436
𷐷
37437
𷐸
37438
𷐹
37439
𷐺
3743A
𷐻
3743B
𷐼
3743C
𷐽
3743D
𷐾
3743E
𷐿
3743F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]