International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38F94

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󏔀
CF500
󏔁
CF501
󏔂
CF502
󏔃
CF503
󏔄
CF504
󏔅
CF505
󏔆
CF506
󏔇
CF507
󏔈
CF508
󏔉
CF509
󏔊
CF50A
󏔋
CF50B
󏔌
CF50C
󏔍
CF50D
󏔎
CF50E
󏔏
CF50F
80
90
󏔐
CF510
󏔑
CF511
󏔒
CF512
󏔓
CF513
󏔔
CF514
󏔕
CF515
󏔖
CF516
󏔗
CF517
󏔘
CF518
󏔙
CF519
󏔚
CF51A
󏔛
CF51B
󏔜
CF51C
󏔝
CF51D
󏔞
CF51E
󏔟
CF51F
90
A0
󏔠
CF520
󏔡
CF521
󏔢
CF522
󏔣
CF523
󏔤
CF524
󏔥
CF525
󏔦
CF526
󏔧
CF527
󏔨
CF528
󏔩
CF529
󏔪
CF52A
󏔫
CF52B
󏔬
CF52C
󏔭
CF52D
󏔮
CF52E
󏔯
CF52F
A0
B0
󏔰
CF530
󏔱
CF531
󏔲
CF532
󏔳
CF533
󏔴
CF534
󏔵
CF535
󏔶
CF536
󏔷
CF537
󏔸
CF538
󏔹
CF539
󏔺
CF53A
󏔻
CF53B
󏔼
CF53C
󏔽
CF53D
󏔾
CF53E
󏔿
CF53F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]