International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1B894

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񸔀
78500
񸔁
78501
񸔂
78502
񸔃
78503
񸔄
78504
񸔅
78505
񸔆
78506
񸔇
78507
񸔈
78508
񸔉
78509
񸔊
7850A
񸔋
7850B
񸔌
7850C
񸔍
7850D
񸔎
7850E
񸔏
7850F
80
90
񸔐
78510
񸔑
78511
񸔒
78512
񸔓
78513
񸔔
78514
񸔕
78515
񸔖
78516
񸔗
78517
񸔘
78518
񸔙
78519
񸔚
7851A
񸔛
7851B
񸔜
7851C
񸔝
7851D
񸔞
7851E
񸔟
7851F
90
A0
񸔠
78520
񸔡
78521
񸔢
78522
񸔣
78523
񸔤
78524
񸔥
78525
񸔦
78526
񸔧
78527
񸔨
78528
񸔩
78529
񸔪
7852A
񸔫
7852B
񸔬
7852C
񸔭
7852D
񸔮
7852E
񸔯
7852F
A0
B0
񸔰
78530
񸔱
78531
񸔲
78532
񸔳
78533
񸔴
78534
񸔵
78535
񸔶
78536
񸔷
78537
񸔸
78538
񸔹
78539
񸔺
7853A
񸔻
7853B
񸔼
7853C
񸔽
7853D
񸔾
7853E
񸔿
7853F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]