International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F29994

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򙔀
99500
򙔁
99501
򙔂
99502
򙔃
99503
򙔄
99504
򙔅
99505
򙔆
99506
򙔇
99507
򙔈
99508
򙔉
99509
򙔊
9950A
򙔋
9950B
򙔌
9950C
򙔍
9950D
򙔎
9950E
򙔏
9950F
80
90
򙔐
99510
򙔑
99511
򙔒
99512
򙔓
99513
򙔔
99514
򙔕
99515
򙔖
99516
򙔗
99517
򙔘
99518
򙔙
99519
򙔚
9951A
򙔛
9951B
򙔜
9951C
򙔝
9951D
򙔞
9951E
򙔟
9951F
90
A0
򙔠
99520
򙔡
99521
򙔢
99522
򙔣
99523
򙔤
99524
򙔥
99525
򙔦
99526
򙔧
99527
򙔨
99528
򙔩
99529
򙔪
9952A
򙔫
9952B
򙔬
9952C
򙔭
9952D
򙔮
9952E
򙔯
9952F
A0
B0
򙔰
99530
򙔱
99531
򙔲
99532
򙔳
99533
򙔴
99534
򙔵
99535
򙔶
99536
򙔷
99537
򙔸
99538
򙔹
99539
򙔺
9953A
򙔻
9953B
򙔼
9953C
򙔽
9953D
򙔾
9953E
򙔿
9953F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]