International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
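
The alias list above can be read as a lookup table from any alias to the internal converter name. The following is a minimal sketch of that idea in plain Python, not ICU's actual implementation. The helper names `normalize` and `canonical_name` are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption.

```python
# Aliases copied from the table above; all map to the internal name "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical matching key: lowercase, with '-', '_' and spaces dropped."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# One entry per alias, keyed by its normalized form.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name, or None if the alias is unknown."""
    return ALIAS_TABLE.get(normalize(name))


print(canonical_name("IBM_1208"))       # UTF-8
print(canonical_name("windows-65001"))  # UTF-8
print(canonical_name("latin1"))         # None
```

Keying the table on a normalized form means spelling variants such as `IBM_1208` and `ibm-1208` resolve to the same entry without listing each variant.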


Codepage Layout

Showing the codepage for sequences starting with the bytes F2 A9 90.

Each cell gives the code point (hexadecimal) decoded from F2 A9 90 followed by the final byte, where the final byte is the row label plus the column label.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00 to 70  (no entries)
80  A9400 A9401 A9402 A9403 A9404 A9405 A9406 A9407 A9408 A9409 A940A A940B A940C A940D A940E A940F
90  A9410 A9411 A9412 A9413 A9414 A9415 A9416 A9417 A9418 A9419 A941A A941B A941C A941D A941E A941F
A0  A9420 A9421 A9422 A9423 A9424 A9425 A9426 A9427 A9428 A9429 A942A A942B A942C A942D A942E A942F
B0  A9430 A9431 A9432 A9433 A9434 A9435 A9436 A9437 A9438 A9439 A943A A943B A943C A943D A943E A943F
C0 to F0  (no entries)
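
The layout above follows from the UTF-8 bit layout for 4-byte sequences. As an illustrative sketch (using Python's built-in codec rather than ICU, with hypothetical helper names `decode_manual` and `is_valid_utf8`), the code below decodes F2 A9 90 followed by each final byte from 80 to BF and checks that the results run from U+A9400 to U+A943F.

```python
LEAD = bytes([0xF2, 0xA9, 0x90])  # the three leading bytes shown above


def decode_manual(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence by bit arithmetic (no validation)."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def is_valid_utf8(seq: bytes) -> bool:
    """Return True if Python's strict UTF-8 decoder accepts the sequence."""
    try:
        seq.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Final bytes 80..BF map to consecutive code points starting at U+A9400.
for trail in range(0x80, 0xC0):
    seq = LEAD + bytes([trail])
    code_point = ord(seq.decode("utf-8"))
    assert code_point == decode_manual(seq) == 0xA9400 + (trail - 0x80)

first = decode_manual(LEAD + bytes([0x80]))
last = decode_manual(LEAD + bytes([0xBF]))
print(f"U+{first:X} .. U+{last:X}")            # U+A9400 .. U+A943F

# Final bytes outside 80..BF are not continuation bytes, so those rows are empty.
print(is_valid_utf8(LEAD + bytes([0x7F])))     # False
print(is_valid_utf8(LEAD + bytes([0xC0])))     # False
```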

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
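
Several of these properties can be checked with a short sketch. It uses Python's built-in codecs rather than ICU, so it illustrates the encoding itself, not the converter API. It assumes that UChar means a 16-bit UTF-16 code unit, and the helper name `bytes_per_utf16_unit` is hypothetical.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


print(bytes_per_utf16_unit("A"))           # 1.0 (the minimum)
print(bytes_per_utf16_unit("\u20ac"))      # 3.0 (the maximum, a BMP character)
print(bytes_per_utf16_unit(chr(0xA9400)))  # 2.0 (4 bytes over 2 code units)

# The substitution bytes are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.
print("\ufffd".encode("utf-8") == b"\xef\xbf\xbd")  # True

# ASCII compatibility: each of 0x20..0x7E encodes to the same single byte.
ascii_compatible = all(
    chr(c).encode("utf-8") == bytes([c]) for c in range(0x20, 0x7F)
)
print(ascii_compatible)                    # True
```

A supplementary character takes 4 bytes but occupies two UTF-16 code units, which is why the per-unit maximum is 3 and not 4.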

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
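
The set `[^\uD800-\uDFFF]` covers every code point except the surrogates. A minimal sketch of the same check, using Python's strict UTF-8 encoder as a stand-in for the converter (the helper name `representable` is hypothetical):

```python
def representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as strict UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


print(representable(0xD7FF))    # True  (last code point before the surrogates)
print(representable(0xD800))    # False (first surrogate)
print(representable(0xDFFF))    # False (last surrogate)
print(representable(0xE000))    # True  (first code point after the surrogates)
print(representable(0x10FFFF))  # True  (highest Unicode code point)

# All 0x110000 code points minus the 0x800 surrogates.
total = sum(representable(cp) for cp in range(0x110000))
print(total)                    # 1112064
```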