International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38E90

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󎐀
CE400
󎐁
CE401
󎐂
CE402
󎐃
CE403
󎐄
CE404
󎐅
CE405
󎐆
CE406
󎐇
CE407
󎐈
CE408
󎐉
CE409
󎐊
CE40A
󎐋
CE40B
󎐌
CE40C
󎐍
CE40D
󎐎
CE40E
󎐏
CE40F
80
90
󎐐
CE410
󎐑
CE411
󎐒
CE412
󎐓
CE413
󎐔
CE414
󎐕
CE415
󎐖
CE416
󎐗
CE417
󎐘
CE418
󎐙
CE419
󎐚
CE41A
󎐛
CE41B
󎐜
CE41C
󎐝
CE41D
󎐞
CE41E
󎐟
CE41F
90
A0
󎐠
CE420
󎐡
CE421
󎐢
CE422
󎐣
CE423
󎐤
CE424
󎐥
CE425
󎐦
CE426
󎐧
CE427
󎐨
CE428
󎐩
CE429
󎐪
CE42A
󎐫
CE42B
󎐬
CE42C
󎐭
CE42D
󎐮
CE42E
󎐯
CE42F
A0
B0
󎐰
CE430
󎐱
CE431
󎐲
CE432
󎐳
CE433
󎐴
CE434
󎐵
CE435
󎐶
CE436
󎐷
CE437
󎐸
CE438
󎐹
CE439
󎐺
CE43A
󎐻
CE43B
󎐼
CE43C
󎐽
CE43D
󎐾
CE43E
󎐿
CE43F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]