International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
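
As a rough illustration of how an alias list like the one above can be used, the sketch below resolves any listed alias to the internal converter name. The alias strings are copied from the list; the `normalize` and `canonical_name` helpers are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption rather than ICU's exact comparison logic.

```python
# Aliases copied from the list above; the lookup logic itself is illustrative.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Assumed loose matching: ignore case, hyphens, underscores, spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name, or None if the alias is unknown."""
    return ALIAS_TABLE.get(normalize(name))
```

With this table, `canonical_name("IBM_1208")` and `canonical_name("cp1208")` both resolve to `"UTF-8"`, while a name that is not in the list returns `None`.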


Codepage Layout

Currently showing the codepage starting with the bytes F2BF90

Rows give the high nibble and columns the low nibble of the fourth byte. Each cell is the Unicode code point (hex) for the bytes F2 BF 90 followed by that byte.

    00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80  BF400  BF401  BF402  BF403  BF404  BF405  BF406  BF407  BF408  BF409  BF40A  BF40B  BF40C  BF40D  BF40E  BF40F
90  BF410  BF411  BF412  BF413  BF414  BF415  BF416  BF417  BF418  BF419  BF41A  BF41B  BF41C  BF41D  BF41E  BF41F
A0  BF420  BF421  BF422  BF423  BF424  BF425  BF426  BF427  BF428  BF429  BF42A  BF42B  BF42C  BF42D  BF42E  BF42F
B0  BF430  BF431  BF432  BF433  BF434  BF435  BF436  BF437  BF438  BF439  BF43A  BF43B  BF43C  BF43D  BF43E  BF43F
C0-F0  (empty)
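
The code points in the layout follow directly from the UTF-8 bit layout: a four-byte sequence carries 3 payload bits in the lead byte and 6 in each trail byte. The sketch below decodes the sequences F2 BF 90 80 through F2 BF 90 BF by hand. The function name `decode_four_byte` is made up for this example, and it only checks the byte patterns; it does not reject overlong forms or out-of-range values as a real decoder would.

```python
def decode_four_byte(seq: bytes) -> int:
    """Decode one four-byte UTF-8 sequence into a code point by hand."""
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected four bytes with a 11110xxx lead byte")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must match 10xxxxxx (80-BF)")
    # 3 payload bits from the lead byte, then 6 bits from each trail byte.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


PREFIX = bytes([0xF2, 0xBF, 0x90])

# Only 80-BF are valid trail bytes, which is why the other rows are empty.
code_points = [
    decode_four_byte(PREFIX + bytes([last])) for last in range(0x80, 0xC0)
]

for row_start in range(0, len(code_points), 16):
    row = code_points[row_start:row_start + 16]
    print(f"{0x80 + row_start:02X}  " + "  ".join(f"{cp:05X}" for cp in row))
```

The printed rows 80, 90, A0, and B0 run from BF400 to BF43F, matching the layout.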

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
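
One way to read the minimum and maximum figures, assuming UChar means a 16-bit UTF-16 code unit as it does in ICU4C: a four-byte UTF-8 sequence corresponds to a surrogate pair (two UChars), so the bytes-per-UChar ratio never exceeds 3. The sketch below checks this on a few sample characters and decodes the substitution bytes. The helper name and the sample set are chosen for illustration only.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One character from each UTF-8 length class (1, 2, 3, and 4 bytes).
samples = ["A", "\u00e9", "\u20ac", "\U000BF400"]
ratios = [utf8_bytes_per_utf16_unit(ch) for ch in samples]

# The substitution bytes EF BF BD decode to a single code point.
substitution = b"\xEF\xBF\xBD".decode("utf-8")

print(ratios)
print(f"U+{ord(substitution):04X}")
```

The four-byte sample yields a ratio of 2 because its four bytes are spread across two UTF-16 code units, and the substitution bytes decode to U+FFFD, the Unicode replacement character.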

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
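
The set above excludes only the surrogate range U+D800 to U+DFFF. As a quick check outside ICU, Python's strict UTF-8 codec applies the same restriction, so it can serve as a stand-in for the converter here. The helper `is_representable` is a hypothetical name for this sketch and says nothing about how ICU computes the set.

```python
def is_representable(code_point: int) -> bool:
    """True if Python's strict UTF-8 codec can encode this code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe both edges of the surrogate range and the top of the code space.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```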