International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
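
The alias list can be treated as a many-to-one mapping onto the internal converter name. The following is a minimal sketch of such a lookup, not ICU's own implementation: the names `UTF8_ALIASES` and `canonical_name` are hypothetical, and the matching rule (trim whitespace, ignore case) is a simplifying assumption that may be narrower than what ICU actually accepts.

```python
# Aliases copied from the list on this page.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption: aliases are compared case-insensitively and nothing more.
ALIAS_TO_CANONICAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name):
    """Return the internal converter name, or None if the alias is not listed."""
    return ALIAS_TO_CANONICAL.get(name.strip().lower())


if __name__ == "__main__":
    for candidate in ("IBM-1208", "cp1208", "latin1"):
        print(candidate, "->", canonical_name(candidate))
```

Any alias not in the table, such as `latin1` here, simply returns `None` rather than guessing a converter.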


Codepage Layout

Currently showing the section of the codepage for byte sequences that start with the bytes F2 95 90

Rows give the high nibble and columns the low nibble of the final byte. Each cell lists the Unicode code point (hexadecimal) for that byte sequence.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70 (empty)
80    95400 95401 95402 95403 95404 95405 95406 95407 95408 95409 9540A 9540B 9540C 9540D 9540E 9540F
90    95410 95411 95412 95413 95414 95415 95416 95417 95418 95419 9541A 9541B 9541C 9541D 9541E 9541F
A0    95420 95421 95422 95423 95424 95425 95426 95427 95428 95429 9542A 9542B 9542C 9542D 9542E 9542F
B0    95430 95431 95432 95433 95434 95435 95436 95437 95438 95439 9543A 9543B 9543C 9543D 9543E 9543F
C0-F0 (empty)
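
The code points in the layout follow from the standard UTF-8 bit layout for four-byte sequences: 3 payload bits from the lead byte and 6 from each of the three trail bytes. The sketch below recomputes the populated rows from the prefix F2 95 90. The function names are hypothetical, and the decoder does no validation (it assumes a well-formed four-byte sequence), so it is an illustration rather than a substitute for a real converter.

```python
def decode_four_byte(b0, b1, b2, b3):
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point.

    Assumes the input is well formed; no range or overlong checks are done.
    """
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def layout_row(row_start):
    """Code points for final bytes row_start+0x0 .. row_start+0xF after F2 95 90."""
    return [decode_four_byte(0xF2, 0x95, 0x90, row_start + col) for col in range(16)]


if __name__ == "__main__":
    # Only trail bytes 80..BF are valid, which is why the other rows are empty.
    for row_start in (0x80, 0x90, 0xA0, 0xB0):
        cells = " ".join(f"{cp:05X}" for cp in layout_row(row_start))
        print(f"{row_start:02X}  {cells}")

    # Cross-check one cell against Python's built-in UTF-8 codec.
    assert ord(bytes([0xF2, 0x95, 0x90, 0x80]).decode("utf-8")) == 0x95400
```

Final bytes outside 80 through BF are not valid UTF-8 trail bytes, which is consistent with the empty rows in the layout.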

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
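
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. A supplementary character takes four UTF-8 bytes but also two UChars, so its ratio is 2, and the maximum of 3 comes from the upper part of the Basic Multilingual Plane. The sketch below checks this with Python's own codecs and confirms that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD. The helper name `utf8_bytes_per_uchar` is hypothetical, and the code only illustrates the arithmetic; it does not call ICU.

```python
def utf8_bytes_per_uchar(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_count = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / uchar_count


# One sample per UTF-8 length: ASCII, 2-byte, 3-byte, and a 4-byte
# supplementary character taken from the codepage layout on this page.
SAMPLES = ["A", "\u00e9", "\u20ac", "\U00095400"]

# The substitution character listed above, as bytes.
SUBSTITUTION = "\ufffd".encode("utf-8")

if __name__ == "__main__":
    for ch in SAMPLES:
        print(f"U+{ord(ch):04X}: {utf8_bytes_per_uchar(ch)} bytes per UChar")
    print("U+FFFD in UTF-8:", SUBSTITUTION.hex(" ").upper())
```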

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
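
The set pattern `[^\uD800-\uDFFF]` reads as every code point except the surrogate range U+D800 through U+DFFF. A minimal sketch of that membership test follows, with a cross-check against Python's strict UTF-8 encoder, which also refuses lone surrogates. The function names are hypothetical, and the upper bound U+10FFFF is the general Unicode limit rather than something stated in the pattern itself.

```python
def is_representable(cp):
    """True if cp is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def try_encode(cp):
    """Encode one code point as UTF-8, or return None if the codec rejects it."""
    try:
        return chr(cp).encode("utf-8")
    except UnicodeEncodeError:
        return None


# Size of the set: all 0x110000 code points minus the 0x800 surrogates.
REPRESENTABLE_COUNT = 0x110000 - 0x800

if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000):
        print(f"U+{cp:04X}", is_representable(cp), try_encode(cp))
    print("representable code points:", REPRESENTABLE_COUNT)
```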