International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A591

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󥑀
E5440
󥑁
E5441
󥑂
E5442
󥑃
E5443
󥑄
E5444
󥑅
E5445
󥑆
E5446
󥑇
E5447
󥑈
E5448
󥑉
E5449
󥑊
E544A
󥑋
E544B
󥑌
E544C
󥑍
E544D
󥑎
E544E
󥑏
E544F
80
90
󥑐
E5450
󥑑
E5451
󥑒
E5452
󥑓
E5453
󥑔
E5454
󥑕
E5455
󥑖
E5456
󥑗
E5457
󥑘
E5458
󥑙
E5459
󥑚
E545A
󥑛
E545B
󥑜
E545C
󥑝
E545D
󥑞
E545E
󥑟
E545F
90
A0
󥑠
E5460
󥑡
E5461
󥑢
E5462
󥑣
E5463
󥑤
E5464
󥑥
E5465
󥑦
E5466
󥑧
E5467
󥑨
E5468
󥑩
E5469
󥑪
E546A
󥑫
E546B
󥑬
E546C
󥑭
E546D
󥑮
E546E
󥑯
E546F
A0
B0
󥑰
E5470
󥑱
E5471
󥑲
E5472
󥑳
E5473
󥑴
E5474
󥑵
E5475
󥑶
E5476
󥑷
E5477
󥑸
E5478
󥑹
E5479
󥑺
E547A
󥑻
E547B
󥑼
E547C
󥑽
E547D
󥑾
E547E
󥑿
E547F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]