International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28F92

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򏒀
8F480
򏒁
8F481
򏒂
8F482
򏒃
8F483
򏒄
8F484
򏒅
8F485
򏒆
8F486
򏒇
8F487
򏒈
8F488
򏒉
8F489
򏒊
8F48A
򏒋
8F48B
򏒌
8F48C
򏒍
8F48D
򏒎
8F48E
򏒏
8F48F
80
90
򏒐
8F490
򏒑
8F491
򏒒
8F492
򏒓
8F493
򏒔
8F494
򏒕
8F495
򏒖
8F496
򏒗
8F497
򏒘
8F498
򏒙
8F499
򏒚
8F49A
򏒛
8F49B
򏒜
8F49C
򏒝
8F49D
򏒞
8F49E
򏒟
8F49F
90
A0
򏒠
8F4A0
򏒡
8F4A1
򏒢
8F4A2
򏒣
8F4A3
򏒤
8F4A4
򏒥
8F4A5
򏒦
8F4A6
򏒧
8F4A7
򏒨
8F4A8
򏒩
8F4A9
򏒪
8F4AA
򏒫
8F4AB
򏒬
8F4AC
򏒭
8F4AD
򏒮
8F4AE
򏒯
8F4AF
A0
B0
򏒰
8F4B0
򏒱
8F4B1
򏒲
8F4B2
򏒳
8F4B3
򏒴
8F4B4
򏒵
8F4B5
򏒶
8F4B6
򏒷
8F4B7
򏒸
8F4B8
򏒹
8F4B9
򏒺
8F4BA
򏒻
8F4BB
򏒼
8F4BC
򏒽
8F4BD
򏒾
8F4BE
򏒿
8F4BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]