International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name   All Aliases
UTF-8                     UTF-8
                          ibm-1208
                          ibm-1209
                          ibm-5304
                          ibm-5305
                          ibm-13496
                          ibm-13497
                          ibm-17592
                          ibm-17593
                          windows-65001
                          cp1208
                          x-UTF_8J
                          unicode-1-1-utf-8
                          unicode-2-0-utf-8
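
As an illustration of how an alias table like this one can be used, the sketch below maps each listed alias back to the internal converter name. The names `alias_key` and `canonical_name` are hypothetical, and the matching rule (ignore case, '-', '_' and spaces) is a simplifying assumption; ICU's own name comparison may differ in detail.

```python
# Aliases copied from the list above; everything else here is illustrative.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def alias_key(name: str) -> str:
    """Fold case and drop '-', '_' and spaces (assumed, simplified rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every alias in the list resolves to the same internal converter name.
ALIAS_TO_CANONICAL = {alias_key(a): "UTF-8" for a in UTF8_ALIASES}


def canonical_name(alias: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(alias_key(alias))


print(canonical_name("IBM_1208"))  # UTF-8
print(canonical_name("latin-1"))   # None
```

A dictionary keyed on the folded name keeps the lookup constant-time and makes spelling variants such as "ibm-1208" and "IBM_1208" resolve identically.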


Codepage Layout

Currently showing the codepage starting with the bytes F2BD92

Rows are labeled with the high nibble of the final byte and columns with the low nibble. Each cell shows the character followed by its code point in hexadecimal.

    00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
00-70  (empty)
80  򽒀 BD480  򽒁 BD481  򽒂 BD482  򽒃 BD483  򽒄 BD484  򽒅 BD485  򽒆 BD486  򽒇 BD487  򽒈 BD488  򽒉 BD489  򽒊 BD48A  򽒋 BD48B  򽒌 BD48C  򽒍 BD48D  򽒎 BD48E  򽒏 BD48F
90  򽒐 BD490  򽒑 BD491  򽒒 BD492  򽒓 BD493  򽒔 BD494  򽒕 BD495  򽒖 BD496  򽒗 BD497  򽒘 BD498  򽒙 BD499  򽒚 BD49A  򽒛 BD49B  򽒜 BD49C  򽒝 BD49D  򽒞 BD49E  򽒟 BD49F
A0  򽒠 BD4A0  򽒡 BD4A1  򽒢 BD4A2  򽒣 BD4A3  򽒤 BD4A4  򽒥 BD4A5  򽒦 BD4A6  򽒧 BD4A7  򽒨 BD4A8  򽒩 BD4A9  򽒪 BD4AA  򽒫 BD4AB  򽒬 BD4AC  򽒭 BD4AD  򽒮 BD4AE  򽒯 BD4AF
B0  򽒰 BD4B0  򽒱 BD4B1  򽒲 BD4B2  򽒳 BD4B3  򽒴 BD4B4  򽒵 BD4B5  򽒶 BD4B6  򽒷 BD4B7  򽒸 BD4B8  򽒹 BD4B9  򽒺 BD4BA  򽒻 BD4BB  򽒼 BD4BC  򽒽 BD4BD  򽒾 BD4BE  򽒿 BD4BF
C0-F0  (empty)
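
The code points in this layout follow from the standard UTF-8 bit layout for four-byte sequences. The sketch below recomputes them from the prefix F2 BD 92 and cross-checks the result against Python's built-in UTF-8 decoder; the function names are hypothetical and this is not how the Converter Explorer itself is implemented.

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence into a code point."""
    return (
        ((b0 & 0x07) << 18)    # lead byte 11110xxx carries 3 bits
        | ((b1 & 0x3F) << 12)  # each trail byte 10xxxxxx carries 6 bits
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def page_code_points(prefix: bytes) -> list:
    """Code points for every trail byte 0x80-0xBF after a three-byte prefix."""
    return [decode_four_byte(*prefix, trail) for trail in range(0x80, 0xC0)]


page = page_code_points(bytes.fromhex("F2BD92"))
print(f"{page[0]:X} .. {page[-1]:X}")  # BD480 .. BD4BF

# Cross-check each cell against Python's own UTF-8 decoder.
for trail, cp in zip(range(0x80, 0xC0), page):
    assert bytes([0xF2, 0xBD, 0x92, trail]).decode("utf-8") == chr(cp)
```

Only final bytes 80 through BF appear because a UTF-8 trail byte always has the form 10xxxxxx, which is why the other rows of the layout are empty.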

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
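
The byte counts and the substitution character listed here can be checked with a short script. The sketch below treats a UChar as one UTF-16 code unit and uses Python's standard codecs rather than ICU, so it only illustrates the numbers; `bytes_per_uchar` is a hypothetical helper and the sample characters are arbitrary.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units (UChars) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_len = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / uchar_len


samples = {
    "A": "U+0041, 1 byte, 1 UChar",
    "\u00e9": "U+00E9, 2 bytes, 1 UChar",
    "\uffff": "U+FFFF, 3 bytes, 1 UChar",
    "\U000bd480": "U+BD480, 4 bytes, 2 UChars",
}
for ch, note in samples.items():
    print(note, "->", bytes_per_uchar(ch))

# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"

# Python's decoder (not ICU) also substitutes U+FFFD for an invalid byte.
print(b"\xff".decode("utf-8", errors="replace") == "\ufffd")  # True
```

A supplementary character needs four bytes but occupies two UChars, so the per-UChar maximum of 3 is reached by characters at the top of the Basic Multilingual Plane.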

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
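
The set above excludes only the surrogate range. As a rough check, the sketch below expresses that rule as a predicate and compares it with what Python's strict UTF-8 encoder accepts; both function names are hypothetical, and Python's codec stands in for the ICU converter here.

```python
def is_representable(cp: int) -> bool:
    """True for any Unicode code point outside the surrogate range D800-DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """Whether Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values around the surrogate block, plus one cell from the layout.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xBD480, 0x10FFFF):
    assert is_representable(cp) == python_can_encode(cp)
    print(f"U+{cp:04X}", is_representable(cp))
```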