International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28192

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򁒀
81480
򁒁
81481
򁒂
81482
򁒃
81483
򁒄
81484
򁒅
81485
򁒆
81486
򁒇
81487
򁒈
81488
򁒉
81489
򁒊
8148A
򁒋
8148B
򁒌
8148C
򁒍
8148D
򁒎
8148E
򁒏
8148F
80
90
򁒐
81490
򁒑
81491
򁒒
81492
򁒓
81493
򁒔
81494
򁒕
81495
򁒖
81496
򁒗
81497
򁒘
81498
򁒙
81499
򁒚
8149A
򁒛
8149B
򁒜
8149C
򁒝
8149D
򁒞
8149E
򁒟
8149F
90
A0
򁒠
814A0
򁒡
814A1
򁒢
814A2
򁒣
814A3
򁒤
814A4
򁒥
814A5
򁒦
814A6
򁒧
814A7
򁒨
814A8
򁒩
814A9
򁒪
814AA
򁒫
814AB
򁒬
814AC
򁒭
814AD
򁒮
814AE
򁒯
814AF
A0
B0
򁒰
814B0
򁒱
814B1
򁒲
814B2
򁒳
814B3
򁒴
814B4
򁒵
814B5
򁒶
814B6
򁒷
814B7
򁒸
814B8
򁒹
814B9
򁒺
814BA
򁒻
814BB
򁒼
814BC
򁒽
814BD
򁒾
814BE
򁒿
814BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]