International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
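
Resolving an alias from the list above to the internal converter name can be sketched in Python. The `UTF8_ALIASES` table and the `canonical_name` helper are hypothetical names introduced for illustration, and the case-insensitive comparison is an assumption, not a statement of ICU's actual matching rules.

```python
from typing import Optional

# Aliases copied from the list above; all of them name the "UTF-8" converter.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Hypothetical lookup table keyed by lowercased alias (assumed matching rule).
ALIAS_TABLE = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(name.lower())


print(canonical_name("IBM-1208"))
print(canonical_name("no-such-alias"))
```

A real application would ask the converter library itself rather than keep a private copy of the table, since alias lists can change between releases.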


Codepage Layout

Currently showing the codepage starting with the bytes F0 9C 92

Row labels give the high nibble of the final byte and column labels the low nibble. Each cell shows the character followed by its code point. Rows 00 through 70 and C0 through F0 are empty.

      00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
80    𜒀 1C480  𜒁 1C481  𜒂 1C482  𜒃 1C483  𜒄 1C484  𜒅 1C485  𜒆 1C486  𜒇 1C487  𜒈 1C488  𜒉 1C489  𜒊 1C48A  𜒋 1C48B  𜒌 1C48C  𜒍 1C48D  𜒎 1C48E  𜒏 1C48F
90    𜒐 1C490  𜒑 1C491  𜒒 1C492  𜒓 1C493  𜒔 1C494  𜒕 1C495  𜒖 1C496  𜒗 1C497  𜒘 1C498  𜒙 1C499  𜒚 1C49A  𜒛 1C49B  𜒜 1C49C  𜒝 1C49D  𜒞 1C49E  𜒟 1C49F
A0    𜒠 1C4A0  𜒡 1C4A1  𜒢 1C4A2  𜒣 1C4A3  𜒤 1C4A4  𜒥 1C4A5  𜒦 1C4A6  𜒧 1C4A7  𜒨 1C4A8  𜒩 1C4A9  𜒪 1C4AA  𜒫 1C4AB  𜒬 1C4AC  𜒭 1C4AD  𜒮 1C4AE  𜒯 1C4AF
B0    𜒰 1C4B0  𜒱 1C4B1  𜒲 1C4B2  𜒳 1C4B3  𜒴 1C4B4  𜒵 1C4B5  𜒶 1C4B6  𜒷 1C4B7  𜒸 1C4B8  𜒹 1C4B9  𜒺 1C4BA  𜒻 1C4BB  𜒼 1C4BC  𜒽 1C4BD  𜒾 1C4BE  𜒿 1C4BF
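
The mapping from the lead bytes F0 9C 92 plus a final byte to the code points in the layout above follows from the UTF-8 bit layout for 4-byte sequences. The sketch below decodes by hand and cross-checks against Python's built-in codec. The helper name `decode_utf8_4byte` is hypothetical, and the function deliberately omits overlong-form and range checks that a full decoder would need.

```python
LEAD = bytes.fromhex("F09C92")  # lead bytes shown for this layout


def decode_utf8_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point (no overlong/range checks)."""
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("not a 4-byte UTF-8 sequence")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("bad continuation byte")
    # 3 payload bits from the first byte, 6 from each continuation byte.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


first = decode_utf8_4byte(LEAD + b"\x80")  # top-left populated cell
last = decode_utf8_4byte(LEAD + b"\xBF")   # bottom-right populated cell
print(f"{first:X} {last:X}")  # prints "1C480 1C4BF"

# Cross-check against the built-in UTF-8 codec.
assert (LEAD + b"\x80").decode("utf-8") == chr(first)
```

Only final bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout are empty.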

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
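
Several of the properties above can be checked with a short Python sketch. It uses Python's own UTF-8 codec, not the ICU converter, and assumes that UChar means one 16-bit UTF-16 code unit. The helper `bytes_per_utf16_unit` and the sample characters are chosen for illustration.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"  # substitution character bytes listed above


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 sequence length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00E9", "\u20AC", "\U0001C480"]
ratios = [bytes_per_utf16_unit(c) for c in samples]
print(min(ratios), max(ratios))  # prints "1.0 3.0"

# The substitution bytes are the UTF-8 form of U+FFFD.
print(SUBSTITUTION.decode("utf-8") == "\uFFFD")  # prints "True"

# ASCII compatibility: bytes 0x20-0x7E decode to the same code points.
ascii_bytes = bytes(range(0x20, 0x7F))
print(ascii_bytes.decode("utf-8") == "".join(map(chr, range(0x20, 0x7F))))  # prints "True"
```

A 4-byte sequence encodes a character that takes two UTF-16 code units, so its ratio is 2. That is why the maximum per code unit is 3 even though UTF-8 sequences can be 4 bytes long.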

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
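
The set above excludes only the surrogate range U+D800 through U+DFFF. A minimal sketch of the same rule, using Python's strict UTF-8 codec as a stand-in for the ICU converter (an assumption), with the hypothetical helper `is_representable`:

```python
def is_representable(code_point: int) -> bool:
    """True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates, U+D800 through U+DFFF.
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x1C480, 0x10FFFF):
    print(f"U+{cp:04X} {is_representable(cp)}")
```

The code points just outside the surrogate range, U+D7FF and U+E000, encode normally, as does every other value up to U+10FFFF.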