International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0AD92

Rows give the high nibble of the final byte, columns the low nibble. Each populated cell shows the character above its Unicode code point (hex).

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     𭒀    𭒁    𭒂    𭒃    𭒄    𭒅    𭒆    𭒇    𭒈    𭒉    𭒊    𭒋    𭒌    𭒍    𭒎    𭒏
       2D480 2D481 2D482 2D483 2D484 2D485 2D486 2D487 2D488 2D489 2D48A 2D48B 2D48C 2D48D 2D48E 2D48F
90     𭒐    𭒑    𭒒    𭒓    𭒔    𭒕    𭒖    𭒗    𭒘    𭒙    𭒚    𭒛    𭒜    𭒝    𭒞    𭒟
       2D490 2D491 2D492 2D493 2D494 2D495 2D496 2D497 2D498 2D499 2D49A 2D49B 2D49C 2D49D 2D49E 2D49F
A0     𭒠    𭒡    𭒢    𭒣    𭒤    𭒥    𭒦    𭒧    𭒨    𭒩    𭒪    𭒫    𭒬    𭒭    𭒮    𭒯
       2D4A0 2D4A1 2D4A2 2D4A3 2D4A4 2D4A5 2D4A6 2D4A7 2D4A8 2D4A9 2D4AA 2D4AB 2D4AC 2D4AD 2D4AE 2D4AF
B0     𭒰    𭒱    𭒲    𭒳    𭒴    𭒵    𭒶    𭒷    𭒸    𭒹    𭒺    𭒻    𭒼    𭒽    𭒾    𭒿
       2D4B0 2D4B1 2D4B2 2D4B3 2D4B4 2D4B5 2D4B6 2D4B7 2D4B8 2D4B9 2D4BA 2D4BB 2D4BC 2D4BD 2D4BE 2D4BF
C0-F0  (empty)
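
The layout pairs each final byte 80 through BF, following the lead bytes F0 AD 92, with one code point. The sketch below shows where those numbers come from by doing the UTF-8 bit arithmetic by hand. It is an illustration in Python, not ICU code: the names `decode_utf8_4byte` and `layout_rows` are hypothetical, and the decoder only checks the byte patterns of a 4-byte sequence (it does not reject overlong forms or values above 10FFFF).

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point by bit arithmetic."""
    # Lead byte must look like 11110xxx, each trail byte like 10xxxxxx.
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("not a 4-byte UTF-8 lead byte pattern")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("trail byte outside 80..BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )

# The three lead bytes named on this page.
PREFIX = bytes.fromhex("F0AD92")

def layout_rows(prefix: bytes = PREFIX) -> dict:
    """Group code points by the high nibble of the final byte (the row label)."""
    rows = {}
    for trail in range(0x80, 0xC0):  # only 80..BF are valid trail bytes
        cp = decode_utf8_4byte(prefix + bytes([trail]))
        rows.setdefault(trail & 0xF0, []).append(cp)
    return rows

rows = layout_rows()
for label, cps in rows.items():
    print(f"{label:02X}", " ".join(f"{cp:X}" for cp in cps))
```

Only rows 80, 90, A0 and B0 are filled because a UTF-8 trail byte always has the form 10xxxxxx; every other value in the final position is not a valid continuation of F0 AD 92.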

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
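
The byte counts above are per UChar, which in ICU is a UTF-16 code unit rather than a whole character. That is why the maximum is 3 even though the characters in the layout take 4 bytes each: they occupy two UTF-16 code units. The following Python sketch checks the arithmetic with the standard codecs only; the helper name `utf8_bytes_per_utf16_unit` is hypothetical and the sample characters are chosen for illustration.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count of one character divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units

# One sample for each UTF-8 length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U0002D480"]
ratios = [utf8_bytes_per_utf16_unit(c) for c in samples]
print(ratios)

# The substitution bytes listed above are the UTF-8 form of U+FFFD.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")
```

The 4-byte sample yields 2 bytes per code unit, so the 3-byte BMP characters set the maximum.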


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
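
The pattern is a negated range: every Unicode code point except the surrogates D800 through DFFF. A minimal way to probe that boundary, assuming Python's strict `utf-8` codec as a stand-in for the ICU converter (it also refuses surrogates), is sketched below; `is_representable` is a hypothetical helper name.

```python
def is_representable(cp: int) -> bool:
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    if not 0 <= cp <= 0x10FFFF:
        return False  # outside the Unicode code space
    try:
        chr(cp).encode("utf-8")
    except UnicodeEncodeError:
        return False  # lone surrogates are rejected
    return True

# Probe both edges of the excluded range and one character from the layout.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x2D480):
    print(f"U+{cp:04X}", is_representable(cp))

# Count the excluded code points in a window around the surrogate block.
excluded = sum(not is_representable(cp) for cp in range(0xD000, 0xF000))
print(excluded)
```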