International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
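
The alias list can be read as a lookup table from any alias to the internal converter name. A minimal sketch in Python, assuming a simplified matching rule that ignores case, hyphens, underscores, and spaces. The names `normalize`, `ALIAS_TABLE`, and `canonical_name` are hypothetical, and ICU's own matching may differ in detail.

```python
# Aliases copied from the list above; the matching rule is a simplifying assumption.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (hypothetical rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))
```

With this rule, `canonical_name("IBM_1208")` and `canonical_name("utf8")` both resolve to `UTF-8`, while a name outside the list returns `None`.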


Codepage Layout

The layout below shows the four-byte sequences that start with the bytes F3 9C 92.

Rows give the high nibble of the final byte and columns give its low nibble. Each cell is the code point (hex) encoded by F3 9C 92 followed by that byte.

       0     1     2     3     4     5     6     7     8     9     A     B     C     D     E     F
00-70  (empty)
80     DC480 DC481 DC482 DC483 DC484 DC485 DC486 DC487 DC488 DC489 DC48A DC48B DC48C DC48D DC48E DC48F
90     DC490 DC491 DC492 DC493 DC494 DC495 DC496 DC497 DC498 DC499 DC49A DC49B DC49C DC49D DC49E DC49F
A0     DC4A0 DC4A1 DC4A2 DC4A3 DC4A4 DC4A5 DC4A6 DC4A7 DC4A8 DC4A9 DC4AA DC4AB DC4AC DC4AD DC4AE DC4AF
B0     DC4B0 DC4B1 DC4B2 DC4B3 DC4B4 DC4B5 DC4B6 DC4B7 DC4B8 DC4B9 DC4BA DC4BB DC4BC DC4BD DC4BE DC4BF
C0-F0  (empty)
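
The code points in the layout follow from the UTF-8 bit layout for four-byte sequences: the lead byte contributes 3 payload bits and each trail byte contributes 6. Only final bytes 80 through BF are valid trail bytes, which is why the other rows are empty. The sketch below is an illustrative Python decoder, not ICU's implementation, and `decode_four_byte` is a hypothetical helper name.

```python
def decode_four_byte(seq: bytes) -> int:
    """Decode one four-byte UTF-8 sequence into a code point.

    Only the bit patterns are checked; overlong forms and the
    U+10FFFF upper limit are not validated in this sketch.
    """
    if len(seq) != 4 or seq[0] >> 3 != 0b11110:
        raise ValueError("expected a four-byte sequence with lead byte 11110xxx")
    if any(b >> 6 != 0b10 for b in seq[1:]):
        raise ValueError("trail bytes must have the form 10xxxxxx")
    return (
        ((seq[0] & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((seq[1] & 0x3F) << 12)  # 6 payload bits from each trail byte
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


prefix = bytes.fromhex("F39C92")
first = decode_four_byte(prefix + b"\x80")
last = decode_four_byte(prefix + b"\xBF")
print(f"{first:X} {last:X}")  # DC480 DC4BF
```

Python's built-in codec agrees: `(prefix + b"\x80").decode("utf-8")` yields the single character `chr(0xDC480)`.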

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
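
The byte counts are stated per UChar, which in ICU is a 16-bit UTF-16 code unit. One plausible reading of the maximum of 3 is that a four-byte UTF-8 sequence corresponds to two UChars (a surrogate pair), so it averages 2 bytes per UChar, while a BMP character can need 3 bytes for a single UChar. The Python sketch below illustrates that reading and shows that the substitution bytes are the UTF-8 form of U+FFFD. The helper names are hypothetical.

```python
SUBSTITUTION_BYTES = b"\xEF\xBF\xBD"  # value copied from the table above


def utf16_units(ch: str) -> int:
    """Number of 16-bit code units (UChars) needed for one character."""
    return len(ch.encode("utf-16-le")) // 2


def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the UTF-16 code unit count."""
    return len(ch.encode("utf-8")) / utf16_units(ch)


# One sample per UTF-8 sequence length (1, 2, 3 and 4 bytes).
samples = ["A", "\u00E9", "\u20AC", "\U000DC480"]
for ch in samples:
    print(
        f"U+{ord(ch):04X}: {len(ch.encode('utf-8'))} byte(s), "
        f"{utf16_units(ch)} UChar(s), ratio {bytes_per_uchar(ch)}"
    )

# The substitution character decodes to U+FFFD REPLACEMENT CHARACTER.
print(f"U+{ord(SUBSTITUTION_BYTES.decode('utf-8')):04X}")
```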

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
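
The set above excludes only the surrogate range D800 through DFFF; every other code point up to U+10FFFF is representable. A small Python sketch of that membership test, cross-checked against Python's own strict UTF-8 codec (used here as a stand-in, not ICU). The function names are hypothetical.

```python
def representable_in_utf8(code_point: int) -> bool:
    """True if the code point is a Unicode scalar value, i.e. not a surrogate."""
    in_unicode = 0 <= code_point <= 0x10FFFF
    is_surrogate = 0xD800 <= code_point <= 0xDFFF
    return in_unicode and not is_surrogate


def python_can_encode(code_point: int) -> bool:
    """Ask Python's strict UTF-8 codec whether it accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values on both sides of the surrogate range.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {representable_in_utf8(cp)}")
```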