International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38B96

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󋖀
CB580
󋖁
CB581
󋖂
CB582
󋖃
CB583
󋖄
CB584
󋖅
CB585
󋖆
CB586
󋖇
CB587
󋖈
CB588
󋖉
CB589
󋖊
CB58A
󋖋
CB58B
󋖌
CB58C
󋖍
CB58D
󋖎
CB58E
󋖏
CB58F
80
90
󋖐
CB590
󋖑
CB591
󋖒
CB592
󋖓
CB593
󋖔
CB594
󋖕
CB595
󋖖
CB596
󋖗
CB597
󋖘
CB598
󋖙
CB599
󋖚
CB59A
󋖛
CB59B
󋖜
CB59C
󋖝
CB59D
󋖞
CB59E
󋖟
CB59F
90
A0
󋖠
CB5A0
󋖡
CB5A1
󋖢
CB5A2
󋖣
CB5A3
󋖤
CB5A4
󋖥
CB5A5
󋖦
CB5A6
󋖧
CB5A7
󋖨
CB5A8
󋖩
CB5A9
󋖪
CB5AA
󋖫
CB5AB
󋖬
CB5AC
󋖭
CB5AD
󋖮
CB5AE
󋖯
CB5AF
A0
B0
󋖰
CB5B0
󋖱
CB5B1
󋖲
CB5B2
󋖳
CB5B3
󋖴
CB5B4
󋖵
CB5B5
󋖶
CB5B6
󋖷
CB5B7
󋖸
CB5B8
󋖹
CB5B9
󋖺
CB5BA
󋖻
CB5BB
󋖼
CB5BC
󋖽
CB5BD
󋖾
CB5BE
󋖿
CB5BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]