International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
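
A minimal sketch of how an alias list like this one could be used to resolve a requested name to the internal converter name. The alias strings are copied from the list above. The matching rule (ignore case, hyphens, underscores, and spaces) and the names `normalize`, `ALIAS_INDEX`, and `resolve` are assumptions made for this illustration, not ICU's actual lookup code.

```python
# Aliases copied from the list above; everything else here is hypothetical.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop hyphens, underscores, and spaces.

    This loose matching rule is an assumption of the sketch.
    """
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_INDEX = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_INDEX.get(normalize(name))


print(resolve("IBM_1208"))       # an alias from the list, written differently
print(resolve("windows-65001"))  # an alias from the list, written verbatim
print(resolve("latin-1"))        # not a UTF-8 alias
```

Under these assumptions, any spelling that differs from a listed alias only in case or separators resolves to `UTF-8`, and anything else resolves to `None`.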


Codepage Layout

Currently showing the codepage starting with the bytes F3 8C 96. Each cell shows the Unicode code point of the character encoded by those bytes followed by the trail byte (row + column). Rows 00 through 70 and C0 through F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  CC580 CC581 CC582 CC583 CC584 CC585 CC586 CC587 CC588 CC589 CC58A CC58B CC58C CC58D CC58E CC58F
90  CC590 CC591 CC592 CC593 CC594 CC595 CC596 CC597 CC598 CC599 CC59A CC59B CC59C CC59D CC59E CC59F
A0  CC5A0 CC5A1 CC5A2 CC5A3 CC5A4 CC5A5 CC5A6 CC5A7 CC5A8 CC5A9 CC5AA CC5AB CC5AC CC5AD CC5AE CC5AF
B0  CC5B0 CC5B1 CC5B2 CC5B3 CC5B4 CC5B5 CC5B6 CC5B7 CC5B8 CC5B9 CC5BA CC5BB CC5BC CC5BD CC5BE CC5BF
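
The layout above pairs the lead bytes F3 8C 96 with each trail byte from 80 through BF. As a rough sketch, the same grid can be reproduced with Python's built-in UTF-8 codec (used here as a stand-in for the ICU converter) and cross-checked with the bit arithmetic for a four-byte sequence. The names `LEAD`, `decode_cell`, and `manual_decode` are made up for this example.

```python
# Lead bytes taken from the page: the codepage "starting with the bytes F38C96".
LEAD = bytes([0xF3, 0x8C, 0x96])


def decode_cell(trail: int) -> int:
    """Decode LEAD plus one trail byte with the codec and return the code point."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def manual_decode(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence by hand.

    The lead byte contributes its low 3 bits and each of the three
    continuation bytes contributes its low 6 bits.
    """
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


# Print the four populated rows (80, 90, A0, B0), 16 cells per row.
for row in range(0x80, 0xC0, 0x10):
    cells = " ".join(f"{decode_cell(row + col):05X}" for col in range(16))
    print(f"{row:02X}  {cells}")
```

Only the rows 80 through B0 are populated because a UTF-8 continuation byte must have the bit pattern 10xxxxxx, which is exactly the range 80 through BF.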

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
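
The byte counts above are stated per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though a single UTF-8 sequence can be 4 bytes long: a 4-byte sequence corresponds to two UTF-16 code units. The sketch below illustrates this with Python's built-in codecs rather than ICU, so it only mirrors the reported values; the helper names are invented for the example.

```python
def utf16_units(ch: str) -> int:
    """Number of UTF-16 code units (UChars) needed for one character."""
    return len(ch.encode("utf-16-le")) // 2


def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for one character."""
    return len(ch.encode("utf-8")) / utf16_units(ch)


# Sample characters: ASCII, a 2-byte character, U+FFFD, and a supplementary
# code point taken from the codepage layout on this page.
for ch in ["A", "\u00E9", "\uFFFD", "\U000CC580"]:
    print(f"U+{ord(ch):04X}: {len(ch.encode('utf-8'))} bytes, "
          f"{utf16_units(ch)} UChar(s), ratio {bytes_per_uchar(ch)}")

# The substitution character \xEF\xBF\xBD is the UTF-8 form of U+FFFD.
SUBSTITUTION = "\uFFFD".encode("utf-8")
print(SUBSTITUTION)
```

In this sketch the ratio reaches its maximum of 3 for characters in the range U+0800 through U+FFFF, and drops back to 2 for supplementary code points.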

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
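
The set above excludes only the surrogate range U+D800 through U+DFFF. As a small illustration, Python's strict UTF-8 codec applies the same restriction, so it can serve as a stand-in for checking whether a code point is representable. The function name `is_representable` is hypothetical and not part of ICU.

```python
def is_representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as UTF-8.

    Python's strict UTF-8 encoder rejects lone surrogates, which matches
    the set [^\\uD800-\\uDFFF] shown on this page.
    """
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the boundaries of the surrogate range and a few other code points.
for cp in [0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xCC580, 0x10FFFF]:
    print(f"U+{cp:04X}: {is_representable(cp)}")
```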