International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Currently showing the part of the codepage whose byte sequences start with the bytes F0 91 AE

Each row label gives the high nibble of the final byte and each column label gives its low nibble. Every cell shows the character with its code point (hex) on the line below.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80     𑮀      𑮁      𑮂      𑮃      𑮄      𑮅      𑮆      𑮇      𑮈      𑮉      𑮊      𑮋      𑮌      𑮍      𑮎      𑮏
       11B80  11B81  11B82  11B83  11B84  11B85  11B86  11B87  11B88  11B89  11B8A  11B8B  11B8C  11B8D  11B8E  11B8F
90     𑮐      𑮑      𑮒      𑮓      𑮔      𑮕      𑮖      𑮗      𑮘      𑮙      𑮚      𑮛      𑮜      𑮝      𑮞      𑮟
       11B90  11B91  11B92  11B93  11B94  11B95  11B96  11B97  11B98  11B99  11B9A  11B9B  11B9C  11B9D  11B9E  11B9F
A0     𑮠      𑮡      𑮢      𑮣      𑮤      𑮥      𑮦      𑮧      𑮨      𑮩      𑮪      𑮫      𑮬      𑮭      𑮮      𑮯
       11BA0  11BA1  11BA2  11BA3  11BA4  11BA5  11BA6  11BA7  11BA8  11BA9  11BAA  11BAB  11BAC  11BAD  11BAE  11BAF
B0     𑮰      𑮱      𑮲      𑮳      𑮴      𑮵      𑮶      𑮷      𑮸      𑮹      𑮺      𑮻      𑮼      𑮽      𑮾      𑮿
       11BB0  11BB1  11BB2  11BB3  11BB4  11BB5  11BB6  11BB7  11BB8  11BB9  11BBA  11BBB  11BBC  11BBD  11BBE  11BBF
C0-F0  (empty)
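
The relationship between the byte prefix F0 91 AE and the code points 11B80 to 11BBF in the layout above can be checked by hand. The following is a minimal Python sketch, not ICU code: the helper name `decode_utf8_4byte` is hypothetical, and it assumes its input is already a well-formed four-byte UTF-8 sequence (it performs no validation).

```python
def decode_utf8_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence.

    The lead byte (11110xxx) carries 3 payload bits and each of the
    three trail bytes (10xxxxxx) carries 6, for 21 bits in total.
    No validation is done; the input is assumed to be well formed.
    """
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


# The three-byte prefix shown on this page.
prefix = bytes.fromhex("F091AE")

# Trail bytes range over 80..BF, which is why only rows 80 to B0 are filled.
first = decode_utf8_4byte(*prefix, 0x80)
last = decode_utf8_4byte(*prefix, 0xBF)
print(f"{first:X} {last:X}")  # 11B80 11BBF
```

Because a trail byte only contributes its low six bits, one three-byte prefix always spans exactly 64 consecutive code points.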

Information About This Converter

Type of converter                      UCNV_UTF8
Minimum number of bytes per UChar      1
Maximum number of bytes per UChar      3
Substitution character                 \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?       TRUE
Is ASCII [\u0020-\u007E] ambiguous?    FALSE
Contains ambiguous aliases?            FALSE
Always generates Unicode NFC?          UNKNOWN
Contains BiDi characters?              TRUE
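
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, so a supplementary character that takes four UTF-8 bytes still averages two bytes per UChar. The sketch below illustrates this with Python's built-in codecs rather than ICU itself; the helper name `utf8_bytes_per_uchar` is hypothetical and exists only for this illustration.

```python
def utf8_bytes_per_uchar(text: str) -> float:
    """Return UTF-8 bytes divided by UTF-16 code units for `text`."""
    utf8_len = len(text.encode("utf-8"))
    # Each UTF-16 code unit (one UChar) is two bytes in UTF-16BE.
    uchar_count = len(text.encode("utf-16-be")) // 2
    return utf8_len / uchar_count


# One-, two-, and three-byte BMP characters, then a supplementary one.
for sample in ("A", "\u00E9", "\u20AC", "\U00011B80"):
    ratio = utf8_bytes_per_uchar(sample)
    print(f"U+{ord(sample):04X}: {ratio} bytes per UChar")

# The listed substitution bytes are the UTF-8 form of U+FFFD.
SUBSTITUTION_BYTES = b"\xEF\xBF\xBD"
print(SUBSTITUTION_BYTES.decode("utf-8") == "\uFFFD")  # True
```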


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
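
The set above excludes only the surrogate range U+D800 to U+DFFF. A minimal sketch of that membership test follows, cross-checked against Python's strict UTF-8 encoder rather than ICU; the function names are hypothetical.

```python
def in_representable_set(code_point: int) -> bool:
    """Membership test for [^\\uD800-\\uDFFF] over valid code points."""
    return not (0xD800 <= code_point <= 0xDFFF)


def encodes_as_utf8(code_point: int) -> bool:
    """Ask Python's strict UTF-8 encoder whether it accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary cases on both sides of the surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {in_representable_set(cp)} {encodes_as_utf8(cp)}")
```

Both checks agree at every boundary printed here: lone surrogates are rejected and everything else up to U+10FFFF is accepted.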