International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F381AE

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󁮀
C1B80
󁮁
C1B81
󁮂
C1B82
󁮃
C1B83
󁮄
C1B84
󁮅
C1B85
󁮆
C1B86
󁮇
C1B87
󁮈
C1B88
󁮉
C1B89
󁮊
C1B8A
󁮋
C1B8B
󁮌
C1B8C
󁮍
C1B8D
󁮎
C1B8E
󁮏
C1B8F
80
90
󁮐
C1B90
󁮑
C1B91
󁮒
C1B92
󁮓
C1B93
󁮔
C1B94
󁮕
C1B95
󁮖
C1B96
󁮗
C1B97
󁮘
C1B98
󁮙
C1B99
󁮚
C1B9A
󁮛
C1B9B
󁮜
C1B9C
󁮝
C1B9D
󁮞
C1B9E
󁮟
C1B9F
90
A0
󁮠
C1BA0
󁮡
C1BA1
󁮢
C1BA2
󁮣
C1BA3
󁮤
C1BA4
󁮥
C1BA5
󁮦
C1BA6
󁮧
C1BA7
󁮨
C1BA8
󁮩
C1BA9
󁮪
C1BAA
󁮫
C1BAB
󁮬
C1BAC
󁮭
C1BAD
󁮮
C1BAE
󁮯
C1BAF
A0
B0
󁮰
C1BB0
󁮱
C1BB1
󁮲
C1BB2
󁮳
C1BB3
󁮴
C1BB4
󁮵
C1BB5
󁮶
C1BB6
󁮷
C1BB7
󁮸
C1BB8
󁮹
C1BB9
󁮺
C1BBA
󁮻
C1BBB
󁮼
C1BBC
󁮽
C1BBD
󁮾
C1BBE
󁮿
C1BBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]