International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F382AE

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󂮀
C2B80
󂮁
C2B81
󂮂
C2B82
󂮃
C2B83
󂮄
C2B84
󂮅
C2B85
󂮆
C2B86
󂮇
C2B87
󂮈
C2B88
󂮉
C2B89
󂮊
C2B8A
󂮋
C2B8B
󂮌
C2B8C
󂮍
C2B8D
󂮎
C2B8E
󂮏
C2B8F
80
90
󂮐
C2B90
󂮑
C2B91
󂮒
C2B92
󂮓
C2B93
󂮔
C2B94
󂮕
C2B95
󂮖
C2B96
󂮗
C2B97
󂮘
C2B98
󂮙
C2B99
󂮚
C2B9A
󂮛
C2B9B
󂮜
C2B9C
󂮝
C2B9D
󂮞
C2B9E
󂮟
C2B9F
90
A0
󂮠
C2BA0
󂮡
C2BA1
󂮢
C2BA2
󂮣
C2BA3
󂮤
C2BA4
󂮥
C2BA5
󂮦
C2BA6
󂮧
C2BA7
󂮨
C2BA8
󂮩
C2BA9
󂮪
C2BAA
󂮫
C2BAB
󂮬
C2BAC
󂮭
C2BAD
󂮮
C2BAE
󂮯
C2BAF
A0
B0
󂮰
C2BB0
󂮱
C2BB1
󂮲
C2BB2
󂮳
C2BB3
󂮴
C2BB4
󂮵
C2BB5
󂮶
C2BB6
󂮷
C2BB7
󂮸
C2BB8
󂮹
C2BB9
󂮺
C2BBA
󂮻
C2BBB
󂮼
C2BBC
󂮽
C2BBD
󂮾
C2BBE
󂮿
C2BBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]