International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2ADB6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򭶀
ADD80
򭶁
ADD81
򭶂
ADD82
򭶃
ADD83
򭶄
ADD84
򭶅
ADD85
򭶆
ADD86
򭶇
ADD87
򭶈
ADD88
򭶉
ADD89
򭶊
ADD8A
򭶋
ADD8B
򭶌
ADD8C
򭶍
ADD8D
򭶎
ADD8E
򭶏
ADD8F
80
90
򭶐
ADD90
򭶑
ADD91
򭶒
ADD92
򭶓
ADD93
򭶔
ADD94
򭶕
ADD95
򭶖
ADD96
򭶗
ADD97
򭶘
ADD98
򭶙
ADD99
򭶚
ADD9A
򭶛
ADD9B
򭶜
ADD9C
򭶝
ADD9D
򭶞
ADD9E
򭶟
ADD9F
90
A0
򭶠
ADDA0
򭶡
ADDA1
򭶢
ADDA2
򭶣
ADDA3
򭶤
ADDA4
򭶥
ADDA5
򭶦
ADDA6
򭶧
ADDA7
򭶨
ADDA8
򭶩
ADDA9
򭶪
ADDAA
򭶫
ADDAB
򭶬
ADDAC
򭶭
ADDAD
򭶮
ADDAE
򭶯
ADDAF
A0
B0
򭶰
ADDB0
򭶱
ADDB1
򭶲
ADDB2
򭶳
ADDB3
򭶴
ADDB4
򭶵
ADDB5
򭶶
ADDB6
򭶷
ADDB7
򭶸
ADDB8
򭶹
ADDB9
򭶺
ADDBA
򭶻
ADDBB
򭶼
ADDBC
򭶽
ADDBD
򭶾
ADDBE
򭶿
ADDBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]