International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4 84 8F

Rows give the high nibble and columns the low nibble of the final byte. Each populated cell lists the Unicode code point (hexadecimal) that the four-byte sequence maps to.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80     1043C0 1043C1 1043C2 1043C3 1043C4 1043C5 1043C6 1043C7 1043C8 1043C9 1043CA 1043CB 1043CC 1043CD 1043CE 1043CF
90     1043D0 1043D1 1043D2 1043D3 1043D4 1043D5 1043D6 1043D7 1043D8 1043D9 1043DA 1043DB 1043DC 1043DD 1043DE 1043DF
A0     1043E0 1043E1 1043E2 1043E3 1043E4 1043E5 1043E6 1043E7 1043E8 1043E9 1043EA 1043EB 1043EC 1043ED 1043EE 1043EF
B0     1043F0 1043F1 1043F2 1043F3 1043F4 1043F5 1043F6 1043F7 1043F8 1043F9 1043FA 1043FB 1043FC 1043FD 1043FE 1043FF
C0-F0  (empty)
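
The link between the lead bytes F4 84 8F and the code points in the table follows from the standard UTF-8 four-byte layout. The sketch below is illustrative and not part of the Converter Explorer; the helper name `decode_four_byte` is hypothetical, and Python's built-in UTF-8 codec is used only as a cross-check.

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point."""
    # The lead byte 11110xxx carries 3 payload bits;
    # each continuation byte 10xxxxxx carries 6.
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


# Final bytes 80..BF correspond to the four populated rows of the table.
code_points = [decode_four_byte(0xF4, 0x84, 0x8F, t) for t in range(0x80, 0xC0)]

print(f"{code_points[0]:06X}..{code_points[-1]:06X}")  # prints 1043C0..1043FF

# Cross-check every cell against Python's own UTF-8 decoder.
for t, cp in zip(range(0x80, 0xC0), code_points):
    assert ord(bytes([0xF4, 0x84, 0x8F, t]).decode("utf-8")) == cp
```

Final bytes outside 80..BF are not valid continuation bytes, which is why the other rows of the table are empty.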

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
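
One way to read the per-UChar figures is to count UTF-8 bytes per UTF-16 code unit, since a UChar in ICU is a 16-bit UTF-16 code unit. The sketch below uses Python's standard codecs rather than ICU, and the helper name `utf8_bytes_per_utf16_unit` is hypothetical; it also shows that the substitution bytes are the UTF-8 form of U+FFFD.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# ASCII, 2-byte, 3-byte, and 4-byte (surrogate pair in UTF-16) samples.
samples = ["A", "\u00e9", "\u20ac", "\U001043C0"]
ratios = [utf8_bytes_per_utf16_unit(ch) for ch in samples]

print(min(ratios), max(ratios))  # prints 1.0 3.0

# The substitution character \xEF\xBF\xBD decodes to U+FFFD.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")  # prints U+FFFD
```

A four-byte sequence occupies two UTF-16 code units, so it contributes 2 bytes per UChar and the maximum stays at 3.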

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
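
The set above excludes only the surrogate range U+D800..U+DFFF. As a rough check, Python's strict UTF-8 codec can stand in for the ICU converter; that the two agree on this set is an assumption of the sketch, and `is_representable` is a hypothetical helper name.

```python
def is_representable(cp: int) -> bool:
    """Return True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates, which have no UTF-8 encoding.
        return False


# Boundary code points on either side of the surrogate range.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```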