International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3BF97

Each cell gives the Unicode code point (hexadecimal) for the trailing byte formed by the row and column values.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  FF5C0 FF5C1 FF5C2 FF5C3 FF5C4 FF5C5 FF5C6 FF5C7 FF5C8 FF5C9 FF5CA FF5CB FF5CC FF5CD FF5CE FF5CF
90  FF5D0 FF5D1 FF5D2 FF5D3 FF5D4 FF5D5 FF5D6 FF5D7 FF5D8 FF5D9 FF5DA FF5DB FF5DC FF5DD FF5DE FF5DF
A0  FF5E0 FF5E1 FF5E2 FF5E3 FF5E4 FF5E5 FF5E6 FF5E7 FF5E8 FF5E9 FF5EA FF5EB FF5EC FF5ED FF5EE FF5EF
B0  FF5F0 FF5F1 FF5F2 FF5F3 FF5F4 FF5F5 FF5F6 FF5F7 FF5F8 FF5F9 FF5FA FF5FB FF5FC FF5FD FF5FE FF5FF

Rows 00 through 70 and C0 through F0 are empty.
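The mapping in the layout can be reproduced with any strict UTF-8 decoder. The following is a minimal sketch using Python's built-in codec rather than ICU itself; the helper name `codepage_cell` is hypothetical and introduced only for illustration. It appends one trailing byte to the lead bytes F3 BF 97 and reports the resulting code point.

```python
def codepage_cell(lead: bytes, trail: int) -> int:
    """Decode the lead bytes plus one trailing byte; return the code point.

    Raises UnicodeDecodeError when the trailing byte does not complete
    a valid UTF-8 sequence (anything outside 0x80-0xBF here).
    """
    text = (lead + bytes([trail])).decode("utf-8")
    return ord(text)


lead = bytes.fromhex("F3BF97")

# Print the four populated rows of the layout: 80, 90, A0, B0.
for row in range(0x80, 0xC0, 0x10):
    cells = [f"{codepage_cell(lead, row + col):05X}" for col in range(16)]
    print(f"{row:02X}", " ".join(cells))
```

Trailing bytes outside 80 through BF are not valid UTF-8 continuation bytes, which is consistent with the empty rows in the layout.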

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
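The byte counts above are per UChar, that is, per UTF-16 code unit, not per code point. The sketch below checks this with Python's built-in codecs as a stand-in for ICU; the function name `utf8_bytes_per_uchar` and the constant `SUBSTITUTION` are hypothetical names chosen for this example. A supplementary character takes 4 UTF-8 bytes but occupies 2 UChars, so the per-UChar maximum of 3 is reached by BMP characters.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """Return the UTF-8 byte count divided by the UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_count = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / uchar_count


samples = {
    "A": "ASCII, U+0041",
    "\u20ac": "BMP, U+20AC",
    "\U000FF5C0": "supplementary, U+FF5C0",
}
for ch, label in samples.items():
    print(label, utf8_bytes_per_uchar(ch))

# The substitution bytes listed above are the UTF-8 form of U+FFFD.
SUBSTITUTION = "\ufffd".encode("utf-8")
print(SUBSTITUTION.hex(" ").upper())
```

The three samples give ratios of 1.0, 3.0, and 2.0 respectively, and the last line prints EF BF BD.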

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
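The set above covers every code point except the surrogate range U+D800 through U+DFFF. As a rough check, assuming Python's strict UTF-8 codec behaves like the converter in this respect, the sketch below tests a few boundary values; `is_representable` is a hypothetical helper, not an ICU function.

```python
def is_representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as strict UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python rejects lone surrogates (U+D800-U+DFFF) in strict mode.
        return False


for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xFF5C0, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp))
```

Only U+D800 and U+DFFF report False in this list; the values on either side of the surrogate range, and the private-use code point U+FF5C0 from the layout, report True.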