International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A0BB

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󠻀
E0EC0
󠻁
E0EC1
󠻂
E0EC2
󠻃
E0EC3
󠻄
E0EC4
󠻅
E0EC5
󠻆
E0EC6
󠻇
E0EC7
󠻈
E0EC8
󠻉
E0EC9
󠻊
E0ECA
󠻋
E0ECB
󠻌
E0ECC
󠻍
E0ECD
󠻎
E0ECE
󠻏
E0ECF
80
90
󠻐
E0ED0
󠻑
E0ED1
󠻒
E0ED2
󠻓
E0ED3
󠻔
E0ED4
󠻕
E0ED5
󠻖
E0ED6
󠻗
E0ED7
󠻘
E0ED8
󠻙
E0ED9
󠻚
E0EDA
󠻛
E0EDB
󠻜
E0EDC
󠻝
E0EDD
󠻞
E0EDE
󠻟
E0EDF
90
A0
󠻠
E0EE0
󠻡
E0EE1
󠻢
E0EE2
󠻣
E0EE3
󠻤
E0EE4
󠻥
E0EE5
󠻦
E0EE6
󠻧
E0EE7
󠻨
E0EE8
󠻩
E0EE9
󠻪
E0EEA
󠻫
E0EEB
󠻬
E0EEC
󠻭
E0EED
󠻮
E0EEE
󠻯
E0EEF
A0
B0
󠻰
E0EF0
󠻱
E0EF1
󠻲
E0EF2
󠻳
E0EF3
󠻴
E0EF4
󠻵
E0EF5
󠻶
E0EF6
󠻷
E0EF7
󠻸
E0EF8
󠻹
E0EF9
󠻺
E0EFA
󠻻
E0EFB
󠻼
E0EFC
󠻽
E0EFD
󠻾
E0EFE
󠻿
E0EFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]