International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA MIME
UTF-8 UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F489BB

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􉻀
109EC0
􉻁
109EC1
􉻂
109EC2
􉻃
109EC3
􉻄
109EC4
􉻅
109EC5
􉻆
109EC6
􉻇
109EC7
􉻈
109EC8
􉻉
109EC9
􉻊
109ECA
􉻋
109ECB
􉻌
109ECC
􉻍
109ECD
􉻎
109ECE
􉻏
109ECF
80
90
􉻐
109ED0
􉻑
109ED1
􉻒
109ED2
􉻓
109ED3
􉻔
109ED4
􉻕
109ED5
􉻖
109ED6
􉻗
109ED7
􉻘
109ED8
􉻙
109ED9
􉻚
109EDA
􉻛
109EDB
􉻜
109EDC
􉻝
109EDD
􉻞
109EDE
􉻟
109EDF
90
A0
􉻠
109EE0
􉻡
109EE1
􉻢
109EE2
􉻣
109EE3
􉻤
109EE4
􉻥
109EE5
􉻦
109EE6
􉻧
109EE7
􉻨
109EE8
􉻩
109EE9
􉻪
109EEA
􉻫
109EEB
􉻬
109EEC
􉻭
109EED
􉻮
109EEE
􉻯
109EEF
A0
B0
􉻰
109EF0
􉻱
109EF1
􉻲
109EF2
􉻳
109EF3
􉻴
109EF4
􉻵
109EF5
􉻶
109EF6
􉻷
109EF7
􉻸
109EF8
􉻹
109EF9
􉻺
109EFA
􉻻
109EFB
􉻼
109EFC
􉻽
109EFD
􉻾
109EFE
􉻿
109EFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]