International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F388BB

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󈻀
C8EC0
󈻁
C8EC1
󈻂
C8EC2
󈻃
C8EC3
󈻄
C8EC4
󈻅
C8EC5
󈻆
C8EC6
󈻇
C8EC7
󈻈
C8EC8
󈻉
C8EC9
󈻊
C8ECA
󈻋
C8ECB
󈻌
C8ECC
󈻍
C8ECD
󈻎
C8ECE
󈻏
C8ECF
80
90
󈻐
C8ED0
󈻑
C8ED1
󈻒
C8ED2
󈻓
C8ED3
󈻔
C8ED4
󈻕
C8ED5
󈻖
C8ED6
󈻗
C8ED7
󈻘
C8ED8
󈻙
C8ED9
󈻚
C8EDA
󈻛
C8EDB
󈻜
C8EDC
󈻝
C8EDD
󈻞
C8EDE
󈻟
C8EDF
90
A0
󈻠
C8EE0
󈻡
C8EE1
󈻢
C8EE2
󈻣
C8EE3
󈻤
C8EE4
󈻥
C8EE5
󈻦
C8EE6
󈻧
C8EE7
󈻨
C8EE8
󈻩
C8EE9
󈻪
C8EEA
󈻫
C8EEB
󈻬
C8EEC
󈻭
C8EED
󈻮
C8EEE
󈻯
C8EEF
A0
B0
󈻰
C8EF0
󈻱
C8EF1
󈻲
C8EF2
󈻳
C8EF3
󈻴
C8EF4
󈻵
C8EF5
󈻶
C8EF6
󈻷
C8EF7
󈻸
C8EF8
󈻹
C8EF9
󈻺
C8EFA
󈻻
C8EFB
󈻼
C8EFC
󈻽
C8EFD
󈻾
C8EFE
󈻿
C8EFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]