International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38CB8

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󌸀
CCE00
󌸁
CCE01
󌸂
CCE02
󌸃
CCE03
󌸄
CCE04
󌸅
CCE05
󌸆
CCE06
󌸇
CCE07
󌸈
CCE08
󌸉
CCE09
󌸊
CCE0A
󌸋
CCE0B
󌸌
CCE0C
󌸍
CCE0D
󌸎
CCE0E
󌸏
CCE0F
80
90
󌸐
CCE10
󌸑
CCE11
󌸒
CCE12
󌸓
CCE13
󌸔
CCE14
󌸕
CCE15
󌸖
CCE16
󌸗
CCE17
󌸘
CCE18
󌸙
CCE19
󌸚
CCE1A
󌸛
CCE1B
󌸜
CCE1C
󌸝
CCE1D
󌸞
CCE1E
󌸟
CCE1F
90
A0
󌸠
CCE20
󌸡
CCE21
󌸢
CCE22
󌸣
CCE23
󌸤
CCE24
󌸥
CCE25
󌸦
CCE26
󌸧
CCE27
󌸨
CCE28
󌸩
CCE29
󌸪
CCE2A
󌸫
CCE2B
󌸬
CCE2C
󌸭
CCE2D
󌸮
CCE2E
󌸯
CCE2F
A0
B0
󌸰
CCE30
󌸱
CCE31
󌸲
CCE32
󌸳
CCE33
󌸴
CCE34
󌸵
CCE35
󌸶
CCE36
󌸷
CCE37
󌸸
CCE38
󌸹
CCE39
󌸺
CCE3A
󌸻
CCE3B
󌸼
CCE3C
󌸽
CCE3D
󌸾
CCE3E
󌸿
CCE3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]