International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48FBF

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􏿀
10FFC0
􏿁
10FFC1
􏿂
10FFC2
􏿃
10FFC3
􏿄
10FFC4
􏿅
10FFC5
􏿆
10FFC6
􏿇
10FFC7
􏿈
10FFC8
􏿉
10FFC9
􏿊
10FFCA
􏿋
10FFCB
􏿌
10FFCC
􏿍
10FFCD
􏿎
10FFCE
􏿏
10FFCF
80
90
􏿐
10FFD0
􏿑
10FFD1
􏿒
10FFD2
􏿓
10FFD3
􏿔
10FFD4
􏿕
10FFD5
􏿖
10FFD6
􏿗
10FFD7
􏿘
10FFD8
􏿙
10FFD9
􏿚
10FFDA
􏿛
10FFDB
􏿜
10FFDC
􏿝
10FFDD
􏿞
10FFDE
􏿟
10FFDF
90
A0
􏿠
10FFE0
􏿡
10FFE1
􏿢
10FFE2
􏿣
10FFE3
􏿤
10FFE4
􏿥
10FFE5
􏿦
10FFE6
􏿧
10FFE7
􏿨
10FFE8
􏿩
10FFE9
􏿪
10FFEA
􏿫
10FFEB
􏿬
10FFEC
􏿭
10FFED
􏿮
10FFEE
􏿯
10FFEF
A0
B0
􏿰
10FFF0
􏿱
10FFF1
􏿲
10FFF2
􏿳
10FFF3
􏿴
10FFF4
􏿵
10FFF5
􏿶
10FFF6
􏿷
10FFF7
􏿸
10FFF8
􏿹
10FFF9
􏿺
10FFFA
􏿻
10FFFB
􏿼
10FFFC
􏿽
10FFFD
􏿾
10FFFE
􏿿
10FFFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]