International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2979A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򗚀
97680
򗚁
97681
򗚂
97682
򗚃
97683
򗚄
97684
򗚅
97685
򗚆
97686
򗚇
97687
򗚈
97688
򗚉
97689
򗚊
9768A
򗚋
9768B
򗚌
9768C
򗚍
9768D
򗚎
9768E
򗚏
9768F
80
90
򗚐
97690
򗚑
97691
򗚒
97692
򗚓
97693
򗚔
97694
򗚕
97695
򗚖
97696
򗚗
97697
򗚘
97698
򗚙
97699
򗚚
9769A
򗚛
9769B
򗚜
9769C
򗚝
9769D
򗚞
9769E
򗚟
9769F
90
A0
򗚠
976A0
򗚡
976A1
򗚢
976A2
򗚣
976A3
򗚤
976A4
򗚥
976A5
򗚦
976A6
򗚧
976A7
򗚨
976A8
򗚩
976A9
򗚪
976AA
򗚫
976AB
򗚬
976AC
򗚭
976AD
򗚮
976AE
򗚯
976AF
A0
B0
򗚰
976B0
򗚱
976B1
򗚲
976B2
򗚳
976B3
򗚴
976B4
򗚵
976B5
򗚶
976B6
򗚷
976B7
򗚸
976B8
򗚹
976B9
򗚺
976BA
򗚻
976BB
򗚼
976BC
򗚽
976BD
򗚾
976BE
򗚿
976BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]