International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F29C9A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򜚀
9C680
򜚁
9C681
򜚂
9C682
򜚃
9C683
򜚄
9C684
򜚅
9C685
򜚆
9C686
򜚇
9C687
򜚈
9C688
򜚉
9C689
򜚊
9C68A
򜚋
9C68B
򜚌
9C68C
򜚍
9C68D
򜚎
9C68E
򜚏
9C68F
80
90
򜚐
9C690
򜚑
9C691
򜚒
9C692
򜚓
9C693
򜚔
9C694
򜚕
9C695
򜚖
9C696
򜚗
9C697
򜚘
9C698
򜚙
9C699
򜚚
9C69A
򜚛
9C69B
򜚜
9C69C
򜚝
9C69D
򜚞
9C69E
򜚟
9C69F
90
A0
򜚠
9C6A0
򜚡
9C6A1
򜚢
9C6A2
򜚣
9C6A3
򜚤
9C6A4
򜚥
9C6A5
򜚦
9C6A6
򜚧
9C6A7
򜚨
9C6A8
򜚩
9C6A9
򜚪
9C6AA
򜚫
9C6AB
򜚬
9C6AC
򜚭
9C6AD
򜚮
9C6AE
򜚯
9C6AF
A0
B0
򜚰
9C6B0
򜚱
9C6B1
򜚲
9C6B2
򜚳
9C6B3
򜚴
9C6B4
򜚵
9C6B5
򜚶
9C6B6
򜚷
9C6B7
򜚸
9C6B8
򜚹
9C6B9
򜚺
9C6BA
򜚻
9C6BB
򜚼
9C6BC
򜚽
9C6BD
򜚾
9C6BE
򜚿
9C6BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]