International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4 8C 9A.

Row labels give the high nibble of the final byte and column labels the low nibble. Each cell is the code point (hexadecimal) encoded by the full four-byte sequence. Rows 00-70 and C0-F0 are empty.

      00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80  10C680 10C681 10C682 10C683 10C684 10C685 10C686 10C687 10C688 10C689 10C68A 10C68B 10C68C 10C68D 10C68E 10C68F
90  10C690 10C691 10C692 10C693 10C694 10C695 10C696 10C697 10C698 10C699 10C69A 10C69B 10C69C 10C69D 10C69E 10C69F
A0  10C6A0 10C6A1 10C6A2 10C6A3 10C6A4 10C6A5 10C6A6 10C6A7 10C6A8 10C6A9 10C6AA 10C6AB 10C6AC 10C6AD 10C6AE 10C6AF
B0  10C6B0 10C6B1 10C6B2 10C6B3 10C6B4 10C6B5 10C6B6 10C6B7 10C6B8 10C6B9 10C6BA 10C6BB 10C6BC 10C6BD 10C6BE 10C6BF
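
The code points in this layout follow from the standard UTF-8 bit pattern for four-byte sequences: the lead byte carries 3 payload bits and each continuation byte carries 6. The following is a minimal sketch in plain Python, not ICU code. The names `decode_four_byte`, `LEAD`, and `layout` are illustrative and do not come from the page. It rebuilds the 64 populated cells from the lead bytes F4 8C 9A and cross-checks them against Python's built-in UTF-8 codec.

```python
def decode_four_byte(b0, b1, b2, b3):
    """Combine the payload bits of a four-byte UTF-8 sequence into a code point."""
    return (
        ((b0 & 0x07) << 18)    # lead byte 11110xxx: 3 payload bits
        | ((b1 & 0x3F) << 12)  # continuation 10xxxxxx: 6 payload bits
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


# Lead bytes taken from the page; the final byte ranges over 80..BF.
LEAD = (0xF4, 0x8C, 0x9A)
layout = {trail: decode_four_byte(*LEAD, trail) for trail in range(0x80, 0xC0)}

# Print one line per populated row (80, 90, A0, B0).
for row in range(0x80, 0xC0, 0x10):
    cells = " ".join(f"{layout[row + col]:06X}" for col in range(16))
    print(f"{row:02X}  {cells}")

# Cross-check the hand-rolled arithmetic against Python's own decoder.
for trail, code_point in layout.items():
    assert bytes([*LEAD, trail]).decode("utf-8") == chr(code_point)
```

Only final bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout are empty.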

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
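
The byte counts above are stated per UChar, which in ICU is a 16-bit UTF-16 code unit. The sketch below checks those figures with Python's standard codecs rather than with ICU itself, so it illustrates the arithmetic and not the converter's implementation. The helper `bytes_per_uchar` and the sample characters are assumptions made for illustration.

```python
def bytes_per_uchar(ch):
    """UTF-8 bytes needed per UTF-16 code unit for a single character."""
    utf8_bytes = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_bytes / utf16_units


samples = {
    "A": bytes_per_uchar("A"),                    # ASCII
    "U+00E9": bytes_per_uchar("\u00e9"),          # two-byte UTF-8 form
    "U+FFFF": bytes_per_uchar("\uffff"),          # last BMP code point
    "U+10C680": bytes_per_uchar("\U0010C680"),    # 4 bytes over 2 code units
}

for label, ratio in samples.items():
    print(f"{label}: {ratio} bytes per UChar")

# The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
substitution = "\ufffd".encode("utf-8")
print(substitution.hex(" ").upper())
```

A supplementary character takes four UTF-8 bytes but occupies two UChars, so the per-UChar maximum is reached inside the BMP and not by four-byte sequences.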

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
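
The set expression excludes only the surrogate range U+D800 to U+DFFF; every other code point is representable. As a hedged illustration, the sketch below writes that membership test directly and compares it with what Python's strict UTF-8 encoder accepts. The function names `in_representable_set` and `python_can_encode` are illustrative and are not part of ICU.

```python
def in_representable_set(code_point):
    """Membership test for [^\\uD800-\\uDFFF] over the Unicode code space."""
    if not 0 <= code_point <= 0x10FFFF:
        return False
    return not (0xD800 <= code_point <= 0xDFFF)


def python_can_encode(code_point):
    """Whether Python's strict UTF-8 encoder accepts this code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe both edges of the surrogate range plus one code point from the layout.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10C680):
    print(f"U+{cp:04X}: set={in_representable_set(cp)} codec={python_can_encode(cp)}")
```

Python's encoder is used here only as an independent check; it rejects lone surrogates in strict mode, which agrees with the set shown above.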