International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A39A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򣚀
A3680
򣚁
A3681
򣚂
A3682
򣚃
A3683
򣚄
A3684
򣚅
A3685
򣚆
A3686
򣚇
A3687
򣚈
A3688
򣚉
A3689
򣚊
A368A
򣚋
A368B
򣚌
A368C
򣚍
A368D
򣚎
A368E
򣚏
A368F
80
90
򣚐
A3690
򣚑
A3691
򣚒
A3692
򣚓
A3693
򣚔
A3694
򣚕
A3695
򣚖
A3696
򣚗
A3697
򣚘
A3698
򣚙
A3699
򣚚
A369A
򣚛
A369B
򣚜
A369C
򣚝
A369D
򣚞
A369E
򣚟
A369F
90
A0
򣚠
A36A0
򣚡
A36A1
򣚢
A36A2
򣚣
A36A3
򣚤
A36A4
򣚥
A36A5
򣚦
A36A6
򣚧
A36A7
򣚨
A36A8
򣚩
A36A9
򣚪
A36AA
򣚫
A36AB
򣚬
A36AC
򣚭
A36AD
򣚮
A36AE
򣚯
A36AF
A0
B0
򣚰
A36B0
򣚱
A36B1
򣚲
A36B2
򣚳
A36B3
򣚴
A36B4
򣚵
A36B5
򣚶
A36B6
򣚷
A36B7
򣚸
A36B8
򣚹
A36B9
򣚺
A36BA
򣚻
A36BB
򣚼
A36BC
򣚽
A36BD
򣚾
A36BE
򣚿
A36BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]