International Components for Unicode


UTF-8


List of Converter Aliases

Internal converter name: UTF-8

All aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
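
The alias list above maps many names to one internal converter name. As a rough illustration (not ICU's own API), the sketch below builds a lookup table from those aliases. The helper `canonical_name` is hypothetical, and the case-insensitive matching is a simplifying assumption rather than a statement of ICU's exact comparison rules.

```python
# Aliases copied from the list above; the lookup helper itself is hypothetical.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption: aliases are compared without regard to case.
ALIAS_TO_CANONICAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(alias):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(alias.strip().lower())


print(canonical_name("IBM-1208"))
print(canonical_name("windows-65001"))
```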


Codepage Layout

Currently showing the codepage starting with the bytes F2 B5 A6

Rows give the high nibble of the final byte and columns give the low nibble. Each cell is the Unicode code point (hex) encoded by F2 B5 A6 followed by that byte.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (no entries)
80     B5980 B5981 B5982 B5983 B5984 B5985 B5986 B5987 B5988 B5989 B598A B598B B598C B598D B598E B598F
90     B5990 B5991 B5992 B5993 B5994 B5995 B5996 B5997 B5998 B5999 B599A B599B B599C B599D B599E B599F
A0     B59A0 B59A1 B59A2 B59A3 B59A4 B59A5 B59A6 B59A7 B59A8 B59A9 B59AA B59AB B59AC B59AD B59AE B59AF
B0     B59B0 B59B1 B59B2 B59B3 B59B4 B59B5 B59B6 B59B7 B59B8 B59B9 B59BA B59BB B59BC B59BD B59BE B59BF
C0-F0  (no entries)
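
The layout can be checked by hand: in a four-byte UTF-8 sequence the lead byte contributes 3 payload bits and each continuation byte contributes 6. Only final bytes 80 through BF are valid continuation bytes, which is why the other rows are empty. The following is a minimal sketch of that arithmetic. The function name `decode_four_byte` is illustrative, and it assumes the input is already a well-formed four-byte sequence (no validation is done).

```python
def decode_four_byte(b0, b1, b2, b3):
    """Combine the payload bits of a four-byte UTF-8 sequence into a code point.

    Assumes b0 is a 11110xxx lead byte and b1..b3 are 10xxxxxx continuation bytes.
    """
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


prefix = (0xF2, 0xB5, 0xA6)
first = decode_four_byte(*prefix, 0x80)
last = decode_four_byte(*prefix, 0xBF)
print(f"{first:X} {last:X}")  # B5980 B59BF

# Cross-check against Python's built-in UTF-8 decoder.
assert bytes([*prefix, 0x80]).decode("utf-8") == chr(first)
```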

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
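
One plausible reading of the "bytes per UChar" figures, assuming a UChar is a 16-bit UTF-16 code unit: a supplementary character takes 4 UTF-8 bytes but also 2 UChars, so the worst case per UChar is the 3-byte BMP form. The sketch below uses only Python's standard codecs to compare the two lengths and to show that the substitution bytes decode to U+FFFD. The helper `bytes_per_uchar` is a name made up for this illustration.

```python
def bytes_per_uchar(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


for ch in ["A", "\u00E9", "\u20AC", "\U000B5980"]:
    utf8_len, units = bytes_per_uchar(ch)
    print(f"U+{ord(ch):04X}: {utf8_len} bytes / {units} UChar(s)")

# The substitution bytes listed above are the UTF-8 form of U+FFFD.
assert b"\xEF\xBF\xBD".decode("utf-8") == "\uFFFD"

# ASCII compatibility: bytes 0x20-0x7E decode identically under ASCII and UTF-8.
printable = bytes(range(0x20, 0x7F))
assert printable.decode("utf-8") == printable.decode("ascii")
```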


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
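
The set pattern above reads as "every code point except the surrogates U+D800 through U+DFFF". As a small sketch, the hypothetical predicate `is_representable` expresses that rule directly, and `strict_utf8_encodes` compares it with Python's strict UTF-8 codec, which also refuses lone surrogates. This illustrates the rule and is not ICU's implementation.

```python
def is_representable(cp):
    """True if the code point is in [^\\uD800-\\uDFFF] within the Unicode range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def strict_utf8_encodes(cp):
    """True if Python's strict UTF-8 codec accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the boundaries of the surrogate block and one code point from the layout.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xB5980):
    assert is_representable(cp) == strict_utf8_encodes(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```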