International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
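
As a rough illustration of how an alias list like this can be used, the sketch below checks whether a given name refers to the UTF-8 converter. The helper names (`normalize`, `is_utf8_alias`) are hypothetical, and the matching rule (ignore case and every character that is not a letter or digit) is a simplifying assumption; ICU's own name comparison may apply further rules.

```python
import re

# Aliases copied from the list above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name):
    """Lowercase the name and drop everything except letters and digits (assumed rule)."""
    return re.sub(r"[^0-9a-z]", "", name.lower())

# Precompute the normalized keys once so lookups are a set membership test.
_ALIAS_KEYS = {normalize(alias) for alias in UTF8_ALIASES}

def is_utf8_alias(name):
    """Return True if `name` matches one of the listed UTF-8 aliases."""
    return normalize(name) in _ALIAS_KEYS

print(is_utf8_alias("IBM_1208"))
print(is_utf8_alias("latin1"))
```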


Codepage Layout

Currently showing the codepage starting with the bytes F283A6

Rows give the high nibble and columns the low nibble of the final byte. Each cell is the code point (hexadecimal) of the character encoded by F2 83 A6 followed by that byte. Rows 00-70 and C0-F0 contain no characters.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80  83980  83981  83982  83983  83984  83985  83986  83987  83988  83989  8398A  8398B  8398C  8398D  8398E  8398F
90  83990  83991  83992  83993  83994  83995  83996  83997  83998  83999  8399A  8399B  8399C  8399D  8399E  8399F
A0  839A0  839A1  839A2  839A3  839A4  839A5  839A6  839A7  839A8  839A9  839AA  839AB  839AC  839AD  839AE  839AF
B0  839B0  839B1  839B2  839B3  839B4  839B5  839B6  839B7  839B8  839B9  839BA  839BB  839BC  839BD  839BE  839BF
C0-F0  (empty)
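
The code points in the layout follow from the standard UTF-8 bit layout for four-byte sequences. The sketch below decodes the prefix F2 83 A6 plus a final byte by hand and cross-checks the result against Python's built-in decoder. The function names are hypothetical, and the decoder checks only the byte shapes, not every validity rule of UTF-8.

```python
PREFIX = bytes.fromhex("F283A6")  # lead bytes shown on this page

def decode_utf8_4byte(seq):
    """Decode one 4-byte UTF-8 sequence to a code point (shape checks only)."""
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("not a 4-byte UTF-8 lead byte")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("invalid continuation byte")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return (((seq[0] & 0x07) << 18) | ((seq[1] & 0x3F) << 12)
            | ((seq[2] & 0x3F) << 6) | (seq[3] & 0x3F))

def layout_row(row_start):
    """Return the 16 code points (as hex strings) for one row of the layout."""
    return [f"{decode_utf8_4byte(PREFIX + bytes([row_start + col])):05X}"
            for col in range(16)]

first = decode_utf8_4byte(PREFIX + b"\x80")
last = decode_utf8_4byte(PREFIX + b"\xBF")

# Cross-check against the built-in UTF-8 decoder.
assert (PREFIX + b"\x80").decode("utf-8") == chr(first)

for row in (0x80, 0x90, 0xA0, 0xB0):
    print(f"{row:02X}", " ".join(layout_row(row)))
```

Only final bytes 80 through BF are continuation bytes, which is why the other rows of the layout are empty.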

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
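
One plausible reading of the byte counts above is that they are measured per UChar, ICU's 16-bit UTF-16 code unit: a supplementary character takes four UTF-8 bytes but also two code units, so the worst case per code unit is three bytes. The sketch below compares the two sizes for a few sample characters and confirms that \xEF\xBF\xBD is the UTF-8 encoding of U+FFFD. The helper name and the sample characters are assumptions chosen for illustration.

```python
def utf8_and_utf16_sizes(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_bytes = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-be")) // 2  # 2 bytes per code unit
    return utf8_bytes, utf16_units

# ASCII, a 2-byte character, a 3-byte character, a supplementary character.
samples = ["A", "\u00E9", "\uFFFD", "\U00083980"]
for ch in samples:
    nbytes, units = utf8_and_utf16_sizes(ch)
    print(f"U+{ord(ch):05X}: {nbytes} bytes, {units} code unit(s), "
          f"{nbytes / units} bytes per code unit")

# The substitution character listed above is U+FFFD encoded in UTF-8.
SUBSTITUTION = "\uFFFD".encode("utf-8")
```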

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
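
The set above covers every code point except the surrogate range U+D800 to U+DFFF. As a minimal sketch, the check below uses Python's strict UTF-8 encoder, which also rejects surrogates, to probe the boundaries of that range. The function name `is_representable` is hypothetical, and Python's codec stands in here for the ICU converter.

```python
def is_representable(code_point):
    """Return True if the code point can be encoded by a strict UTF-8 encoder."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates (U+D800..U+DFFF).
        return False

# Probe the edges of the excluded range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```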