International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2BCA6

Rows give the high nibble of the final byte and columns give the low nibble. Each cell is the Unicode code point (hexadecimal) encoded by the full byte sequence.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  BC980 BC981 BC982 BC983 BC984 BC985 BC986 BC987 BC988 BC989 BC98A BC98B BC98C BC98D BC98E BC98F
90  BC990 BC991 BC992 BC993 BC994 BC995 BC996 BC997 BC998 BC999 BC99A BC99B BC99C BC99D BC99E BC99F
A0  BC9A0 BC9A1 BC9A2 BC9A3 BC9A4 BC9A5 BC9A6 BC9A7 BC9A8 BC9A9 BC9AA BC9AB BC9AC BC9AD BC9AE BC9AF
B0  BC9B0 BC9B1 BC9B2 BC9B3 BC9B4 BC9B5 BC9B6 BC9B7 BC9B8 BC9B9 BC9BA BC9BB BC9BC BC9BD BC9BE BC9BF

Rows 00 through 70 and C0 through F0 contain no characters.
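The relationship between the byte prefix F2BCA6 and the code points in the layout can be checked by hand. The following is a minimal Python sketch, not part of the ICU demo, that assumes the standard 4-byte UTF-8 bit layout (11110xxx 10xxxxxx 10xxxxxx 10xxxxxx); the function name `decode_utf8_4byte` is hypothetical.

```python
def decode_utf8_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point by bit arithmetic."""
    if len(seq) != 4:
        raise ValueError("expected exactly 4 bytes")
    lead, t1, t2, t3 = seq
    # The lead byte of a 4-byte sequence has the form 11110xxx.
    if lead & 0xF8 != 0xF0:
        raise ValueError("lead byte is not of the form 11110xxx")
    # Every trail byte has the form 10xxxxxx, i.e. lies in 80..BF.
    for trail in (t1, t2, t3):
        if trail & 0xC0 != 0x80:
            raise ValueError("trail byte is not of the form 10xxxxxx")
    # Concatenate the 3 + 6 + 6 + 6 payload bits.
    cp = ((lead & 0x07) << 18) | ((t1 & 0x3F) << 12) | ((t2 & 0x3F) << 6) | (t3 & 0x3F)
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("overlong encoding or code point out of range")
    return cp


# First and last cells of the layout: final bytes 80 and BF.
first = decode_utf8_4byte(bytes.fromhex("F2BCA680"))
last = decode_utf8_4byte(bytes.fromhex("F2BCA6BF"))
print(f"{first:X} {last:X}")  # prints "BC980 BC9BF"
```

Only final bytes 80 through BF are valid trail bytes, which is consistent with the other rows of the layout being empty.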

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
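The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. The Python sketch below illustrates how that yields a minimum of 1 and a maximum of 3, and what the substitution bytes decode to. It uses only Python's built-in codecs, not ICU itself, and the helper name `utf8_bytes_and_utf16_units` is hypothetical.

```python
def utf8_bytes_and_utf16_units(ch: str) -> tuple:
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len, utf16_units


# ASCII: 1 byte for 1 code unit (the minimum).
ascii_case = utf8_bytes_and_utf16_units("A")
# Top of the BMP: 3 bytes for 1 code unit (the maximum).
bmp_case = utf8_bytes_and_utf16_units("\uFFFF")
# Supplementary plane: 4 bytes, but spread over 2 code units (a surrogate pair).
supplementary_case = utf8_bytes_and_utf16_units(chr(0xBC980))

# The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")

print(ascii_case, bmp_case, supplementary_case, f"U+{ord(substitution):04X}")
```

A supplementary character therefore averages 2 bytes per UChar, so it does not raise the maximum above 3.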


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
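The set above covers every code point except the surrogate range U+D800 to U+DFFF. As a rough illustration, the Python sketch below tests membership in that set and compares the result with Python's own strict UTF-8 codec, which also rejects lone surrogates. Both function names are hypothetical, and Python's codec stands in for the ICU converter here.

```python
def in_representable_set(ch: str) -> bool:
    """True if the single code point matches [^\\uD800-\\uDFFF]."""
    return not 0xD800 <= ord(ch) <= 0xDFFF


def encodes_as_utf8(ch: str) -> bool:
    """True if Python's strict UTF-8 codec accepts the character."""
    try:
        ch.encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary code points around the surrogate range, plus the last code point.
samples = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
results = {cp: (in_representable_set(chr(cp)), encodes_as_utf8(chr(cp))) for cp in samples}
for cp, (in_set, encodable) in results.items():
    print(f"U+{cp:04X} in_set={in_set} encodable={encodable}")
```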