International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
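
The alias list above can be treated as a lookup table from any alias to the internal converter name. The following is a minimal sketch, not ICU's implementation: the names `UTF8_ALIASES`, `normalize`, and `resolve` are hypothetical, and the matching rule (ignore case, '-', '_', and spaces) is a simplified assumption loosely modeled on how ICU compares converter names.

```python
# Aliases copied from the table above; the lookup scheme is an assumption.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (simplified rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name, or None for an unknown alias."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


print(resolve("IBM_1208"))
print(resolve("windows-65001"))
```

Normalizing before the lookup keeps the table small while still accepting spelling variants such as `utf8` or `CP1208`.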


Codepage Layout

Currently showing the codepage starting with the bytes F2BEA6

Each cell gives the Unicode code point (hexadecimal) for the bytes F2 BE A6 followed by a final byte. The row label is the high nibble of the final byte and the column label is the low nibble.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80  BE980 BE981 BE982 BE983 BE984 BE985 BE986 BE987 BE988 BE989 BE98A BE98B BE98C BE98D BE98E BE98F
90  BE990 BE991 BE992 BE993 BE994 BE995 BE996 BE997 BE998 BE999 BE99A BE99B BE99C BE99D BE99E BE99F
A0  BE9A0 BE9A1 BE9A2 BE9A3 BE9A4 BE9A5 BE9A6 BE9A7 BE9A8 BE9A9 BE9AA BE9AB BE9AC BE9AD BE9AE BE9AF
B0  BE9B0 BE9B1 BE9B2 BE9B3 BE9B4 BE9B5 BE9B6 BE9B7 BE9B8 BE9B9 BE9BA BE9BB BE9BC BE9BD BE9BE BE9BF
C0-F0  (empty)
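
The layout above follows directly from the UTF-8 bit layout: only final bytes 80 through BF are valid continuation bytes, which is why the other rows are empty. The sketch below regenerates the four populated rows with Python's built-in codec and cross-checks each cell against a manual bit computation. The names `LEAD`, `decode_manually`, and `layout_rows` are hypothetical helpers, not part of ICU.

```python
LEAD = bytes.fromhex("F2BEA6")  # the three leading bytes named on this page


def decode_manually(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence by bit arithmetic (no validation)."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)   # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def layout_rows():
    """Yield (row label, code points) for final bytes 0x80 through 0xBF."""
    for high in range(0x80, 0xC0, 0x10):
        row = []
        for low in range(0x10):
            seq = LEAD + bytes([high | low])
            cp = ord(seq.decode("utf-8"))
            assert cp == decode_manually(seq)  # both methods must agree
            row.append(cp)
        yield f"{high:02X}", row


for label, row in layout_rows():
    print(label, " ".join(f"{cp:X}" for cp in row))
```

A final byte outside 80 through BF, such as 7F, makes `decode("utf-8")` raise `UnicodeDecodeError`, matching the empty rows in the table.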

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
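
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. That explains the maximum of 3: a supplementary character takes 4 UTF-8 bytes but occupies 2 UChars. The sketch below checks this with Python's built-in codecs; `utf8_bytes_per_uchar` and `SAMPLES` are hypothetical names chosen for illustration.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


SAMPLES = {
    "U+0041": "A",            # 1 byte, 1 UChar
    "U+00E9": "\u00e9",       # 2 bytes, 1 UChar
    "U+20AC": "\u20ac",       # 3 bytes, 1 UChar
    "U+BE980": "\U000BE980",  # 4 bytes, 2 UChars (surrogate pair in UTF-16)
}

ratios = {name: utf8_bytes_per_uchar(ch) for name, ch in SAMPLES.items()}
for name, ratio in ratios.items():
    print(name, ratio)

# The substitution character listed above is U+FFFD encoded in UTF-8.
SUBSTITUTION = "\ufffd".encode("utf-8")
print(SUBSTITUTION.hex().upper())
```

Python's decoder uses the same replacement character when asked to substitute for invalid input, for example `b"\xff".decode("utf-8", errors="replace")`.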

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
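
The set pattern above says that every code point except the surrogate range U+D800 through U+DFFF is representable. The sketch below probes the boundaries of that range using Python's strict UTF-8 encoder as a stand-in for the ICU converter; that substitution is an assumption, and `representable` is a hypothetical helper name.

```python
def representable(cp: int) -> bool:
    """Return True if the code point can be encoded as strict UTF-8."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python rejects lone surrogates, matching [^\uD800-\uDFFF].
        return False


# Probe the edges of the surrogate range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", representable(cp))
```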