International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F286A6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򆦀
86980
򆦁
86981
򆦂
86982
򆦃
86983
򆦄
86984
򆦅
86985
򆦆
86986
򆦇
86987
򆦈
86988
򆦉
86989
򆦊
8698A
򆦋
8698B
򆦌
8698C
򆦍
8698D
򆦎
8698E
򆦏
8698F
80
90
򆦐
86990
򆦑
86991
򆦒
86992
򆦓
86993
򆦔
86994
򆦕
86995
򆦖
86996
򆦗
86997
򆦘
86998
򆦙
86999
򆦚
8699A
򆦛
8699B
򆦜
8699C
򆦝
8699D
򆦞
8699E
򆦟
8699F
90
A0
򆦠
869A0
򆦡
869A1
򆦢
869A2
򆦣
869A3
򆦤
869A4
򆦥
869A5
򆦦
869A6
򆦧
869A7
򆦨
869A8
򆦩
869A9
򆦪
869AA
򆦫
869AB
򆦬
869AC
򆦭
869AD
򆦮
869AE
򆦯
869AF
A0
B0
򆦰
869B0
򆦱
869B1
򆦲
869B2
򆦳
869B3
򆦴
869B4
򆦵
869B5
򆦶
869B6
򆦷
869B7
򆦸
869B8
򆦹
869B9
򆦺
869BA
򆦻
869BB
򆦼
869BC
򆦽
869BD
򆦾
869BE
򆦿
869BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]