International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2 BD A4

The row label plus the column label gives the final byte. Each cell is the code point (hexadecimal) for that four-byte sequence.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (no characters)
80  BD900 BD901 BD902 BD903 BD904 BD905 BD906 BD907 BD908 BD909 BD90A BD90B BD90C BD90D BD90E BD90F
90  BD910 BD911 BD912 BD913 BD914 BD915 BD916 BD917 BD918 BD919 BD91A BD91B BD91C BD91D BD91E BD91F
A0  BD920 BD921 BD922 BD923 BD924 BD925 BD926 BD927 BD928 BD929 BD92A BD92B BD92C BD92D BD92E BD92F
B0  BD930 BD931 BD932 BD933 BD934 BD935 BD936 BD937 BD938 BD939 BD93A BD93B BD93C BD93D BD93E BD93F
C0-F0  (no characters)
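
The layout above can be reproduced with a short sketch. It is not how the Converter Explorer itself builds the table; it uses Python's built-in UTF-8 codec rather than ICU, and the helper name decode_row is hypothetical. It appends every possible final byte to the lead bytes F2 BD A4 and records which sequences decode.

```python
# Lead bytes shown on the page; the final byte is tried over 0x00..0xFF.
PREFIX = bytes.fromhex("F2BDA4")


def decode_row(prefix: bytes) -> dict[int, int]:
    """Map each valid final byte to the code point the full sequence decodes to."""
    table = {}
    for last in range(0x100):
        try:
            text = (prefix + bytes([last])).decode("utf-8")
        except UnicodeDecodeError:
            continue  # not a valid continuation byte, so the cell stays empty
        table[last] = ord(text)
    return table


layout = decode_row(PREFIX)
first, final = min(layout), max(layout)
print(f"{first:02X}..{final:02X} -> U+{layout[first]:05X}..U+{layout[final]:05X}")
```

Only final bytes in the continuation range 80 to BF decode, which is why the rows 00 to 70 and C0 to F0 are empty.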

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
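
Some of these properties can be spot-checked with Python's built-in UTF-8 codec. This is only an illustrative sketch, not ICU's own logic: it assumes a UChar is one 16-bit UTF-16 code unit, the helper name utf8_bytes_per_uchar is hypothetical, and the four sample characters show that the ratios 1 and 3 occur rather than proving them to be bounds.

```python
def utf8_bytes_per_uchar(text: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units."""
    utf16_units = len(text.encode("utf-16-le")) // 2
    return len(text.encode("utf-8")) / utf16_units


# One sample per UTF-8 sequence length (1, 2, 3 and 4 bytes).
samples = ["A", "\u00e9", "\u20ac", "\U000BD900"]
ratios = [utf8_bytes_per_uchar(s) for s in samples]
print(min(ratios), max(ratios))

# The substitution bytes listed above decode to a single character.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")

# ASCII compatibility: each byte in 0x20..0x7E encodes as itself.
ascii_compatible = all(
    chr(c).encode("utf-8") == bytes([c]) for c in range(0x20, 0x7F)
)
print(ascii_compatible)
```

The four-byte sample yields a ratio of 2, because a supplementary code point occupies two UTF-16 code units. The largest ratio in the sample therefore comes from a three-byte character.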

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
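
The set above excludes only the surrogate range. A minimal sketch, again using Python's built-in codec as a stand-in for the ICU converter (is_representable is a hypothetical helper name), scans every code point and collects those that cannot be encoded.

```python
def is_representable(cp: int) -> bool:
    """Return True if the code point can be encoded as UTF-8."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False  # lone surrogates are rejected by the strict codec


# Scan the whole Unicode code space, U+0000..U+10FFFF.
excluded = [cp for cp in range(0x110000) if not is_representable(cp)]
print(f"U+{excluded[0]:04X}..U+{excluded[-1]:04X}, {len(excluded)} code points")
```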