International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1A1A4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񡤀
61900
񡤁
61901
񡤂
61902
񡤃
61903
񡤄
61904
񡤅
61905
񡤆
61906
񡤇
61907
񡤈
61908
񡤉
61909
񡤊
6190A
񡤋
6190B
񡤌
6190C
񡤍
6190D
񡤎
6190E
񡤏
6190F
80
90
񡤐
61910
񡤑
61911
񡤒
61912
񡤓
61913
񡤔
61914
񡤕
61915
񡤖
61916
񡤗
61917
񡤘
61918
񡤙
61919
񡤚
6191A
񡤛
6191B
񡤜
6191C
񡤝
6191D
񡤞
6191E
񡤟
6191F
90
A0
񡤠
61920
񡤡
61921
񡤢
61922
񡤣
61923
񡤤
61924
񡤥
61925
񡤦
61926
񡤧
61927
񡤨
61928
񡤩
61929
񡤪
6192A
񡤫
6192B
񡤬
6192C
񡤭
6192D
񡤮
6192E
񡤯
6192F
A0
B0
񡤰
61930
񡤱
61931
񡤲
61932
񡤳
61933
񡤴
61934
񡤵
61935
񡤶
61936
񡤷
61937
񡤸
61938
񡤹
61939
񡤺
6193A
񡤻
6193B
񡤼
6193C
񡤽
6193D
񡤾
6193E
񡤿
6193F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]