International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F395A4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󕤀
D5900
󕤁
D5901
󕤂
D5902
󕤃
D5903
󕤄
D5904
󕤅
D5905
󕤆
D5906
󕤇
D5907
󕤈
D5908
󕤉
D5909
󕤊
D590A
󕤋
D590B
󕤌
D590C
󕤍
D590D
󕤎
D590E
󕤏
D590F
80
90
󕤐
D5910
󕤑
D5911
󕤒
D5912
󕤓
D5913
󕤔
D5914
󕤕
D5915
󕤖
D5916
󕤗
D5917
󕤘
D5918
󕤙
D5919
󕤚
D591A
󕤛
D591B
󕤜
D591C
󕤝
D591D
󕤞
D591E
󕤟
D591F
90
A0
󕤠
D5920
󕤡
D5921
󕤢
D5922
󕤣
D5923
󕤤
D5924
󕤥
D5925
󕤦
D5926
󕤧
D5927
󕤨
D5928
󕤩
D5929
󕤪
D592A
󕤫
D592B
󕤬
D592C
󕤭
D592D
󕤮
D592E
󕤯
D592F
A0
B0
󕤰
D5930
󕤱
D5931
󕤲
D5932
󕤳
D5933
󕤴
D5934
󕤵
D5935
󕤶
D5936
󕤷
D5937
󕤸
D5938
󕤹
D5939
󕤺
D593A
󕤻
D593B
󕤼
D593C
󕤽
D593D
󕤾
D593E
󕤿
D593F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]