International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
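
As a rough illustration of how an alias list like this can be used, the sketch below maps every alias to the internal converter name. The alias strings are copied from the list above. The loose-matching rule (ignore case, hyphens, underscores, and spaces) and the helper names `normalize` and `canonical_name` are assumptions made for this example, not a description of ICU's own lookup code.

```python
import re

# Aliases copied from the list above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (assumed loose-matching rule)."""
    return re.sub(r"[-_ ]", "", name).lower()

# Hypothetical lookup table: normalized alias -> internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}

def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(normalize(name))

print(canonical_name("IBM_1208"))       # matches the alias ibm-1208
print(canonical_name("windows-65001"))
print(canonical_name("latin1"))         # not in this alias list
```

A real application would ask the conversion library for the canonical name rather than maintain its own table; the dictionary here only makes the many-to-one relationship visible.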


Codepage Layout

Currently showing the codepage starting with the bytes F3A4A4. Rows give the high nibble of the trailing byte and columns the low nibble; each cell is the Unicode code point (hex) of the mapped character. Rows 00-70 and C0-F0 are empty.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  E4900 E4901 E4902 E4903 E4904 E4905 E4906 E4907 E4908 E4909 E490A E490B E490C E490D E490E E490F
90  E4910 E4911 E4912 E4913 E4914 E4915 E4916 E4917 E4918 E4919 E491A E491B E491C E491D E491E E491F
A0  E4920 E4921 E4922 E4923 E4924 E4925 E4926 E4927 E4928 E4929 E492A E492B E492C E492D E492E E492F
B0  E4930 E4931 E4932 E4933 E4934 E4935 E4936 E4937 E4938 E4939 E493A E493B E493C E493D E493E E493F
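
The layout above can be reproduced with any UTF-8 decoder. The sketch below uses Python's built-in codec rather than ICU, appends each trailing byte to the lead bytes F3 A4 A4, and prints the resulting code points. The helper names `decode_cell` and `manual_decode` are invented for this example; `manual_decode` shows the bit arithmetic for a four-byte sequence and does no validation.

```python
LEAD = bytes.fromhex("F3A4A4")  # lead bytes shown on the page

def decode_cell(trail: int) -> int:
    """Return the code point for LEAD followed by one trailing byte."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))

def manual_decode(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence by bit arithmetic (no validation)."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )

def is_valid_trail(trail: int) -> bool:
    """True if LEAD + trail is a well-formed UTF-8 sequence."""
    try:
        decode_cell(trail)
        return True
    except UnicodeDecodeError:
        return False

# Only trailing bytes 80..BF are continuation bytes, so only those rows are filled.
for row in range(0x80, 0xC0, 0x10):
    cells = [f"{decode_cell(row + col):05X}" for col in range(16)]
    print(f"{row:02X}  " + " ".join(cells))
```

Trailing bytes outside 80-BF are not valid continuation bytes, which is why the other rows of the layout are empty.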

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
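
A few of these properties can be checked with Python's built-in UTF-8 codec, as sketched below. The sketch assumes that "UChar" means one 16-bit UTF-16 code unit, which is how ICU defines the type, and reads the minimum and maximum as bytes produced per code unit. The helper name `utf8_bytes_and_utf16_units` is invented for this example.

```python
# The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
SUBSTITUTION = b"\xEF\xBF\xBD"
print(f"U+{ord(SUBSTITUTION.decode('utf-8')):04X}")

def utf8_bytes_and_utf16_units(ch: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units

# One sample per UTF-8 sequence length.
for ch in ["A", "\u00E9", "\u20AC", "\U0001F600"]:
    n_bytes, n_units = utf8_bytes_and_utf16_units(ch)
    print(f"U+{ord(ch):04X}: {n_bytes} bytes, {n_units} UTF-16 unit(s)")

# ASCII compatibility: each byte in 20..7E encodes to itself.
ascii_compatible = all(
    chr(b).encode("utf-8") == bytes([b]) for b in range(0x20, 0x7F)
)
print(ascii_compatible)
```

A supplementary character takes four UTF-8 bytes but also two UTF-16 code units, so the per-unit maximum stays at three, reached by BMP characters such as U+20AC.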

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
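
The set above excludes only the surrogate code points U+D800 through U+DFFF. The sketch below checks this with Python's strict UTF-8 codec, which also refuses to encode surrogates; it is a stand-in for illustration, not ICU itself, and the helper name `representable` is invented for this example.

```python
def representable(cp: int) -> bool:
    """True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# Probe both edges of the surrogate range and the top of the code space.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {representable(cp)}")

# Count the excluded code points in a window around the surrogate block.
excluded = [cp for cp in range(0xD000, 0xF000) if not representable(cp)]
print(len(excluded), hex(excluded[0]), hex(excluded[-1]))
```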