International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F19CA4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񜤀
5C900
񜤁
5C901
񜤂
5C902
񜤃
5C903
񜤄
5C904
񜤅
5C905
񜤆
5C906
񜤇
5C907
񜤈
5C908
񜤉
5C909
񜤊
5C90A
񜤋
5C90B
񜤌
5C90C
񜤍
5C90D
񜤎
5C90E
񜤏
5C90F
80
90
񜤐
5C910
񜤑
5C911
񜤒
5C912
񜤓
5C913
񜤔
5C914
񜤕
5C915
񜤖
5C916
񜤗
5C917
񜤘
5C918
񜤙
5C919
񜤚
5C91A
񜤛
5C91B
񜤜
5C91C
񜤝
5C91D
񜤞
5C91E
񜤟
5C91F
90
A0
񜤠
5C920
񜤡
5C921
񜤢
5C922
񜤣
5C923
񜤤
5C924
񜤥
5C925
񜤦
5C926
񜤧
5C927
񜤨
5C928
񜤩
5C929
񜤪
5C92A
񜤫
5C92B
񜤬
5C92C
񜤭
5C92D
񜤮
5C92E
񜤯
5C92F
A0
B0
񜤰
5C930
񜤱
5C931
񜤲
5C932
񜤳
5C933
񜤴
5C934
񜤵
5C935
񜤶
5C936
񜤷
5C937
񜤸
5C938
񜤹
5C939
񜤺
5C93A
񜤻
5C93B
񜤼
5C93C
񜤽
5C93D
񜤾
5C93E
񜤿
5C93F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]