International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
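
As a rough illustration of how an alias list like this can be used, the sketch below maps any listed alias back to the internal converter name. It is not ICU's implementation: the names `UTF8_ALIASES`, `normalize`, and `canonical_name` are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption.

```python
# Aliases copied from the list above; the lookup logic is illustrative only.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lower-case the name and drop '-', '_' and spaces (assumed loose matching)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every normalized alias points at the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))
```

With this table, `canonical_name("IBM_1208")` and `canonical_name("x-utf-8j")` both resolve to `UTF-8`, while a name that is not in the list resolves to `None`.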


Codepage Layout

Currently showing the codepage starting with the bytes F2A2A4

Rows 00-70 and C0-F0 contain no entries. Each cell below gives the code point (hex) for the final byte formed by row + column.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  A2900 A2901 A2902 A2903 A2904 A2905 A2906 A2907 A2908 A2909 A290A A290B A290C A290D A290E A290F
90  A2910 A2911 A2912 A2913 A2914 A2915 A2916 A2917 A2918 A2919 A291A A291B A291C A291D A291E A291F
A0  A2920 A2921 A2922 A2923 A2924 A2925 A2926 A2927 A2928 A2929 A292A A292B A292C A292D A292E A292F
B0  A2930 A2931 A2932 A2933 A2934 A2935 A2936 A2937 A2938 A2939 A293A A293B A293C A293D A293E A293F
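
The layout above can be reproduced from the UTF-8 bit layout for four-byte sequences. The following is a minimal sketch, assuming the input is already a well-formed sequence (no validation is done); `decode_4byte` is a hypothetical helper name, not part of ICU.

```python
def decode_4byte(prefix: bytes, trail: int) -> int:
    """Combine a 3-byte prefix and one trail byte into a code point.

    Assumes a well-formed 4-byte UTF-8 sequence; no validation is performed.
    """
    b0, b1, b2 = prefix
    return (
        ((b0 & 0x07) << 18)      # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)    # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )


prefix = bytes.fromhex("F2A2A4")
first = decode_4byte(prefix, 0x80)
last = decode_4byte(prefix, 0xBF)
print(f"{first:X}..{last:X}")  # A2900..A293F

# Cross-check against Python's built-in UTF-8 codec.
assert ord((prefix + b"\x80").decode("utf-8")) == first
```

The trail bytes 80 through BF cover exactly the 64 code points shown in the four populated rows.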

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
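
Two of the values above can be checked with Python's standard codecs. This is only a sketch using Python's UTF-8 and UTF-16 encoders, not ICU; it assumes that "UChar" means one UTF-16 code unit, and `bytes_per_utf16_unit` is a hypothetical helper name.

```python
# The substitution bytes are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.
assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 bytes needed per UTF-16 code unit for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


print(bytes_per_utf16_unit("A"))           # 1.0  (1 byte, 1 code unit)
print(bytes_per_utf16_unit("\u20ac"))      # 3.0  (3 bytes, 1 code unit)
print(bytes_per_utf16_unit("\U000A2900"))  # 2.0  (4 bytes, 2 code units)
```

A supplementary character takes four bytes but also occupies two UTF-16 code units, which is why the per-UChar maximum stays at 3.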

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
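
The set above excludes only the surrogate range. As a hedged illustration, the sketch below tests membership directly and compares it with what Python's own UTF-8 encoder accepts; both function names are hypothetical, and the Python codec is used only as a stand-in for the converter.

```python
def is_representable(code_point: int) -> bool:
    """True for any Unicode code point outside the surrogate range D800..DFFF."""
    return 0 <= code_point <= 0x10FFFF and not (0xD800 <= code_point <= 0xDFFF)


def encodes_as_utf8(code_point: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xA2900, 0x10FFFF):
    # The set membership test and the encoder agree on every sample.
    assert is_representable(cp) == encodes_as_utf8(cp)
```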