International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F290A4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򐤀
90900
򐤁
90901
򐤂
90902
򐤃
90903
򐤄
90904
򐤅
90905
򐤆
90906
򐤇
90907
򐤈
90908
򐤉
90909
򐤊
9090A
򐤋
9090B
򐤌
9090C
򐤍
9090D
򐤎
9090E
򐤏
9090F
80
90
򐤐
90910
򐤑
90911
򐤒
90912
򐤓
90913
򐤔
90914
򐤕
90915
򐤖
90916
򐤗
90917
򐤘
90918
򐤙
90919
򐤚
9091A
򐤛
9091B
򐤜
9091C
򐤝
9091D
򐤞
9091E
򐤟
9091F
90
A0
򐤠
90920
򐤡
90921
򐤢
90922
򐤣
90923
򐤤
90924
򐤥
90925
򐤦
90926
򐤧
90927
򐤨
90928
򐤩
90929
򐤪
9092A
򐤫
9092B
򐤬
9092C
򐤭
9092D
򐤮
9092E
򐤯
9092F
A0
B0
򐤰
90930
򐤱
90931
򐤲
90932
򐤳
90933
򐤴
90934
򐤵
90935
򐤶
90936
򐤷
90937
򐤸
90938
򐤹
90939
򐤺
9093A
򐤻
9093B
򐤼
9093C
򐤽
9093D
򐤾
9093E
򐤿
9093F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]