International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
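
Every name in the alias list selects the same converter. The following is a minimal Python sketch of such a lookup, not ICU's implementation: the `resolve` helper is hypothetical, and the matching rule (ignore case, hyphens, underscores and spaces) is a simplifying assumption made for illustration.

```python
from typing import Optional

# Alias list copied from the converter page; CANONICAL is the internal name.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def _key(name: str) -> str:
    """Simplified comparison key (an assumption): ignore case, '-', '_' and spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the canonical converter name.
_TABLE = {_key(alias): CANONICAL for alias in ALIASES}


def resolve(name: str) -> Optional[str]:
    """Return the canonical converter name for an alias, or None if unknown."""
    return _TABLE.get(_key(name))
```

For example, `resolve("IBM-1208")` and `resolve("cp1208")` both return `"UTF-8"`, while a name outside the list returns `None`.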


Codepage Layout

Currently showing the codepage starting with the bytes F398A4

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  D8900 D8901 D8902 D8903 D8904 D8905 D8906 D8907 D8908 D8909 D890A D890B D890C D890D D890E D890F
90  D8910 D8911 D8912 D8913 D8914 D8915 D8916 D8917 D8918 D8919 D891A D891B D891C D891D D891E D891F
A0  D8920 D8921 D8922 D8923 D8924 D8925 D8926 D8927 D8928 D8929 D892A D892B D892C D892D D892E D892F
B0  D8930 D8931 D8932 D8933 D8934 D8935 D8936 D8937 D8938 D8939 D893A D893B D893C D893D D893E D893F
C0
D0
E0
F0
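
The populated cells follow directly from the UTF-8 bit layout: with the lead bytes F3 98 A4 fixed, the final byte supplies the low six bits of the code point. The Python sketch below reproduces the grid values and cross-checks them against Python's built-in decoder. The helper name `decode_utf8_4byte` is hypothetical, and the function assumes its input is already a well-formed 4-byte sequence (it performs no validation).

```python
def decode_utf8_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point.

    Layout: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx (3 + 6 + 6 + 6 payload bits).
    """
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


PREFIX = (0xF3, 0x98, 0xA4)  # the three lead bytes shown by the demo

# Trail bytes 0x80..0xBF are the only valid final bytes, which is why
# only rows 80 through B0 of the grid are populated.
grid = {b3: decode_utf8_4byte(*PREFIX, b3) for b3 in range(0x80, 0xC0)}

# Cross-check the manual bit arithmetic against Python's built-in decoder.
for b3, cp in grid.items():
    assert ord(bytes([*PREFIX, b3]).decode("utf-8")) == cp

print(f"{grid[0x80]:X} .. {grid[0xBF]:X}")  # D8900 .. D893F
```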

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
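
Several of these properties can be checked with plain Python, taking a UChar to be one UTF-16 code unit as in ICU. The sketch below is illustrative only: it uses Python's own codecs rather than ICU, and the helper `bytes_per_utf16_unit` is a hypothetical name. It shows why the maximum is 3 and not 4: a 4-byte UTF-8 sequence corresponds to two UTF-16 code units, so it costs only 2 bytes per unit.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code-unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample from each UTF-8 length class: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00E9", "\u20AC", "\U000D8900"]
ratios = [bytes_per_utf16_unit(ch) for ch in samples]
# ratios == [1.0, 2.0, 3.0, 2.0], so the minimum is 1 and the maximum is 3

# The substitution character \xEF\xBF\xBD is U+FFFD encoded as UTF-8.
subst = "\uFFFD".encode("utf-8")

# Python's decoder with errors="replace" also substitutes U+FFFD for invalid input.
replaced = b"\xFF".decode("utf-8", errors="replace")

# ASCII compatibility: each byte in 0x20..0x7E encodes the same-valued code point.
ascii_compatible = all(chr(c).encode("utf-8") == bytes([c]) for c in range(0x20, 0x7F))
```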

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
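
The set `[^\uD800-\uDFFF]` means every code point except the surrogate range. A small Python sketch illustrates the same boundary using Python's strict UTF-8 codec, which likewise refuses lone surrogates; `is_representable` is a hypothetical helper, not an ICU function.

```python
def is_representable(cp: int) -> bool:
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates U+D800..U+DFFF.
        return False


# Probe both edges of the excluded range and the top of the code space.
edges = {cp: is_representable(cp) for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF)}
```

Here `edges` maps U+D7FF, U+E000 and U+10FFFF to `True`, and U+D800 and U+DFFF to `False`.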