International Components for Unicode

UTF-8

List of Converter Aliases

Internal converter name: UTF-8

All aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
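
An alias list like this is typically used to map whatever name a caller supplies onto the internal converter name. The following is a minimal sketch of such a lookup, not ICU's implementation. The helper names `normalize` and `canonical_name` are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption; ICU's real name comparison has further rules.

```python
from typing import Optional

# Aliases copied from the list above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (assumed, simplified rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


print(canonical_name("IBM_1208"))   # UTF-8
print(canonical_name("x-utf_8j"))   # UTF-8
print(canonical_name("latin-1"))    # None
```

Normalizing both the stored aliases and the query keeps the table small: one entry per alias covers every case and punctuation variant that the assumed rule treats as equal.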


Codepage Layout

Currently showing the codepage starting with the bytes F2 A8 A4.
The row label plus the column label gives the trailing byte; each cell gives the Unicode code point (hex) encoded by F2 A8 A4 followed by that byte.
Rows 00 through 70 and C0 through F0 are empty.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  A8900 A8901 A8902 A8903 A8904 A8905 A8906 A8907 A8908 A8909 A890A A890B A890C A890D A890E A890F
90  A8910 A8911 A8912 A8913 A8914 A8915 A8916 A8917 A8918 A8919 A891A A891B A891C A891D A891E A891F
A0  A8920 A8921 A8922 A8923 A8924 A8925 A8926 A8927 A8928 A8929 A892A A892B A892C A892D A892E A892F
B0  A8930 A8931 A8932 A8933 A8934 A8935 A8936 A8937 A8938 A8939 A893A A893B A893C A893D A893E A893F
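
The mapping from the lead bytes F2 A8 A4 plus a trailing byte to code points U+A8900 through U+A893F follows from the standard UTF-8 bit layout for four-byte sequences. The sketch below reproduces that arithmetic; the function name `decode_utf8_4byte` is hypothetical and is not part of ICU.

```python
def decode_utf8_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Decode one four-byte UTF-8 sequence into a code point.

    Layout: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx (21 payload bits).
    """
    if not 0xF0 <= b0 <= 0xF4:
        raise ValueError("not a four-byte lead byte")
    for b in (b1, b2, b3):
        if not 0x80 <= b <= 0xBF:
            raise ValueError("continuation bytes must be in 80..BF")
    cp = ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("overlong or out-of-range sequence")
    return cp


first = decode_utf8_4byte(0xF2, 0xA8, 0xA4, 0x80)
last = decode_utf8_4byte(0xF2, 0xA8, 0xA4, 0xBF)
print(f"U+{first:X} .. U+{last:X}")  # U+A8900 .. U+A893F

# Cross-check against Python's built-in UTF-8 decoder.
assert bytes([0xF2, 0xA8, 0xA4, 0x80]).decode("utf-8") == chr(first)
```

Only trailing bytes 80 through BF carry the continuation marker `10xxxxxx`, which is why just four of the sixteen rows in the layout are populated.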

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
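
The figures of 1 and 3 bytes are per UChar, that is, per UTF-16 code unit, not per code point: a supplementary character takes four UTF-8 bytes but also two UTF-16 code units. The sketch below checks this with Python's own codecs rather than ICU, and the helper name `bytes_per_utf16_unit` is hypothetical. It also confirms that the substitution bytes are the UTF-8 encoding of U+FFFD.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


samples = {
    "U+0041": "A",            # 1 byte, 1 code unit
    "U+00E9": "\u00e9",       # 2 bytes, 1 code unit
    "U+20AC": "\u20ac",       # 3 bytes, 1 code unit
    "U+A8900": "\U000A8900",  # 4 bytes, 2 code units
}
for label, ch in samples.items():
    print(label, bytes_per_utf16_unit(ch))

# The substitution character listed above is U+FFFD encoded as UTF-8.
assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"
```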

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
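
The set pattern above says that every code point except the surrogate range U+D800 through U+DFFF can be represented. A minimal sketch of that membership test follows, compared against Python's strict UTF-8 encoder as a stand-in for ICU; both function names are hypothetical.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not 0xD800 <= cp <= 0xDFFF


def encodes_in_python(cp: int) -> bool:
    """True if Python's strict UTF-8 codec accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the boundaries of the excluded range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xA8900, 0x10FFFF):
    assert is_representable(cp) == encodes_in_python(cp)
    print(f"U+{cp:04X}", is_representable(cp))
```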