International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name   IANA
UTF-8                     UTF-8


Codepage Layout

Currently showing the codepage for byte sequences starting with the bytes F4 8A 99

Each cell gives the Unicode code point (hexadecimal) for the final byte obtained by adding the row and column labels.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80     10A640 10A641 10A642 10A643 10A644 10A645 10A646 10A647 10A648 10A649 10A64A 10A64B 10A64C 10A64D 10A64E 10A64F
90     10A650 10A651 10A652 10A653 10A654 10A655 10A656 10A657 10A658 10A659 10A65A 10A65B 10A65C 10A65D 10A65E 10A65F
A0     10A660 10A661 10A662 10A663 10A664 10A665 10A666 10A667 10A668 10A669 10A66A 10A66B 10A66C 10A66D 10A66E 10A66F
B0     10A670 10A671 10A672 10A673 10A674 10A675 10A676 10A677 10A678 10A679 10A67A 10A67B 10A67C 10A67D 10A67E 10A67F
C0-F0  (empty)
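
The layout above can be checked with a little bit arithmetic. The following is a minimal Python sketch, not ICU code: `decode_utf8_4byte` and `PREFIX` are hypothetical names introduced here for illustration. It applies the standard 4-byte UTF-8 bit pattern to the prefix F4 8A 99 plus a final byte, and shows why only final bytes 80 through BF produce entries in the grid.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point (illustrative helper)."""
    # Lead byte must look like 11110xxx, trail bytes like 10xxxxxx.
    if len(b) != 4 or b[0] & 0xF8 != 0xF0 or any(t & 0xC0 != 0x80 for t in b[1:]):
        raise ValueError("not a 4-byte UTF-8 bit pattern")
    cp = ((b[0] & 0x07) << 18) | ((b[1] & 0x3F) << 12) \
        | ((b[2] & 0x3F) << 6) | (b[3] & 0x3F)
    # Reject overlong forms and values beyond the Unicode range.
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("overlong or out-of-range sequence")
    return cp

PREFIX = bytes.fromhex("F48A99")  # the three leading bytes shown on this page

first = decode_utf8_4byte(PREFIX + b"\x80")
last = decode_utf8_4byte(PREFIX + b"\xBF")
print(f"{first:06X}..{last:06X}")  # 10A640..10A67F

# Cross-check against Python's built-in UTF-8 codec.
assert (PREFIX + b"\x80").decode("utf-8") == chr(first)
```

A final byte outside 80..BF is not a valid trail byte, so rows 00-70 and C0-F0 stay empty.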

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
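
The byte counts and the substitution character can be illustrated with Python's own UTF-8 codec, which is an assumption of this sketch rather than ICU itself. A UChar in ICU is a 16-bit UTF-16 code unit, so one plausible reading of the 1 and 3 above is "UTF-8 bytes per UTF-16 code unit": a 4-byte sequence corresponds to two code units, and therefore never exceeds 3 bytes per unit. `bytes_per_utf16_unit` and `SUB` are hypothetical names.

```python
SUB = b"\xEF\xBF\xBD"  # substitution character bytes listed above

def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units

# The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
assert "\uFFFD".encode("utf-8") == SUB

# Python's decoder substitutes the same character for an invalid byte.
assert b"\xFF".decode("utf-8", errors="replace") == "\uFFFD"

samples = ["A", "\u00E9", "\u20AC", chr(0x10A640)]
ratios = [bytes_per_utf16_unit(c) for c in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]
```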


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
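
The set above excludes only the surrogate code points U+D800 through U+DFFF. As a rough check, the sketch below probes the boundaries with Python's strict UTF-8 encoder. This assumes Python's codec agrees with ICU on this point, and `representable` is a hypothetical helper, not an ICU function.

```python
def representable(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# Probe just outside and just inside the surrogate range, plus the last code point.
probes = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
results = {f"{cp:04X}": representable(cp) for cp in probes}
print(results)
```

Only the two surrogate probes, D800 and DFFF, are rejected. Every other code point up to 10FFFF encodes successfully.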