International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
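
The alias list above can be treated as a lookup table that maps any alias to the internal converter name. The following is a minimal sketch, not ICU's implementation: the names `normalize`, `ALIAS_TABLE`, and `resolve` are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption.

```python
from typing import Optional

# Aliases copied from the list above; everything else here is illustrative.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name: str) -> str:
    """Build a lookup key: lowercase, drop '-', '_' and spaces (assumed rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")

# Every alias resolves to the same internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}

def resolve(name: str) -> Optional[str]:
    """Return the internal converter name, or None for an unknown alias."""
    return ALIAS_TABLE.get(normalize(name))

print(resolve("IBM-1208"))   # UTF-8
print(resolve("x-utf_8j"))   # UTF-8
print(resolve("latin-1"))    # None
```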


Codepage Layout

Currently showing the codepage starting with the bytes F397A1

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70 (empty)
80    D7840 D7841 D7842 D7843 D7844 D7845 D7846 D7847 D7848 D7849 D784A D784B D784C D784D D784E D784F
90    D7850 D7851 D7852 D7853 D7854 D7855 D7856 D7857 D7858 D7859 D785A D785B D785C D785D D785E D785F
A0    D7860 D7861 D7862 D7863 D7864 D7865 D7866 D7867 D7868 D7869 D786A D786B D786C D786D D786E D786F
B0    D7870 D7871 D7872 D7873 D7874 D7875 D7876 D7877 D7878 D7879 D787A D787B D787C D787D D787E D787F
C0-F0 (empty)
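
The code points in the layout follow from the UTF-8 bit layout of a four-byte sequence: the lead bytes F3 97 A1 fix the upper bits, and the final byte (80 through BF) supplies the low six bits. A minimal sketch of that arithmetic is given below; `decode_utf8_4byte` is a hypothetical helper that handles only well-formed four-byte sequences and does not check for overlong forms or values above U+10FFFF.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence (11110xxx 10xxxxxx 10xxxxxx 10xxxxxx)."""
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("not a 4-byte UTF-8 lead byte")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    # 3 payload bits from the lead byte, then 6 bits from each trail byte.
    return (((b[0] & 0x07) << 18) | ((b[1] & 0x3F) << 12)
            | ((b[2] & 0x3F) << 6) | (b[3] & 0x3F))

first = decode_utf8_4byte(bytes.fromhex("F397A180"))
last = decode_utf8_4byte(bytes.fromhex("F397A1BF"))
print(f"U+{first:X} .. U+{last:X}")   # U+D7840 .. U+D787F

# Cross-check against Python's built-in UTF-8 codec.
print(bytes.fromhex("F397A180").decode("utf-8") == chr(first))   # True
```

Only final bytes 80 through BF are valid trail bytes, which is consistent with rows 00 through 70 and C0 through F0 being empty.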

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
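
The byte counts above are given per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. A short sketch using Python's built-in codecs (not ICU itself) illustrates why the maximum is 3 even though some characters take four UTF-8 bytes, and shows that the substitution bytes are the encoding of U+FFFD. The helper `bytes_per_uchar` is hypothetical.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2   # 2 bytes per UTF-16 code unit
    return utf8_len / uchars

print(bytes_per_uchar("A"))            # 1.0  (1 byte, 1 code unit)
print(bytes_per_uchar("\uFFFF"))       # 3.0  (3 bytes, 1 code unit)
print(bytes_per_uchar("\U000D7840"))   # 2.0  (4 bytes, 2 code units)

# The substitution character listed above is U+FFFD encoded as UTF-8.
SUBSTITUTION = "\uFFFD".encode("utf-8")
print(SUBSTITUTION)                    # b'\xef\xbf\xbd'
```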

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
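
The set pattern above excludes only the surrogate range U+D800 through U+DFFF. As a rough illustration, the sketch below expresses the same membership test directly and compares it with the behavior of Python's strict UTF-8 codec, which also refuses lone surrogates. The function names `representable` and `codec_accepts` are hypothetical.

```python
def representable(cp: int) -> bool:
    """Membership test for [^\\uD800-\\uDFFF] over the Unicode code space."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)

def codec_accepts(cp: int) -> bool:
    """Check whether Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# Boundary cases around the surrogate block, plus a code point from the layout.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xD7840):
    print(f"U+{cp:04X}", representable(cp), codec_accepts(cp))
```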