International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Showing the codepage for byte sequences starting with the bytes F3 97 99

Each cell gives the Unicode code point (hexadecimal) for the final byte formed by the row value plus the column value. Rows 00 through 70 and C0 through F0 contain no entries.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  D7640 D7641 D7642 D7643 D7644 D7645 D7646 D7647 D7648 D7649 D764A D764B D764C D764D D764E D764F
90  D7650 D7651 D7652 D7653 D7654 D7655 D7656 D7657 D7658 D7659 D765A D765B D765C D765D D765E D765F
A0  D7660 D7661 D7662 D7663 D7664 D7665 D7666 D7667 D7668 D7669 D766A D766B D766C D766D D766E D766F
B0  D7670 D7671 D7672 D7673 D7674 D7675 D7676 D7677 D7678 D7679 D767A D767B D767C D767D D767E D767F
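
The code points in the layout follow from the UTF-8 bit layout for four-byte sequences. As a minimal sketch, the snippet below combines the lead bytes F3 97 99 with a trailing byte and cross-checks the result. It uses Python's built-in UTF-8 codec rather than ICU, and `decode_four_byte` is a hypothetical helper introduced only for illustration.

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence.

    Hypothetical helper for illustration; it performs no validation.
    """
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte 11110xxx
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


LEAD = (0xF3, 0x97, 0x99)  # the three bytes named in the layout heading

first = decode_four_byte(*LEAD, 0x80)
last = decode_four_byte(*LEAD, 0xBF)
print(f"U+{first:X} .. U+{last:X}")  # prints "U+D7640 .. U+D767F"

# Cross-check every trailing byte against Python's own UTF-8 decoder.
for trail in range(0x80, 0xC0):
    decoded = bytes([*LEAD, trail]).decode("utf-8")
    assert ord(decoded) == decode_four_byte(*LEAD, trail)
```

Only trailing bytes 80 through BF are valid continuation bytes, which matches the four populated rows in the layout.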

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
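
The byte counts above are per UChar, which in ICU is a UTF-16 code unit, not a code point. The sketch below illustrates why the maximum is 3 even though a supplementary character takes four UTF-8 bytes: those four bytes are spread over two UTF-16 code units. It uses Python's built-in codecs rather than ICU, and `bytes_per_utf16_unit` is a hypothetical helper.

```python
# The substitution character listed above is the UTF-8 form of U+FFFD.
SUBSTITUTION = b"\xEF\xBF\xBD"
assert SUBSTITUTION.decode("utf-8") == "\uFFFD"


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for one character.

    Hypothetical helper for illustration only.
    """
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


print(bytes_per_utf16_unit("A"))           # 1.0 (1 byte, 1 code unit)
print(bytes_per_utf16_unit("\uFFFD"))      # 3.0 (3 bytes, 1 code unit)
print(bytes_per_utf16_unit("\U000D7640"))  # 2.0 (4 bytes, 2 code units)
```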

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
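
The set above excludes only the surrogate range U+D800 through U+DFFF. As a rough check, the sketch below tests code points on either side of that range using Python's strict UTF-8 encoder, which is assumed here to agree with the ICU converter on this point. `is_representable` is a hypothetical helper.

```python
def is_representable(code_point: int) -> bool:
    """Return True if the code point encodes to strict UTF-8.

    Hypothetical helper; relies on Python's built-in codec, not ICU.
    """
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 encoder rejects lone surrogates.
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xD7640, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```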