International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B499

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𴙀
34640
𴙁
34641
𴙂
34642
𴙃
34643
𴙄
34644
𴙅
34645
𴙆
34646
𴙇
34647
𴙈
34648
𴙉
34649
𴙊
3464A
𴙋
3464B
𴙌
3464C
𴙍
3464D
𴙎
3464E
𴙏
3464F
80
90
𴙐
34650
𴙑
34651
𴙒
34652
𴙓
34653
𴙔
34654
𴙕
34655
𴙖
34656
𴙗
34657
𴙘
34658
𴙙
34659
𴙚
3465A
𴙛
3465B
𴙜
3465C
𴙝
3465D
𴙞
3465E
𴙟
3465F
90
A0
𴙠
34660
𴙡
34661
𴙢
34662
𴙣
34663
𴙤
34664
𴙥
34665
𴙦
34666
𴙧
34667
𴙨
34668
𴙩
34669
𴙪
3466A
𴙫
3466B
𴙬
3466C
𴙭
3466D
𴙮
3466E
𴙯
3466F
A0
B0
𴙰
34670
𴙱
34671
𴙲
34672
𴙳
34673
𴙴
34674
𴙵
34675
𴙶
34676
𴙷
34677
𴙸
34678
𴙹
34679
𴙺
3467A
𴙻
3467B
𴙼
3467C
𴙽
3467D
𴙾
3467E
𴙿
3467F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]