International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39883

Rows give the high nibble of the final byte and columns the low nibble. Each cell is the Unicode code point (hexadecimal) for that byte sequence.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (no mappings)
80     D80C0 D80C1 D80C2 D80C3 D80C4 D80C5 D80C6 D80C7 D80C8 D80C9 D80CA D80CB D80CC D80CD D80CE D80CF
90     D80D0 D80D1 D80D2 D80D3 D80D4 D80D5 D80D6 D80D7 D80D8 D80D9 D80DA D80DB D80DC D80DD D80DE D80DF
A0     D80E0 D80E1 D80E2 D80E3 D80E4 D80E5 D80E6 D80E7 D80E8 D80E9 D80EA D80EB D80EC D80ED D80EE D80EF
B0     D80F0 D80F1 D80F2 D80F3 D80F4 D80F5 D80F6 D80F7 D80F8 D80F9 D80FA D80FB D80FC D80FD D80FE D80FF
C0-F0  (no mappings)
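
The mapping in the layout above follows from the UTF-8 bit layout: a 4-byte sequence carries 3 payload bits in the lead byte and 6 in each trail byte. The following is a minimal Python sketch of that arithmetic, not ICU's own implementation. The helper name `decode_utf8_4byte` is hypothetical, and it checks only the lead and trail bit patterns (it does not reject overlong forms or values above U+10FFFF).

```python
def decode_utf8_4byte(data: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point.

    Illustrative only: validates the lead/trail bit patterns, nothing more.
    """
    if len(data) != 4 or data[0] & 0xF8 != 0xF0:
        raise ValueError("expected a 4-byte sequence with lead byte F0..F7")
    cp = data[0] & 0x07              # 3 payload bits from the lead byte
    for b in data[1:]:
        if b & 0xC0 != 0x80:         # trail bytes must be in 80..BF
            raise ValueError("trail bytes must be in 80..BF")
        cp = (cp << 6) | (b & 0x3F)  # 6 payload bits from each trail byte
    return cp


prefix = bytes.fromhex("F39883")     # the prefix shown on this page
first = decode_utf8_4byte(prefix + b"\x80")
last = decode_utf8_4byte(prefix + b"\xBF")
print(f"{first:X} {last:X}")         # prints "D80C0 D80FF"
```

Only final bytes 80 through BF are valid trail bytes, which is why the other rows of the grid have no mappings.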

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
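
Some of these figures can be sanity-checked with Python's built-in UTF-8 codec, used here as a stand-in for ICU (an assumption: the two are separate implementations and may differ in edge cases). In ICU a UChar is a 16-bit UTF-16 code unit, so a supplementary character occupies two UChars and four UTF-8 bytes, which keeps the per-UChar maximum at 3. The substitution bytes EF BF BD are the UTF-8 form of U+FFFD. The sample characters below are arbitrary choices.

```python
# Sample characters: ASCII, Latin-1 range, BMP, and supplementary plane.
samples = ["A", "\u00e9", "\u20ac", "\U0001F600"]

ratios = []
for ch in samples:
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2   # number of 16-bit code units
    ratios.append(utf8_len / uchars)
    print(f"U+{ord(ch):04X}: {utf8_len} byte(s) for {uchars} UChar(s)")

# The substitution character listed above is U+FFFD encoded as UTF-8.
sub = "\ufffd".encode("utf-8")
print(sub.hex().upper())                        # prints "EFBFBD"

# Python's "replace" error handler also yields U+FFFD for malformed input.
decoded = b"abc\xff".decode("utf-8", errors="replace")
```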

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
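
The set above excludes only the surrogate code points U+D800 through U+DFFF. As a rough illustration, the sketch below compares that rule against Python's strict UTF-8 encoder, which likewise refuses lone surrogates. The helper `is_representable` is hypothetical and simply restates the set as a range test; it is not an ICU API.

```python
def is_representable(ch: str) -> bool:
    """True if the code point lies outside the surrogate range D800..DFFF."""
    return not (0xD800 <= ord(ch) <= 0xDFFF)


def python_can_encode(ch: str) -> bool:
    """True if Python's strict UTF-8 encoder accepts the character."""
    try:
        ch.encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary cases on both sides of the surrogate block, plus ASCII and U+10FFFF.
probes = ["A", "\ud7ff", "\ud800", "\udfff", "\ue000", "\U0010FFFF"]
results = [(f"U+{ord(ch):04X}", is_representable(ch), python_can_encode(ch))
           for ch in probes]
for label, expected, actual in results:
    print(label, expected, actual)
```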