International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name    All Aliases
UTF-8                      UTF-8
                           ibm-1208
                           ibm-1209
                           ibm-5304
                           ibm-5305
                           ibm-13496
                           ibm-13497
                           ibm-17592
                           ibm-17593
                           windows-65001
                           cp1208
                           x-UTF_8J
                           unicode-1-1-utf-8
                           unicode-2-0-utf-8
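
The alias list above can be turned into a small lookup table. The following is a minimal sketch, not ICU's own alias resolution: the names `ALIASES`, `_key`, and `canonical_name` are hypothetical, and the loose matching rule (case-insensitive, ignoring "-" and "_") is an assumption made for this illustration.

```python
# Aliases copied from the list above; all of them name the UTF-8 converter.
ALIASES = (
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
)


def _key(name: str) -> str:
    """Normalize a name for loose comparison (assumed rule, see lead-in)."""
    return name.replace("-", "").replace("_", "").lower()


# Map every normalized alias to the internal converter name.
_TABLE = {_key(alias): "UTF-8" for alias in ALIASES}


def canonical_name(name: str):
    """Return "UTF-8" if name matches a listed alias, otherwise None."""
    return _TABLE.get(_key(name))


if __name__ == "__main__":
    for candidate in ("IBM-1208", "windows-65001", "latin-1"):
        print(candidate, "->", canonical_name(candidate))
```

Names that are not in the list, such as "latin-1", resolve to `None` in this sketch.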


Codepage Layout

Currently showing the codepage starting with the bytes F190B0

        00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70   (empty)
80      50C00 50C01 50C02 50C03 50C04 50C05 50C06 50C07 50C08 50C09 50C0A 50C0B 50C0C 50C0D 50C0E 50C0F
90      50C10 50C11 50C12 50C13 50C14 50C15 50C16 50C17 50C18 50C19 50C1A 50C1B 50C1C 50C1D 50C1E 50C1F
A0      50C20 50C21 50C22 50C23 50C24 50C25 50C26 50C27 50C28 50C29 50C2A 50C2B 50C2C 50C2D 50C2E 50C2F
B0      50C30 50C31 50C32 50C33 50C34 50C35 50C36 50C37 50C38 50C39 50C3A 50C3B 50C3C 50C3D 50C3E 50C3F
C0-F0   (empty)
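
The mapping in the layout above can be reproduced by decoding the lead bytes F1 90 B0 followed by each possible trail byte. The sketch below uses Python's built-in UTF-8 codec rather than ICU, and the helper names `cell_code_point` and `manual_code_point` are hypothetical; it is meant only to illustrate how the cell values arise.

```python
# Lead bytes shown by the page; the fourth (trail) byte selects the cell.
LEAD = bytes.fromhex("F190B0")


def cell_code_point(trail: int) -> int:
    """Return the code point encoded by LEAD followed by one trail byte."""
    if not 0x80 <= trail <= 0xBF:
        raise ValueError("UTF-8 trail bytes lie in 0x80..0xBF")
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def manual_code_point(seq: bytes) -> int:
    """Decode a 4-byte UTF-8 sequence by hand with bit arithmetic."""
    b0, b1, b2, b3 = seq
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


if __name__ == "__main__":
    # Print the four populated rows (80, 90, A0, B0) of the layout.
    for row in (0x80, 0x90, 0xA0, 0xB0):
        cells = " ".join(f"{cell_code_point(row + col):05X}" for col in range(16))
        print(f"{row:02X}  {cells}")
```

Rows 00 to 70 and C0 to F0 stay empty because bytes outside 80..BF cannot follow these lead bytes in a well-formed sequence.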

Information About This Converter
Type of converter:                        UCNV_UTF8
Minimum number of bytes per UChar:        1
Maximum number of bytes per UChar:        3
Substitution character:                   \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?          TRUE
Is ASCII [\u0020-\u007E] ambiguous?       FALSE
Contains ambiguous aliases?               FALSE
Always generates Unicode NFC?             UNKNOWN
Contains BiDi characters?                 TRUE
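
The byte counts and the substitution character above can be checked with a short sketch. It assumes that UChar denotes a 16-bit UTF-16 code unit, as in ICU's C API, and it uses Python's built-in codecs rather than ICU; the function names are hypothetical.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 bytes needed per UTF-16 code unit for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


def substitution_bytes() -> bytes:
    """Decode an invalid byte with replacement, then re-encode the result."""
    replaced = b"\xff".decode("utf-8", errors="replace")  # yields U+FFFD
    return replaced.encode("utf-8")


if __name__ == "__main__":
    # One BMP character per UTF-8 length, plus one supplementary character.
    for ch in ("A", "\u00e9", "\u20ac", "\U00050C00"):
        print(f"U+{ord(ch):04X}: {bytes_per_utf16_unit(ch)} bytes per UTF-16 unit")
    print("substitution:", substitution_bytes().hex(" ").upper())
```

A supplementary character takes four UTF-8 bytes but two UTF-16 code units, so the ratio per code unit peaks at 3 for characters in the range U+0800 to U+FFFF, which is consistent with the maximum listed above.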

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
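
The set above excludes only the surrogate range. As a minimal sketch, the following compares that rule with what Python's built-in UTF-8 codec accepts; it does not call ICU, and `is_representable` and `encodes_in_utf8` are hypothetical helper names.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not 0xD800 <= cp <= 0xDFFF


def encodes_in_utf8(cp: int) -> bool:
    """True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Boundary values around the surrogate block, plus a few others.
    for cp in (0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x50C00, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp), encodes_in_utf8(cp))
```

Both checks agree on the sampled code points: everything encodes except U+D800 through U+DFFF.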