International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F295B0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򕰀
95C00
򕰁
95C01
򕰂
95C02
򕰃
95C03
򕰄
95C04
򕰅
95C05
򕰆
95C06
򕰇
95C07
򕰈
95C08
򕰉
95C09
򕰊
95C0A
򕰋
95C0B
򕰌
95C0C
򕰍
95C0D
򕰎
95C0E
򕰏
95C0F
80
90
򕰐
95C10
򕰑
95C11
򕰒
95C12
򕰓
95C13
򕰔
95C14
򕰕
95C15
򕰖
95C16
򕰗
95C17
򕰘
95C18
򕰙
95C19
򕰚
95C1A
򕰛
95C1B
򕰜
95C1C
򕰝
95C1D
򕰞
95C1E
򕰟
95C1F
90
A0
򕰠
95C20
򕰡
95C21
򕰢
95C22
򕰣
95C23
򕰤
95C24
򕰥
95C25
򕰦
95C26
򕰧
95C27
򕰨
95C28
򕰩
95C29
򕰪
95C2A
򕰫
95C2B
򕰬
95C2C
򕰭
95C2D
򕰮
95C2E
򕰯
95C2F
A0
B0
򕰰
95C30
򕰱
95C31
򕰲
95C32
򕰳
95C33
򕰴
95C34
򕰵
95C35
򕰶
95C36
򕰷
95C37
򕰸
95C38
򕰹
95C39
򕰺
95C3A
򕰻
95C3B
򕰼
95C3C
򕰽
95C3D
򕰾
95C3E
򕰿
95C3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]