International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F480B0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􀰀
100C00
􀰁
100C01
􀰂
100C02
􀰃
100C03
􀰄
100C04
􀰅
100C05
􀰆
100C06
􀰇
100C07
􀰈
100C08
􀰉
100C09
􀰊
100C0A
􀰋
100C0B
􀰌
100C0C
􀰍
100C0D
􀰎
100C0E
􀰏
100C0F
80
90
􀰐
100C10
􀰑
100C11
􀰒
100C12
􀰓
100C13
􀰔
100C14
􀰕
100C15
􀰖
100C16
􀰗
100C17
􀰘
100C18
􀰙
100C19
􀰚
100C1A
􀰛
100C1B
􀰜
100C1C
􀰝
100C1D
􀰞
100C1E
􀰟
100C1F
90
A0
􀰠
100C20
􀰡
100C21
􀰢
100C22
􀰣
100C23
􀰤
100C24
􀰥
100C25
􀰦
100C26
􀰧
100C27
􀰨
100C28
􀰩
100C29
􀰪
100C2A
􀰫
100C2B
􀰬
100C2C
􀰭
100C2D
􀰮
100C2E
􀰯
100C2F
A0
B0
􀰰
100C30
􀰱
100C31
􀰲
100C32
􀰳
100C33
􀰴
100C34
􀰵
100C35
􀰶
100C36
􀰷
100C37
􀰸
100C38
􀰹
100C39
􀰺
100C3A
􀰻
100C3B
􀰼
100C3C
􀰽
100C3D
􀰾
100C3E
􀰿
100C3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]