International Components for Unicode

UTF-8

List of Converter Aliases
Internal converter name: UTF-8
All aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F092B0

       00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
00-70  (empty)
80     𒰀 12C00  𒰁 12C01  𒰂 12C02  𒰃 12C03  𒰄 12C04  𒰅 12C05  𒰆 12C06  𒰇 12C07  𒰈 12C08  𒰉 12C09  𒰊 12C0A  𒰋 12C0B  𒰌 12C0C  𒰍 12C0D  𒰎 12C0E  𒰏 12C0F
90     𒰐 12C10  𒰑 12C11  𒰒 12C12  𒰓 12C13  𒰔 12C14  𒰕 12C15  𒰖 12C16  𒰗 12C17  𒰘 12C18  𒰙 12C19  𒰚 12C1A  𒰛 12C1B  𒰜 12C1C  𒰝 12C1D  𒰞 12C1E  𒰟 12C1F
A0     𒰠 12C20  𒰡 12C21  𒰢 12C22  𒰣 12C23  𒰤 12C24  𒰥 12C25  𒰦 12C26  𒰧 12C27  𒰨 12C28  𒰩 12C29  𒰪 12C2A  𒰫 12C2B  𒰬 12C2C  𒰭 12C2D  𒰮 12C2E  𒰯 12C2F
B0     𒰰 12C30  𒰱 12C31  𒰲 12C32  𒰳 12C33  𒰴 12C34  𒰵 12C35  𒰶 12C36  𒰷 12C37  𒰸 12C38  𒰹 12C39  𒰺 12C3A  𒰻 12C3B  𒰼 12C3C  𒰽 12C3D  𒰾 12C3E  𒰿 12C3F
C0-F0  (empty)
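
The layout above can be reproduced with ordinary UTF-8 bit arithmetic. The following is a minimal sketch, not ICU's own implementation: it appends each possible final byte (80 to BF) to the prefix F0 92 B0 and decodes the resulting four-byte sequence. The names `decode_4byte`, `PREFIX`, and `layout_rows` are hypothetical helpers introduced for illustration.

```python
# Sketch only: standard 4-byte UTF-8 decoding, not ICU code.
# decode_4byte, PREFIX and layout_rows are illustrative names.

def decode_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point by bit arithmetic."""
    if len(seq) != 4:
        raise ValueError("expected exactly four bytes")
    b0, b1, b2, b3 = seq
    # Lead byte contributes 3 payload bits, each trail byte contributes 6.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


PREFIX = bytes([0xF0, 0x92, 0xB0])  # the three leading bytes shown on this page


def layout_rows() -> dict:
    """Map each row label (80, 90, A0, B0) to its 16 decoded code points."""
    rows = {}
    for high in (0x80, 0x90, 0xA0, 0xB0):
        rows[high] = [decode_4byte(PREFIX + bytes([high | low])) for low in range(16)]
    return rows


if __name__ == "__main__":
    for high, code_points in layout_rows().items():
        print(f"{high:02X}", " ".join(f"{cp:05X}" for cp in code_points))
```

Only final bytes in the range 80 to BF are valid trail bytes, which is consistent with rows 00 to 70 and C0 to F0 being empty in the layout.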

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
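
Some of the properties above can be checked with Python's standard codecs, used here as a stand-in for the ICU converter (an assumption; the two are separate implementations). The sketch relies on UChar being a 16-bit UTF-16 code unit, so a supplementary character occupies four UTF-8 bytes but two UChars. That is one plausible reading of why the maximum is listed as 3 rather than 4. The helper names are hypothetical.

```python
# Sketch only: uses Python's built-in codecs, not ICU.
# All function and constant names are illustrative.

SUBSTITUTION_BYTES = b"\xEF\xBF\xBD"  # value listed under "Substitution character"


def utf16_code_units(ch: str) -> int:
    """Number of 16-bit code units needed for one character."""
    return len(ch.encode("utf-16-le")) // 2


def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    return len(ch.encode("utf-8")) / utf16_code_units(ch)


def is_ascii_compatible() -> bool:
    """True if bytes 0x20..0x7E decode to the code points with the same values."""
    return all(bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F))


if __name__ == "__main__":
    print(f"substitution: U+{ord(SUBSTITUTION_BYTES.decode('utf-8')):04X}")
    for ch in ("A", "\u00E9", "\uFFFD", "\U00012C00"):
        print(f"U+{ord(ch):04X}", utf8_bytes_per_uchar(ch))
    print("ASCII compatible:", is_ascii_compatible())
```

The substitution bytes decode to U+FFFD, the Unicode replacement character.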

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
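
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check, the sketch below asks Python's strict UTF-8 codec whether a given code point can be encoded. Python is assumed here to agree with the ICU converter on this point, and `is_representable` is a hypothetical helper.

```python
# Sketch only: Python's strict UTF-8 encoder as a stand-in for ICU.
# is_representable is an illustrative name.

def is_representable(code_point: int) -> bool:
    """True if the strict UTF-8 codec can encode this code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for surrogate code points U+D800..U+DFFF.
        return False


if __name__ == "__main__":
    for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x12C00, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp))
```

The boundary values on either side of the surrogate block (U+D7FF and U+E000) encode normally, while U+D800 and U+DFFF are rejected.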