International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1B2B0

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  񲰀     񲰁     񲰂     񲰃     񲰄     񲰅     񲰆     񲰇     񲰈     񲰉     񲰊     񲰋     񲰌     񲰍     񲰎     񲰏
    72C00 72C01 72C02 72C03 72C04 72C05 72C06 72C07 72C08 72C09 72C0A 72C0B 72C0C 72C0D 72C0E 72C0F
90  񲰐     񲰑     񲰒     񲰓     񲰔     񲰕     񲰖     񲰗     񲰘     񲰙     񲰚     񲰛     񲰜     񲰝     񲰞     񲰟
    72C10 72C11 72C12 72C13 72C14 72C15 72C16 72C17 72C18 72C19 72C1A 72C1B 72C1C 72C1D 72C1E 72C1F
A0  񲰠     񲰡     񲰢     񲰣     񲰤     񲰥     񲰦     񲰧     񲰨     񲰩     񲰪     񲰫     񲰬     񲰭     񲰮     񲰯
    72C20 72C21 72C22 72C23 72C24 72C25 72C26 72C27 72C28 72C29 72C2A 72C2B 72C2C 72C2D 72C2E 72C2F
B0  񲰰     񲰱     񲰲     񲰳     񲰴     񲰵     񲰶     񲰷     񲰸     񲰹     񲰺     񲰻     񲰼     񲰽     񲰾     񲰿
    72C30 72C31 72C32 72C33 72C34 72C35 72C36 72C37 72C38 72C39 72C3A 72C3B 72C3C 72C3D 72C3E 72C3F
C0
D0
E0
F0
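
The layout can be read as: row label plus column label gives the final byte, which is appended to the lead bytes F1 B2 B0 to form one four-byte UTF-8 sequence. The following is a minimal Python sketch of that reading, using Python's built-in UTF-8 codec rather than ICU; the names LEAD, cell, and populated_rows are illustrative and not part of the demo.

```python
LEAD = bytes.fromhex("F1B2B0")  # lead bytes shown by the page


def cell(trail):
    """Return the code point (5 hex digits) for LEAD + trail byte,
    or '' when the resulting sequence is not valid UTF-8."""
    try:
        ch = (LEAD + bytes([trail])).decode("utf-8")
    except UnicodeDecodeError:
        return ""
    return f"{ord(ch):05X}"


def populated_rows():
    """Return the row labels (high nibble of the trail byte) that contain
    at least one valid character."""
    return [row for row in range(0x00, 0x100, 0x10)
            if any(cell(row + col) for col in range(16))]


if __name__ == "__main__":
    for row in range(0x00, 0x100, 0x10):
        cells = [cell(row + col) or "....." for col in range(16)]
        print(f"{row:02X}", " ".join(cells))
```

Only trail bytes 80 through BF are valid UTF-8 continuation bytes, which is why rows 00 to 70 and C0 to F0 are empty in the layout.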

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
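
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per character. One plausible reading of the maximum of 3 is that a four-byte UTF-8 sequence corresponds to two UChars (a surrogate pair), so it averages 2 bytes per UChar. The sketch below illustrates that reading with Python's standard codecs; it does not call ICU, and the function and constant names are illustrative.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"  # substitution character bytes listed above


def utf8_bytes_per_utf16_unit(ch):
    """Return UTF-8 byte length divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


if __name__ == "__main__":
    for ch in ["A", "\u00E9", "\u20AC", "\U00072C00"]:
        print(f"U+{ord(ch):04X}: {utf8_bytes_per_utf16_unit(ch)} bytes per UTF-16 unit")
    # The substitution bytes are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.
    print(f"U+{ord(SUBSTITUTION.decode('utf-8')):04X}")
```

For the four samples the ratios are 1.0, 2.0, 3.0, and 2.0, and the substitution bytes decode to U+FFFD.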

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
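
The set above covers every Unicode code point except the surrogate range U+D800 to U+DFFF. As a rough check, the sketch below tests code points against Python's strict UTF-8 encoder, which is assumed here to agree with the ICU converter on this point; the function names are illustrative.

```python
def representable(cp):
    """Return True if the code point can be encoded as strict UTF-8."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


def count_unrepresentable():
    """Count code points in U+0000..U+10FFFF that strict UTF-8 rejects."""
    return sum(1 for cp in range(0x110000) if not representable(cp))


if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x72C00, 0x10FFFF):
        print(f"U+{cp:04X}: {representable(cp)}")
    print(count_unrepresentable())
```

The only rejected code points are the 2048 surrogates, matching the complement set `[^\uD800-\uDFFF]`.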