International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1B1B0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񱰀
71C00
񱰁
71C01
񱰂
71C02
񱰃
71C03
񱰄
71C04
񱰅
71C05
񱰆
71C06
񱰇
71C07
񱰈
71C08
񱰉
71C09
񱰊
71C0A
񱰋
71C0B
񱰌
71C0C
񱰍
71C0D
񱰎
71C0E
񱰏
71C0F
80
90
񱰐
71C10
񱰑
71C11
񱰒
71C12
񱰓
71C13
񱰔
71C14
񱰕
71C15
񱰖
71C16
񱰗
71C17
񱰘
71C18
񱰙
71C19
񱰚
71C1A
񱰛
71C1B
񱰜
71C1C
񱰝
71C1D
񱰞
71C1E
񱰟
71C1F
90
A0
񱰠
71C20
񱰡
71C21
񱰢
71C22
񱰣
71C23
񱰤
71C24
񱰥
71C25
񱰦
71C26
񱰧
71C27
񱰨
71C28
񱰩
71C29
񱰪
71C2A
񱰫
71C2B
񱰬
71C2C
񱰭
71C2D
񱰮
71C2E
񱰯
71C2F
A0
B0
񱰰
71C30
񱰱
71C31
񱰲
71C32
񱰳
71C33
񱰴
71C34
񱰵
71C35
񱰶
71C36
񱰷
71C37
񱰸
71C38
񱰹
71C39
񱰺
71C3A
񱰻
71C3B
񱰼
71C3C
񱰽
71C3D
񱰾
71C3E
񱰿
71C3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]