International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F382B0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󂰀
C2C00
󂰁
C2C01
󂰂
C2C02
󂰃
C2C03
󂰄
C2C04
󂰅
C2C05
󂰆
C2C06
󂰇
C2C07
󂰈
C2C08
󂰉
C2C09
󂰊
C2C0A
󂰋
C2C0B
󂰌
C2C0C
󂰍
C2C0D
󂰎
C2C0E
󂰏
C2C0F
80
90
󂰐
C2C10
󂰑
C2C11
󂰒
C2C12
󂰓
C2C13
󂰔
C2C14
󂰕
C2C15
󂰖
C2C16
󂰗
C2C17
󂰘
C2C18
󂰙
C2C19
󂰚
C2C1A
󂰛
C2C1B
󂰜
C2C1C
󂰝
C2C1D
󂰞
C2C1E
󂰟
C2C1F
90
A0
󂰠
C2C20
󂰡
C2C21
󂰢
C2C22
󂰣
C2C23
󂰤
C2C24
󂰥
C2C25
󂰦
C2C26
󂰧
C2C27
󂰨
C2C28
󂰩
C2C29
󂰪
C2C2A
󂰫
C2C2B
󂰬
C2C2C
󂰭
C2C2D
󂰮
C2C2E
󂰯
C2C2F
A0
B0
󂰰
C2C30
󂰱
C2C31
󂰲
C2C32
󂰳
C2C33
󂰴
C2C34
󂰵
C2C35
󂰶
C2C36
󂰷
C2C37
󂰸
C2C38
󂰹
C2C39
󂰺
C2C3A
󂰻
C2C3B
󂰼
C2C3C
󂰽
C2C3D
󂰾
C2C3E
󂰿
C2C3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]