International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IBM IANA
UTF-8 ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F19BB0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񛰀
5BC00
񛰁
5BC01
񛰂
5BC02
񛰃
5BC03
񛰄
5BC04
񛰅
5BC05
񛰆
5BC06
񛰇
5BC07
񛰈
5BC08
񛰉
5BC09
񛰊
5BC0A
񛰋
5BC0B
񛰌
5BC0C
񛰍
5BC0D
񛰎
5BC0E
񛰏
5BC0F
80
90
񛰐
5BC10
񛰑
5BC11
񛰒
5BC12
񛰓
5BC13
񛰔
5BC14
񛰕
5BC15
񛰖
5BC16
񛰗
5BC17
񛰘
5BC18
񛰙
5BC19
񛰚
5BC1A
񛰛
5BC1B
񛰜
5BC1C
񛰝
5BC1D
񛰞
5BC1E
񛰟
5BC1F
90
A0
񛰠
5BC20
񛰡
5BC21
񛰢
5BC22
񛰣
5BC23
񛰤
5BC24
񛰥
5BC25
񛰦
5BC26
񛰧
5BC27
񛰨
5BC28
񛰩
5BC29
񛰪
5BC2A
񛰫
5BC2B
񛰬
5BC2C
񛰭
5BC2D
񛰮
5BC2E
񛰯
5BC2F
A0
B0
񛰰
5BC30
񛰱
5BC31
񛰲
5BC32
񛰳
5BC33
񛰴
5BC34
񛰵
5BC35
񛰶
5BC36
񛰷
5BC37
񛰸
5BC38
񛰹
5BC39
񛰺
5BC3A
񛰻
5BC3B
񛰼
5BC3C
񛰽
5BC3D
񛰾
5BC3E
񛰿
5BC3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]