International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F19CB0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񜰀
5CC00
񜰁
5CC01
񜰂
5CC02
񜰃
5CC03
񜰄
5CC04
񜰅
5CC05
񜰆
5CC06
񜰇
5CC07
񜰈
5CC08
񜰉
5CC09
񜰊
5CC0A
񜰋
5CC0B
񜰌
5CC0C
񜰍
5CC0D
񜰎
5CC0E
񜰏
5CC0F
80
90
񜰐
5CC10
񜰑
5CC11
񜰒
5CC12
񜰓
5CC13
񜰔
5CC14
񜰕
5CC15
񜰖
5CC16
񜰗
5CC17
񜰘
5CC18
񜰙
5CC19
񜰚
5CC1A
񜰛
5CC1B
񜰜
5CC1C
񜰝
5CC1D
񜰞
5CC1E
񜰟
5CC1F
90
A0
񜰠
5CC20
񜰡
5CC21
񜰢
5CC22
񜰣
5CC23
񜰤
5CC24
񜰥
5CC25
񜰦
5CC26
񜰧
5CC27
񜰨
5CC28
񜰩
5CC29
񜰪
5CC2A
񜰫
5CC2B
񜰬
5CC2C
񜰭
5CC2D
񜰮
5CC2E
񜰯
5CC2F
A0
B0
񜰰
5CC30
񜰱
5CC31
񜰲
5CC32
񜰳
5CC33
񜰴
5CC34
񜰵
5CC35
񜰶
5CC36
񜰷
5CC37
񜰸
5CC38
񜰹
5CC39
񜰺
5CC3A
񜰻
5CC3B
񜰼
5CC3C
񜰽
5CC3D
񜰾
5CC3E
񜰿
5CC3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]