International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4 8A B8

Each cell gives the Unicode code point (hexadecimal) for the byte sequence F4 8A B8 followed by the trailing byte formed from the row and column labels. Rows 00 through 70 and C0 through F0 are empty.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80 10AE00 10AE01 10AE02 10AE03 10AE04 10AE05 10AE06 10AE07 10AE08 10AE09 10AE0A 10AE0B 10AE0C 10AE0D 10AE0E 10AE0F
90 10AE10 10AE11 10AE12 10AE13 10AE14 10AE15 10AE16 10AE17 10AE18 10AE19 10AE1A 10AE1B 10AE1C 10AE1D 10AE1E 10AE1F
A0 10AE20 10AE21 10AE22 10AE23 10AE24 10AE25 10AE26 10AE27 10AE28 10AE29 10AE2A 10AE2B 10AE2C 10AE2D 10AE2E 10AE2F
B0 10AE30 10AE31 10AE32 10AE33 10AE34 10AE35 10AE36 10AE37 10AE38 10AE39 10AE3A 10AE3B 10AE3C 10AE3D 10AE3E 10AE3F
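
The code points in this layout follow from the bit layout of four-byte UTF-8 sequences. As a minimal sketch in Python (not ICU code; the helper name `decode_four_byte` is made up for illustration), the code point for the prefix F4 8A B8 plus a trailing byte can be assembled by hand and cross-checked against Python's built-in decoder:

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Assemble a code point from a four-byte UTF-8 sequence.

    Hypothetical helper for illustration only: it checks the byte patterns
    but does not reject overlong forms or values above U+10FFFF.
    """
    if b0 & 0xF8 != 0xF0:
        raise ValueError("not a four-byte lead byte")
    for b in (b1, b2, b3):
        if b & 0xC0 != 0x80:
            raise ValueError("not a continuation byte (must be 80..BF)")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


# The prefix shown on this page, followed by each valid trailing byte.
prefix = (0xF4, 0x8A, 0xB8)
code_points = [decode_four_byte(*prefix, trail) for trail in range(0x80, 0xC0)]

# Cross-check every cell against the built-in UTF-8 decoder.
for trail, cp in zip(range(0x80, 0xC0), code_points):
    assert ord(bytes([*prefix, trail]).decode("utf-8")) == cp

print(f"{code_points[0]:06X}..{code_points[-1]:06X}")  # prints 10AE00..10AE3F
```

Trailing bytes outside 80..BF are not continuation bytes, which is consistent with the empty rows 00 through 70 and C0 through F0.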

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
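
The minimum and maximum byte counts are given per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a full code point. A four-byte UTF-8 sequence corresponds to a surrogate pair (two UChars), so it works out to 2 bytes per UChar, and the maximum of 3 comes from the three-byte sequences. The following rough check in Python (not ICU code; `bytes_per_uchar` and the sample list are assumptions chosen for illustration) shows the ratios and confirms that the substitution bytes are the UTF-8 form of U+FFFD:

```python
from fractions import Fraction


def bytes_per_uchar(cp: int) -> Fraction:
    """UTF-8 byte count divided by UTF-16 code unit count for one code point."""
    ch = chr(cp)
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return Fraction(utf8_len, utf16_units)


# One sample per UTF-8 sequence length, plus U+FFFD.
samples = [0x41, 0xE9, 0x20AC, 0xFFFD, 0x10AE00]
ratios = [bytes_per_uchar(cp) for cp in samples]

for cp, ratio in zip(samples, ratios):
    print(f"U+{cp:04X}: {ratio} byte(s) per UChar")

print("min:", min(ratios), "max:", max(ratios))  # prints min: 1 max: 3

# The substitution character listed above is U+FFFD encoded in UTF-8.
assert "\uFFFD".encode("utf-8") == b"\xEF\xBF\xBD"
```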

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
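
The set pattern above reads as "every code point except the surrogates U+D800 through U+DFFF". As an informal cross-check, the sketch below uses Python's strict UTF-8 encoder rather than ICU (so it shows that the two agree, not how ICU computes the set) and walks the whole code point range, collecting what cannot be encoded. The helper name `is_representable` is hypothetical.

```python
def is_representable(cp: int) -> bool:
    """Return True if Python's strict UTF-8 encoder accepts this code point."""
    try:
        chr(cp).encode("utf-8")
    except UnicodeEncodeError:
        return False
    return True


# Scan U+0000..U+10FFFF and keep the code points that fail to encode.
unrepresentable = [cp for cp in range(0x110000) if not is_representable(cp)]

print(
    f"{unrepresentable[0]:04X}..{unrepresentable[-1]:04X}, "
    f"{len(unrepresentable)} code points"
)  # prints D800..DFFF, 2048 code points
```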