International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F380A8

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󀨀
C0A00
󀨁
C0A01
󀨂
C0A02
󀨃
C0A03
󀨄
C0A04
󀨅
C0A05
󀨆
C0A06
󀨇
C0A07
󀨈
C0A08
󀨉
C0A09
󀨊
C0A0A
󀨋
C0A0B
󀨌
C0A0C
󀨍
C0A0D
󀨎
C0A0E
󀨏
C0A0F
80
90
󀨐
C0A10
󀨑
C0A11
󀨒
C0A12
󀨓
C0A13
󀨔
C0A14
󀨕
C0A15
󀨖
C0A16
󀨗
C0A17
󀨘
C0A18
󀨙
C0A19
󀨚
C0A1A
󀨛
C0A1B
󀨜
C0A1C
󀨝
C0A1D
󀨞
C0A1E
󀨟
C0A1F
90
A0
󀨠
C0A20
󀨡
C0A21
󀨢
C0A22
󀨣
C0A23
󀨤
C0A24
󀨥
C0A25
󀨦
C0A26
󀨧
C0A27
󀨨
C0A28
󀨩
C0A29
󀨪
C0A2A
󀨫
C0A2B
󀨬
C0A2C
󀨭
C0A2D
󀨮
C0A2E
󀨯
C0A2F
A0
B0
󀨰
C0A30
󀨱
C0A31
󀨲
C0A32
󀨳
C0A33
󀨴
C0A34
󀨵
C0A35
󀨶
C0A36
󀨷
C0A37
󀨸
C0A38
󀨹
C0A39
󀨺
C0A3A
󀨻
C0A3B
󀨼
C0A3C
󀨽
C0A3D
󀨾
C0A3E
󀨿
C0A3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]