International Components for Unicode


UTF-8


List of Converter Aliases

Internal converter name: UTF-8
All aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
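
The alias list above can be used to map any of the listed names back to the internal converter name. The following is a minimal sketch, not ICU's own lookup: `UTF8_ALIASES`, `normalize`, and `canonical_name` are hypothetical names, and the matching rule (case-insensitive, ignoring `-`, `_`, and spaces) is a simplifying assumption that may differ from ICU's actual alias matching.

```python
# Aliases copied from the list above; all helper names are hypothetical.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (an assumed, simplified rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return 'UTF-8' for any listed alias, or None if the name is unknown."""
    return _CANONICAL.get(normalize(name))


print(canonical_name("IBM_1208"))       # UTF-8
print(canonical_name("windows-65001"))  # UTF-8
print(canonical_name("latin-1"))        # None
```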


Codepage Layout

Currently showing the codepage starting with the bytes F384A8

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80  C4A00 C4A01 C4A02 C4A03 C4A04 C4A05 C4A06 C4A07 C4A08 C4A09 C4A0A C4A0B C4A0C C4A0D C4A0E C4A0F
90  C4A10 C4A11 C4A12 C4A13 C4A14 C4A15 C4A16 C4A17 C4A18 C4A19 C4A1A C4A1B C4A1C C4A1D C4A1E C4A1F
A0  C4A20 C4A21 C4A22 C4A23 C4A24 C4A25 C4A26 C4A27 C4A28 C4A29 C4A2A C4A2B C4A2C C4A2D C4A2E C4A2F
B0  C4A30 C4A31 C4A32 C4A33 C4A34 C4A35 C4A36 C4A37 C4A38 C4A39 C4A3A C4A3B C4A3C C4A3D C4A3E C4A3F
C0-F0  (empty)
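
The layout above pairs each possible fourth byte (80 through BF) after the prefix F3 84 A8 with a code point from C4A00 to C4A3F. The arithmetic behind that mapping can be sketched as follows. This is an illustrative decoder for a single 4-byte sequence only, with the hypothetical name `decode_4byte`; it checks the bit patterns of the lead and trail bytes but not overlong forms or the U+10FFFF upper limit, and it is not how ICU implements its converter.

```python
def decode_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence (11110xxx 10xxxxxx x3) to a code point."""
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in the range 80-BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


prefix = bytes.fromhex("F384A8")
first = decode_4byte(prefix + b"\x80")
last = decode_4byte(prefix + b"\xBF")
print(f"{first:X} {last:X}")  # C4A00 C4A3F

# Cross-check against Python's built-in UTF-8 decoder.
assert chr(first) == (prefix + b"\x80").decode("utf-8")
```

Bytes outside 80 through BF cannot follow this prefix, which is consistent with rows 00 to 70 and C0 to F0 being empty.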

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
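
The byte counts above are stated per UChar, which in ICU is a 16-bit UTF-16 code unit. A code point outside the Basic Multilingual Plane takes 4 UTF-8 bytes but also 2 UChars, so the per-UChar maximum of 3 comes from code points in the range U+0800 to U+FFFF. The sketch below checks this with Python's built-in codecs rather than ICU; `bytes_per_uchar` is a hypothetical helper, and the sample characters are chosen only for illustration. It also shows that the substitution bytes EF BF BD are the UTF-8 encoding of U+FFFD.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample each for 1-, 2-, 3- and 4-byte UTF-8 sequences.
samples = ["A", "\u00e9", "\u20ac", "\U000C4A00"]
ratios = [bytes_per_uchar(ch) for ch in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The substitution character listed above is U+FFFD encoded as UTF-8.
substitution = "\ufffd".encode("utf-8")
print(substitution)  # b'\xef\xbf\xbd'
```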

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
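
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check under the assumption that Python's strict `utf-8` codec applies the same exclusion, the sketch below compares a hypothetical predicate, `is_representable`, against what the codec actually accepts at the boundaries of that range.

```python
def is_representable(ch: str) -> bool:
    """True if the character is in [^\\uD800-\\uDFFF], i.e. not a surrogate."""
    return not (0xD800 <= ord(ch) <= 0xDFFF)


def encodes_as_utf8(ch: str) -> bool:
    """True if Python's strict UTF-8 codec accepts the character."""
    try:
        ch.encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary code points around the excluded surrogate range.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    ch = chr(cp)
    assert encodes_as_utf8(ch) == is_representable(ch)
    print(f"U+{cp:04X} representable: {is_representable(ch)}")
```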