International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38083

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󀃀
C00C0
󀃁
C00C1
󀃂
C00C2
󀃃
C00C3
󀃄
C00C4
󀃅
C00C5
󀃆
C00C6
󀃇
C00C7
󀃈
C00C8
󀃉
C00C9
󀃊
C00CA
󀃋
C00CB
󀃌
C00CC
󀃍
C00CD
󀃎
C00CE
󀃏
C00CF
80
90
󀃐
C00D0
󀃑
C00D1
󀃒
C00D2
󀃓
C00D3
󀃔
C00D4
󀃕
C00D5
󀃖
C00D6
󀃗
C00D7
󀃘
C00D8
󀃙
C00D9
󀃚
C00DA
󀃛
C00DB
󀃜
C00DC
󀃝
C00DD
󀃞
C00DE
󀃟
C00DF
90
A0
󀃠
C00E0
󀃡
C00E1
󀃢
C00E2
󀃣
C00E3
󀃤
C00E4
󀃥
C00E5
󀃦
C00E6
󀃧
C00E7
󀃨
C00E8
󀃩
C00E9
󀃪
C00EA
󀃫
C00EB
󀃬
C00EC
󀃭
C00ED
󀃮
C00EE
󀃯
C00EF
A0
B0
󀃰
C00F0
󀃱
C00F1
󀃲
C00F2
󀃳
C00F3
󀃴
C00F4
󀃵
C00F5
󀃶
C00F6
󀃷
C00F7
󀃸
C00F8
󀃹
C00F9
󀃺
C00FA
󀃻
C00FB
󀃼
C00FC
󀃽
C00FD
󀃾
C00FE
󀃿
C00FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]