International Components for Unicode


UTF-8

List of Converter Aliases
Internal Converter Name    All Aliases
UTF-8                      UTF-8
                           ibm-1208
                           ibm-1209
                           ibm-5304
                           ibm-5305
                           ibm-13496
                           ibm-13497
                           ibm-17592
                           ibm-17593
                           windows-65001
                           cp1208
                           x-UTF_8J
                           unicode-1-1-utf-8
                           unicode-2-0-utf-8
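
The alias table above can be read as a many-to-one mapping from alias to internal converter name. The following is a minimal sketch of such a lookup, with the aliases hard-coded from the table. The helper names (`normalize`, `canonical_name`) are hypothetical, and the lenient matching rule (ignore case, '-', '_' and spaces) is an assumption made for illustration; the code does not call ICU.

```python
# Aliases copied from the table above. The lookup logic is an illustrative
# assumption, not ICU's implementation.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (hypothetical helper)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every alias in the table maps to the same internal converter name.
_ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(alias: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return _ALIAS_TO_CANONICAL.get(normalize(alias))


print(canonical_name("IBM-1208"))  # UTF-8
print(canonical_name("utf8"))      # UTF-8
print(canonical_name("latin-1"))   # None
```

A dictionary keyed on the normalized name keeps each lookup constant-time and makes it easy to add further converters later.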


Codepage Layout

Currently showing the codepage starting with the bytes F0 9F BC

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  🼀     🼁     🼂     🼃     🼄     🼅     🼆     🼇     🼈     🼉     🼊     🼋     🼌     🼍     🼎     🼏
    1FF00 1FF01 1FF02 1FF03 1FF04 1FF05 1FF06 1FF07 1FF08 1FF09 1FF0A 1FF0B 1FF0C 1FF0D 1FF0E 1FF0F
90  🼐     🼑     🼒     🼓     🼔     🼕     🼖     🼗     🼘     🼙     🼚     🼛     🼜     🼝     🼞     🼟
    1FF10 1FF11 1FF12 1FF13 1FF14 1FF15 1FF16 1FF17 1FF18 1FF19 1FF1A 1FF1B 1FF1C 1FF1D 1FF1E 1FF1F
A0  🼠     🼡     🼢     🼣     🼤     🼥     🼦     🼧     🼨     🼩     🼪     🼫     🼬     🼭     🼮     🼯
    1FF20 1FF21 1FF22 1FF23 1FF24 1FF25 1FF26 1FF27 1FF28 1FF29 1FF2A 1FF2B 1FF2C 1FF2D 1FF2E 1FF2F
B0  🼰     🼱     🼲     🼳     🼴     🼵     🼶     🼷     🼸     🼹     🼺     🼻     🼼     🼽     🼾     🼿
    1FF30 1FF31 1FF32 1FF33 1FF34 1FF35 1FF36 1FF37 1FF38 1FF39 1FF3A 1FF3B 1FF3C 1FF3D 1FF3E 1FF3F
C0
D0
E0
F0
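
The grid can be reproduced by appending each possible fourth byte to the three lead bytes and decoding the result. The following is a minimal sketch using Python's built-in UTF-8 codec rather than ICU; the names `LEAD`, `decode_trail` and `is_valid_trail` are hypothetical.

```python
LEAD = bytes([0xF0, 0x9F, 0xBC])  # the three lead bytes shown on this page


def decode_trail(trail: int) -> int:
    """Decode LEAD plus one trail byte as UTF-8 and return the code point."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def is_valid_trail(trail: int) -> bool:
    """True if LEAD plus this trail byte is a well-formed UTF-8 sequence."""
    try:
        decode_trail(trail)
        return True
    except UnicodeDecodeError:
        return False


# Trail bytes 0x80..0xBF fill rows 80 through B0 of the grid.
grid = {trail: decode_trail(trail) for trail in range(0x80, 0xC0)}
print(f"{grid[0x80]:X} .. {grid[0xBF]:X}")  # 1FF00 .. 1FF3F

# Every other byte value is not a continuation byte, so its cell stays empty.
valid = [b for b in range(0x100) if is_valid_trail(b)]
print(f"{valid[0]:02X} .. {valid[-1]:02X}, {len(valid)} cells")  # 80 .. BF, 64 cells
```

Only the low six bits of the trail byte contribute to the code point, which is why each row of 16 cells advances the code point by 0x10.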

Information About This Converter
Type of converter                        UCNV_UTF8
Minimum number of bytes per UChar        1
Maximum number of bytes per UChar        3
Substitution character                   \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?         TRUE
Is ASCII [\u0020-\u007E] ambiguous?      FALSE
Contains ambiguous aliases?              FALSE
Always generates Unicode NFC?            UNKNOWN
Contains BiDi characters?                TRUE
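
The byte counts above are per UChar, that is, per UTF-16 code unit, not per character. A four-byte UTF-8 sequence corresponds to a surrogate pair (two UChars), so it averages two bytes per UChar and the maximum stays at 3. The following is a minimal sketch that checks this with Python's built-in codecs rather than ICU; `utf8_bytes_per_uchar` is a hypothetical helper.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_count = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / uchar_count


# One sample for each UTF-8 sequence length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U0001FF00"]
ratios = [utf8_bytes_per_uchar(ch) for ch in samples]
print(ratios)                    # [1.0, 2.0, 3.0, 2.0]
print(min(ratios), max(ratios))  # 1.0 3.0

# The substitution bytes are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.
SUBSTITUTION = b"\xEF\xBF\xBD"
print(f"U+{ord(SUBSTITUTION.decode('utf-8')):04X}")  # U+FFFD
```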

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
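
The set above says that every code point except the surrogates U+D800 through U+DFFF can be encoded. The following is a minimal sketch that probes the boundaries with Python's built-in strict UTF-8 encoder, used here as a stand-in for the ICU converter; `is_representable` is a hypothetical helper.

```python
def is_representable(code_point: int) -> bool:
    """True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe both edges of the excluded surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000):
    print(f"U+{cp:04X}: {is_representable(cp)}")
# U+D7FF: True
# U+D800: False
# U+DFFF: False
# U+E000: True

# Collect every code point in the Basic Multilingual Plane that cannot be encoded.
excluded = [cp for cp in range(0x10000) if not is_representable(cp)]
print(f"U+{excluded[0]:04X} .. U+{excluded[-1]:04X}, {len(excluded)} code points")
# U+D800 .. U+DFFF, 2048 code points
```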