International Components for Unicode

UTF-8

List of Converter Aliases

Internal converter name: UTF-8

All aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09E80

Each row is the high nibble of the final byte and each column is the low nibble. Each cell shows the character followed by its code point.

       00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
00-70  (empty)
80     𞀀 1E000  𞀁 1E001  𞀂 1E002  𞀃 1E003  𞀄 1E004  𞀅 1E005  𞀆 1E006  𞀇 1E007  𞀈 1E008  𞀉 1E009  𞀊 1E00A  𞀋 1E00B  𞀌 1E00C  𞀍 1E00D  𞀎 1E00E  𞀏 1E00F
90     𞀐 1E010  𞀑 1E011  𞀒 1E012  𞀓 1E013  𞀔 1E014  𞀕 1E015  𞀖 1E016  𞀗 1E017  𞀘 1E018  𞀙 1E019  𞀚 1E01A  𞀛 1E01B  𞀜 1E01C  𞀝 1E01D  𞀞 1E01E  𞀟 1E01F
A0     𞀠 1E020  𞀡 1E021  𞀢 1E022  𞀣 1E023  𞀤 1E024  𞀥 1E025  𞀦 1E026  𞀧 1E027  𞀨 1E028  𞀩 1E029  𞀪 1E02A  𞀫 1E02B  𞀬 1E02C  𞀭 1E02D  𞀮 1E02E  𞀯 1E02F
B0     𞀰 1E030  𞀱 1E031  𞀲 1E032  𞀳 1E033  𞀴 1E034  𞀵 1E035  𞀶 1E036  𞀷 1E037  𞀸 1E038  𞀹 1E039  𞀺 1E03A  𞀻 1E03B  𞀼 1E03C  𞀽 1E03D  𞀾 1E03E  𞀿 1E03F
C0-F0  (empty)
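
The grid above can be reproduced outside the demo. The following is a minimal sketch that uses Python's built-in UTF-8 codec rather than ICU (an assumption: the two agree on well-formed input). It appends each trailing byte in the range 80-BF to the prefix F0 9E 80 and decodes the result. The helper name `code_points_for_prefix` is hypothetical and not part of ICU.

```python
def code_points_for_prefix(prefix_hex: str) -> dict:
    """Map each trailing byte (0x80-0xBF) to the code point it completes.

    Hypothetical helper for illustration; assumes a 3-byte prefix of a
    well-formed 4-byte UTF-8 sequence, such as "F09E80".
    """
    prefix = bytes.fromhex(prefix_hex)
    table = {}
    for trail in range(0x80, 0xC0):  # only 80-BF are valid trailing bytes
        text = (prefix + bytes([trail])).decode("utf-8")
        table[trail] = ord(text)
    return table


layout = code_points_for_prefix("F09E80")
print(f"{layout[0x80]:X} .. {layout[0xBF]:X}")  # prints "1E000 .. 1E03F"
```

Only 64 of the 256 cells are filled because a UTF-8 trailing byte always has the bit pattern 10xxxxxx, which is why rows 00-70 and C0-F0 stay empty.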

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
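
Some of these properties can be checked with a short sketch. It uses Python's built-in codec as a stand-in for the ICU converter, which is an assumption, and the helper `bytes_per_utf16_unit` is a hypothetical name. A UChar in ICU is a 16-bit UTF-16 code unit, so a 4-byte UTF-8 sequence corresponds to two UChars. That is why the per-UChar maximum is 3 and not 4.

```python
# Substitution character: U+FFFD encodes to the bytes EF BF BD.
substitution = "\ufffd".encode("utf-8")

# Malformed input decoded with errors="replace" yields that same character.
replaced = b"\xff".decode("utf-8", errors="replace")


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-be")) // 2
    return utf8_len / utf16_units


print(substitution.hex().upper())          # EFBFBD
print(replaced == "\ufffd")                # True
print(bytes_per_utf16_unit("A"))           # 1.0 (the minimum)
print(bytes_per_utf16_unit("\uffff"))      # 3.0 (the maximum, top of the BMP)
print(bytes_per_utf16_unit("\U0001E000"))  # 2.0 (4 bytes over 2 code units)
```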

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
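
The set above excludes only the surrogate range. As a rough illustration, again using Python's strict UTF-8 codec in place of ICU (an assumption), the hypothetical helper `is_representable` tries to encode a single code point and reports whether it succeeds.

```python
def is_representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as UTF-8.

    Hypothetical helper; relies on Python's strict UTF-8 codec, which
    rejects the surrogate code points U+D800 through U+DFFF.
    """
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```

Surrogates are reserved for UTF-16 pairs and are not scalar values, so a conforming UTF-8 encoder has no byte sequence for them.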