International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
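
As a rough illustration of how an alias list like this can be used, the sketch below resolves any of the names above to the internal converter name. It is not ICU's implementation: the names `UTF8_ALIASES`, `normalize`, `ALIAS_TO_CONVERTER` and `resolve` are hypothetical, and the loose matching (ignoring case, hyphens, underscores and spaces) is a simplifying assumption.

```python
# Aliases copied from the list above; the lookup logic is an illustrative
# sketch, not ICU's actual implementation.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (assumed loose matching)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Hypothetical lookup table: normalized alias -> internal converter name.
ALIAS_TO_CONVERTER = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CONVERTER.get(normalize(name))


print(resolve("IBM_1208"))   # an alias written with different case and separator
print(resolve("latin-1"))    # not an alias of this converter
```

With this sketch, `resolve("IBM_1208")` and `resolve("cp1208")` both map to `UTF-8`, while a name that is not in the list yields `None`.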


Codepage Layout

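Currently showing the codepage for byte sequences starting with the bytes F2 95 93. The fourth byte is the row label plus the column label; each filled cell gives the code point (hexadecimal) that the four-byte sequence maps to.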

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  954C0 954C1 954C2 954C3 954C4 954C5 954C6 954C7 954C8 954C9 954CA 954CB 954CC 954CD 954CE 954CF
90  954D0 954D1 954D2 954D3 954D4 954D5 954D6 954D7 954D8 954D9 954DA 954DB 954DC 954DD 954DE 954DF
A0  954E0 954E1 954E2 954E3 954E4 954E5 954E6 954E7 954E8 954E9 954EA 954EB 954EC 954ED 954EE 954EF
B0  954F0 954F1 954F2 954F3 954F4 954F5 954F6 954F7 954F8 954F9 954FA 954FB 954FC 954FD 954FE 954FF
C0
D0
E0
F0
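
The layout above can be reproduced with a short sketch that appends every possible fourth byte to the three bytes F2 95 93 and attempts a strict UTF-8 decode. It uses Python's built-in codec rather than ICU, and the names `LEAD`, `decode_cell`, `layout` and `filled` are illustrative only.

```python
LEAD = bytes([0xF2, 0x95, 0x93])  # the three lead bytes shown on the page


def decode_cell(trail: int):
    """Decode LEAD + trail; return the code point, or None if invalid."""
    try:
        text = (LEAD + bytes([trail])).decode("utf-8")
    except UnicodeDecodeError:
        return None
    return ord(text)


# Map every possible fourth byte to the code point it completes, if any.
layout = {trail: decode_cell(trail) for trail in range(0x100)}
filled = {t: cp for t, cp in layout.items() if cp is not None}

print(f"{min(filled):02X}..{max(filled):02X}")        # prints 80..BF
print(f"U+{filled[0x80]:05X}..U+{filled[0xBF]:05X}")  # prints U+954C0..U+954FF
```

Only the 64 continuation bytes 80 through BF complete a valid sequence, which is why rows 00 to 70 and C0 to F0 of the table are empty.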

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
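
The byte counts above are stated per UChar, ICU's 16-bit UTF-16 code unit, not per code point. A minimal sketch of that arithmetic follows, using Python's built-in codec as a stand-in for the ICU converter; the helper names `utf16_units` and `bytes_per_uchar` and the sample characters are chosen here for illustration.

```python
def utf16_units(ch: str) -> int:
    """Number of UTF-16 code units (UChars) needed for one code point."""
    return 2 if ord(ch) > 0xFFFF else 1


def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes produced per UTF-16 code unit for one code point."""
    return len(ch.encode("utf-8")) / utf16_units(ch)


# One sample per UTF-8 length class, plus a supplementary code point.
samples = ["A", "\u00e9", "\u20ac", "\uffff", "\U000954C0"]
ratios = [bytes_per_uchar(ch) for ch in samples]
print(ratios)                     # prints [1.0, 2.0, 3.0, 3.0, 2.0]

# The substitution character bytes are the UTF-8 encoding of U+FFFD.
print("\ufffd".encode("utf-8"))   # prints b'\xef\xbf\xbd'

# Printable ASCII bytes decode to the same code points.
ascii_ok = all(bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F))
print(ascii_ok)                   # prints True
```

A supplementary code point such as U+954C0 takes four bytes but also two UChars, so the per-UChar maximum stays at 3.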

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
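
The set expression says that every code point except the surrogate range U+D800 to U+DFFF is representable. The sketch below checks this against Python's strict UTF-8 codec, which is assumed here to behave like the ICU converter in this respect; `representable` and `unrepresentable` are illustrative names.

```python
def representable(code_point: int) -> bool:
    """True if the code point can be encoded as UTF-8 by Python's codec."""
    try:
        chr(code_point).encode("utf-8")
    except UnicodeEncodeError:
        return False
    return True


# Scan the whole Unicode code space, U+0000 through U+10FFFF.
unrepresentable = [cp for cp in range(0x110000) if not representable(cp)]

first, last = unrepresentable[0], unrepresentable[-1]
print(f"U+{first:04X}..U+{last:04X}", len(unrepresentable))  # prints U+D800..U+DFFF 2048
```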