International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3BF93

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󿓀
FF4C0
󿓁
FF4C1
󿓂
FF4C2
󿓃
FF4C3
󿓄
FF4C4
󿓅
FF4C5
󿓆
FF4C6
󿓇
FF4C7
󿓈
FF4C8
󿓉
FF4C9
󿓊
FF4CA
󿓋
FF4CB
󿓌
FF4CC
󿓍
FF4CD
󿓎
FF4CE
󿓏
FF4CF
80
90
󿓐
FF4D0
󿓑
FF4D1
󿓒
FF4D2
󿓓
FF4D3
󿓔
FF4D4
󿓕
FF4D5
󿓖
FF4D6
󿓗
FF4D7
󿓘
FF4D8
󿓙
FF4D9
󿓚
FF4DA
󿓛
FF4DB
󿓜
FF4DC
󿓝
FF4DD
󿓞
FF4DE
󿓟
FF4DF
90
A0
󿓠
FF4E0
󿓡
FF4E1
󿓢
FF4E2
󿓣
FF4E3
󿓤
FF4E4
󿓥
FF4E5
󿓦
FF4E6
󿓧
FF4E7
󿓨
FF4E8
󿓩
FF4E9
󿓪
FF4EA
󿓫
FF4EB
󿓬
FF4EC
󿓭
FF4ED
󿓮
FF4EE
󿓯
FF4EF
A0
B0
󿓰
FF4F0
󿓱
FF4F1
󿓲
FF4F2
󿓳
FF4F3
󿓴
FF4F4
󿓵
FF4F5
󿓶
FF4F6
󿓷
FF4F7
󿓸
FF4F8
󿓹
FF4F9
󿓺
FF4FA
󿓻
FF4FB
󿓼
FF4FC
󿓽
FF4FD
󿓾
FF4FE
󿓿
FF4FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]