International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A987

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󩇀
E91C0
󩇁
E91C1
󩇂
E91C2
󩇃
E91C3
󩇄
E91C4
󩇅
E91C5
󩇆
E91C6
󩇇
E91C7
󩇈
E91C8
󩇉
E91C9
󩇊
E91CA
󩇋
E91CB
󩇌
E91CC
󩇍
E91CD
󩇎
E91CE
󩇏
E91CF
80
90
󩇐
E91D0
󩇑
E91D1
󩇒
E91D2
󩇓
E91D3
󩇔
E91D4
󩇕
E91D5
󩇖
E91D6
󩇗
E91D7
󩇘
E91D8
󩇙
E91D9
󩇚
E91DA
󩇛
E91DB
󩇜
E91DC
󩇝
E91DD
󩇞
E91DE
󩇟
E91DF
90
A0
󩇠
E91E0
󩇡
E91E1
󩇢
E91E2
󩇣
E91E3
󩇤
E91E4
󩇥
E91E5
󩇦
E91E6
󩇧
E91E7
󩇨
E91E8
󩇩
E91E9
󩇪
E91EA
󩇫
E91EB
󩇬
E91EC
󩇭
E91ED
󩇮
E91EE
󩇯
E91EF
A0
B0
󩇰
E91F0
󩇱
E91F1
󩇲
E91F2
󩇳
E91F3
󩇴
E91F4
󩇵
E91F5
󩇶
E91F6
󩇷
E91F7
󩇸
E91F8
󩇹
E91F9
󩇺
E91FA
󩇻
E91FB
󩇼
E91FC
󩇽
E91FD
󩇾
E91FE
󩇿
E91FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]