International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
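
The alias list above can be treated as a lookup table from any alias to the internal converter name. The following is a minimal sketch of that idea, not ICU's implementation. The helper names `normalize` and `resolve` are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for illustration.

```python
from typing import Optional

# Alias list copied from the table above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical loose matching: ignore case, '-', '_' and spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every alias maps to the single internal converter name shown in the table.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(normalize(name))
```

With this sketch, `resolve("IBM_1208")` and `resolve("cp1208")` both return `"UTF-8"`, while a name that is not in the list returns `None`.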


Codepage Layout

Currently showing the codepage starting with the bytes F3 88 9B

Each cell lists the Unicode code point (hex) for the final byte given by row + column.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80 C86C0 C86C1 C86C2 C86C3 C86C4 C86C5 C86C6 C86C7 C86C8 C86C9 C86CA C86CB C86CC C86CD C86CE C86CF
90 C86D0 C86D1 C86D2 C86D3 C86D4 C86D5 C86D6 C86D7 C86D8 C86D9 C86DA C86DB C86DC C86DD C86DE C86DF
A0 C86E0 C86E1 C86E2 C86E3 C86E4 C86E5 C86E6 C86E7 C86E8 C86E9 C86EA C86EB C86EC C86ED C86EE C86EF
B0 C86F0 C86F1 C86F2 C86F3 C86F4 C86F5 C86F6 C86F7 C86F8 C86F9 C86FA C86FB C86FC C86FD C86FE C86FF

Rows 00-70 and C0-F0 are empty.
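
The code points in the layout follow from the UTF-8 bit layout for four-byte sequences: three payload bits from the lead byte and six from each of the three continuation bytes. A short sketch of that arithmetic, cross-checked against Python's built-in decoder (the function name `decode_utf8_4byte` is hypothetical):

```python
def decode_utf8_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence.

    Lead byte 11110xxx contributes 3 bits; each continuation byte
    10xxxxxx contributes 6 bits.
    """
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


LEAD = bytes([0xF3, 0x88, 0x9B])  # the prefix shown in the layout

first = decode_utf8_4byte(0xF3, 0x88, 0x9B, 0x80)
last = decode_utf8_4byte(0xF3, 0x88, 0x9B, 0xBF)

# Cross-check the manual arithmetic against the built-in UTF-8 decoder.
builtin_first = ord((LEAD + b"\x80").decode("utf-8"))
builtin_last = ord((LEAD + b"\xBF").decode("utf-8"))

print(f"{first:X} .. {last:X}")
```

The final byte ranges over 80-BF, which gives the 64 consecutive code points in the four populated rows.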

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
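
Several of these properties can be checked with a few lines of Python. This is a sketch that uses Python's own UTF-8 codec, not ICU. It reads "UChar" as one UTF-16 code unit, and the helper `bytes_per_utf16_unit` is hypothetical. Under that reading, a supplementary character takes four UTF-8 bytes but two UTF-16 code units, so the ratio never exceeds 3.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"

# The substitution bytes are the UTF-8 encoding of U+FFFD.
replacement_char = SUBSTITUTION.decode("utf-8")

# Python's "replace" error handler also emits U+FFFD for invalid input.
decoded_invalid = b"\xFF".decode("utf-8", errors="replace")


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample for each UTF-8 sequence length (1, 2, 3 and 4 bytes).
samples = ["A", "\u00e9", "\u20ac", "\U000C86C0"]
ratios = [bytes_per_utf16_unit(ch) for ch in samples]

# ASCII compatibility: bytes 0x20-0x7E decode to the same code point values.
ascii_compatible = all(
    ord(bytes([b]).decode("utf-8")) == b for b in range(0x20, 0x7F)
)
```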


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
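
The set `[^\uD800-\uDFFF]` covers every code point except the surrogate range. A minimal sketch of that membership test, compared with the behavior of Python's strict UTF-8 encoder (the names `is_representable` and `encodes` are hypothetical):

```python
def is_representable(cp: int) -> bool:
    """True for any code point outside the surrogate range D800-DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values on both sides of the surrogate range.
boundaries = [0xD7FF, 0xD800, 0xDFFF, 0xE000]
results = [(cp, is_representable(cp), encodes(cp)) for cp in boundaries]
```

At the boundaries, D7FF and E000 are accepted while D800 and DFFF are rejected, and the two functions agree on all four values.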