International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A4A3

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󤣀
E48C0
󤣁
E48C1
󤣂
E48C2
󤣃
E48C3
󤣄
E48C4
󤣅
E48C5
󤣆
E48C6
󤣇
E48C7
󤣈
E48C8
󤣉
E48C9
󤣊
E48CA
󤣋
E48CB
󤣌
E48CC
󤣍
E48CD
󤣎
E48CE
󤣏
E48CF
80
90
󤣐
E48D0
󤣑
E48D1
󤣒
E48D2
󤣓
E48D3
󤣔
E48D4
󤣕
E48D5
󤣖
E48D6
󤣗
E48D7
󤣘
E48D8
󤣙
E48D9
󤣚
E48DA
󤣛
E48DB
󤣜
E48DC
󤣝
E48DD
󤣞
E48DE
󤣟
E48DF
90
A0
󤣠
E48E0
󤣡
E48E1
󤣢
E48E2
󤣣
E48E3
󤣤
E48E4
󤣥
E48E5
󤣦
E48E6
󤣧
E48E7
󤣨
E48E8
󤣩
E48E9
󤣪
E48EA
󤣫
E48EB
󤣬
E48EC
󤣭
E48ED
󤣮
E48EE
󤣯
E48EF
A0
B0
󤣰
E48F0
󤣱
E48F1
󤣲
E48F2
󤣳
E48F3
󤣴
E48F4
󤣵
E48F5
󤣶
E48F6
󤣷
E48F7
󤣸
E48F8
󤣹
E48F9
󤣺
E48FA
󤣻
E48FB
󤣼
E48FC
󤣽
E48FD
󤣾
E48FE
󤣿
E48FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]