International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
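
The alias list above can be used to recognize any of these names as UTF-8. The following is a minimal Python sketch, not ICU's own lookup code: the `UTF8_ALIASES` table, `normalize`, and `is_utf8_alias` are hypothetical names, and the comparison rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption.

```python
# Aliases copied from the list above; all helper names are illustrative only.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and ' ' (assumed comparison rule)."""
    return "".join(c for c in name.lower() if c not in "-_ ")


# Precompute the normalized forms once so each lookup is a set membership test.
_NORMALIZED = {normalize(alias) for alias in UTF8_ALIASES}


def is_utf8_alias(name: str) -> bool:
    """Return True if the name matches one of the listed UTF-8 aliases."""
    return normalize(name) in _NORMALIZED


if __name__ == "__main__":
    for candidate in ("utf8", "IBM-1208", "windows-65001", "latin1"):
        print(candidate, is_utf8_alias(candidate))
```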


Codepage Layout

Currently showing the codepage starting with the bytes F384A3

Rows and columns give the high and low hex digit of the final byte; each cell shows the resulting Unicode code point in hex.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  C48C0 C48C1 C48C2 C48C3 C48C4 C48C5 C48C6 C48C7 C48C8 C48C9 C48CA C48CB C48CC C48CD C48CE C48CF
90  C48D0 C48D1 C48D2 C48D3 C48D4 C48D5 C48D6 C48D7 C48D8 C48D9 C48DA C48DB C48DC C48DD C48DE C48DF
A0  C48E0 C48E1 C48E2 C48E3 C48E4 C48E5 C48E6 C48E7 C48E8 C48E9 C48EA C48EB C48EC C48ED C48EE C48EF
B0  C48F0 C48F1 C48F2 C48F3 C48F4 C48F5 C48F6 C48F7 C48F8 C48F9 C48FA C48FB C48FC C48FD C48FE C48FF
C0
D0
E0
F0
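
The layout can be reproduced by appending each possible final byte to the lead bytes F3 84 A3 and decoding the result. This is a minimal Python sketch using the standard library's UTF-8 codec rather than ICU; `LEAD` and `decode_cell` are hypothetical names chosen for illustration.

```python
from typing import Optional

# The three lead bytes shown on this page.
LEAD = bytes.fromhex("F384A3")


def decode_cell(trail: int) -> Optional[int]:
    """Return the code point for LEAD + trail, or None if it is not valid UTF-8."""
    try:
        text = (LEAD + bytes([trail])).decode("utf-8")
    except UnicodeDecodeError:
        return None
    # A valid four-byte sequence decodes to exactly one character.
    return ord(text) if len(text) == 1 else None


if __name__ == "__main__":
    for row in range(0x00, 0x100, 0x10):
        cells = [decode_cell(row + col) for col in range(16)]
        line = " ".join("%05X" % cp if cp is not None else "     " for cp in cells)
        print("%02X  %s" % (row, line.rstrip()))
```

Only final bytes in the range 80 to BF are continuation bytes, which is why the other rows of the layout are empty.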

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
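
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. The following Python sketch illustrates why the maximum is 3 even though a UTF-8 sequence can be 4 bytes long: a 4-byte sequence corresponds to two UTF-16 code units. The helper name `utf8_bytes_and_utf16_units` is hypothetical, and the code uses Python's built-in codecs, not ICU.

```python
from typing import Tuple


def utf8_bytes_and_utf16_units(ch: str) -> Tuple[int, int]:
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-be")) // 2  # 2 bytes per code unit
    return utf8_len, utf16_units


# The substitution bytes listed above are the UTF-8 form of U+FFFD.
SUBSTITUTION = b"\xEF\xBF\xBD"

if __name__ == "__main__":
    for ch in ("A", "\uFFFD", chr(0xC48C0)):
        n_bytes, n_units = utf8_bytes_and_utf16_units(ch)
        print("U+%04X: %d bytes, %d UTF-16 unit(s)" % (ord(ch), n_bytes, n_units))
    print(SUBSTITUTION.decode("utf-8") == "\uFFFD")
```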

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
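
The set `[^\uD800-\uDFFF]` means every Unicode code point except the surrogate range. As a rough check, the Python sketch below compares that rule against Python's own UTF-8 encoder at the boundaries of the range; `in_representable_set` and `python_can_encode` are hypothetical helper names, and Python's codec stands in for the ICU converter here.

```python
def in_representable_set(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range D800-DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """True if Python's strict UTF-8 codec accepts the code point."""
    try:
        chr(cp).encode("utf-8")
    except UnicodeEncodeError:
        return False
    return True


if __name__ == "__main__":
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print("U+%04X" % cp, in_representable_set(cp), python_can_encode(cp))
```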