International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A48B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󤋀
E42C0
󤋁
E42C1
󤋂
E42C2
󤋃
E42C3
󤋄
E42C4
󤋅
E42C5
󤋆
E42C6
󤋇
E42C7
󤋈
E42C8
󤋉
E42C9
󤋊
E42CA
󤋋
E42CB
󤋌
E42CC
󤋍
E42CD
󤋎
E42CE
󤋏
E42CF
80
90
󤋐
E42D0
󤋑
E42D1
󤋒
E42D2
󤋓
E42D3
󤋔
E42D4
󤋕
E42D5
󤋖
E42D6
󤋗
E42D7
󤋘
E42D8
󤋙
E42D9
󤋚
E42DA
󤋛
E42DB
󤋜
E42DC
󤋝
E42DD
󤋞
E42DE
󤋟
E42DF
90
A0
󤋠
E42E0
󤋡
E42E1
󤋢
E42E2
󤋣
E42E3
󤋤
E42E4
󤋥
E42E5
󤋦
E42E6
󤋧
E42E7
󤋨
E42E8
󤋩
E42E9
󤋪
E42EA
󤋫
E42EB
󤋬
E42EC
󤋭
E42ED
󤋮
E42EE
󤋯
E42EF
A0
B0
󤋰
E42F0
󤋱
E42F1
󤋲
E42F2
󤋳
E42F3
󤋴
E42F4
󤋵
E42F5
󤋶
E42F6
󤋷
E42F7
󤋸
E42F8
󤋹
E42F9
󤋺
E42FA
󤋻
E42FB
󤋼
E42FC
󤋽
E42FD
󤋾
E42FE
󤋿
E42FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]