International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
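
Because every alias above names the same converter, a caller outside ICU may want to map any of them to one codec. The following is a minimal sketch, not ICU's own lookup: the helper names `normalize` and `python_codec_for` are hypothetical, and the normalization (ignore case, hyphens, underscores, and spaces) is a simplifying assumption about how loosely alias names are compared.

```python
# Aliases copied from the list above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed loose matching)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


_ALIAS_KEYS = {normalize(alias) for alias in UTF8_ALIASES}


def python_codec_for(name: str) -> str:
    """Return Python's codec name for any alias in the list above."""
    if normalize(name) in _ALIAS_KEYS:
        return "utf-8"
    raise LookupError(f"not a known UTF-8 alias: {name!r}")


print(python_codec_for("IBM_1208"))
print("héllo".encode(python_codec_for("windows-65001")))
```

Names that are not in the list raise `LookupError` instead of silently falling back to a default.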


Codepage Layout

Currently showing the codepage starting with the bytes F0B28B

Rows give the high nibble of the final byte, columns the low nibble. Each cell is listed with its Unicode code point range (hexadecimal).

      00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F
00-70 (no mappings)
80    𲋀 𲋁 𲋂 𲋃 𲋄 𲋅 𲋆 𲋇 𲋈 𲋉 𲋊 𲋋 𲋌 𲋍 𲋎 𲋏   322C0-322CF
90    𲋐 𲋑 𲋒 𲋓 𲋔 𲋕 𲋖 𲋗 𲋘 𲋙 𲋚 𲋛 𲋜 𲋝 𲋞 𲋟   322D0-322DF
A0    𲋠 𲋡 𲋢 𲋣 𲋤 𲋥 𲋦 𲋧 𲋨 𲋩 𲋪 𲋫 𲋬 𲋭 𲋮 𲋯   322E0-322EF
B0    𲋰 𲋱 𲋲 𲋳 𲋴 𲋵 𲋶 𲋷 𲋸 𲋹 𲋺 𲋻 𲋼 𲋽 𲋾 𲋿   322F0-322FF
C0-F0 (no mappings)
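
The layout above can be reproduced outside the demo. The sketch below uses Python's built-in UTF-8 codec rather than ICU, and the helper names `decode_trail` and `is_valid_trail` are hypothetical. It appends each possible final byte to the prefix F0 B2 8B and records which ones decode.

```python
PREFIX = bytes.fromhex("F0B28B")  # the three leading bytes shown above


def decode_trail(trail: int) -> str:
    """Decode PREFIX plus one final byte as strict UTF-8."""
    return (PREFIX + bytes([trail])).decode("utf-8")


def is_valid_trail(trail: int) -> bool:
    """True if PREFIX plus this final byte is well-formed UTF-8."""
    try:
        decode_trail(trail)
        return True
    except UnicodeDecodeError:
        return False


first = decode_trail(0x80)
last = decode_trail(0xBF)
print(f"{ord(first):04X} {ord(last):04X}")  # prints "322C0 322FF"

# Only final bytes 80..BF decode, which is why rows 00-70 and C0-F0 are empty.
valid = [t for t in range(256) if is_valid_trail(t)]
print(f"{valid[0]:02X}..{valid[-1]:02X}, {len(valid)} mappings")
```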

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
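
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. The sketch below, using Python's codecs rather than ICU and the hypothetical helper `bytes_per_uchar`, illustrates why the maximum is 3 even though a supplementary character takes 4 UTF-8 bytes: those 4 bytes are spread over 2 UChars. It also checks the substitution character's bytes.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


samples = {
    "A": "A",                  # ASCII: 1 byte, 1 UChar
    "U+20AC": "\u20ac",        # BMP: 3 bytes, 1 UChar
    "U+322C0": "\U000322C0",   # supplementary: 4 bytes, 2 UChars
}
ratios = {label: bytes_per_uchar(ch) for label, ch in samples.items()}
for label, ratio in ratios.items():
    print(label, ratio)

# U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes listed above.
substitution = "\ufffd".encode("utf-8")
print(substitution.hex(" ").upper())  # prints "EF BF BD"
```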

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
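
The set above excludes only the surrogate code points. As a rough check using Python's strict UTF-8 encoder rather than ICU (the helper `is_representable` is hypothetical), the sketch below scans every code point and collects the ones that refuse to encode.

```python
def is_representable(cp: int) -> bool:
    """True if the code point can be encoded as strict UTF-8."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Scan the whole Unicode code space, U+0000..U+10FFFF.
excluded = [cp for cp in range(0x110000) if not is_representable(cp)]
print(f"{excluded[0]:04X}..{excluded[-1]:04X}, {len(excluded)} code points")
# prints "D800..DFFF, 2048 code points"
```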