International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09C82

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𜂀
1C080
𜂁
1C081
𜂂
1C082
𜂃
1C083
𜂄
1C084
𜂅
1C085
𜂆
1C086
𜂇
1C087
𜂈
1C088
𜂉
1C089
𜂊
1C08A
𜂋
1C08B
𜂌
1C08C
𜂍
1C08D
𜂎
1C08E
𜂏
1C08F
80
90
𜂐
1C090
𜂑
1C091
𜂒
1C092
𜂓
1C093
𜂔
1C094
𜂕
1C095
𜂖
1C096
𜂗
1C097
𜂘
1C098
𜂙
1C099
𜂚
1C09A
𜂛
1C09B
𜂜
1C09C
𜂝
1C09D
𜂞
1C09E
𜂟
1C09F
90
A0
𜂠
1C0A0
𜂡
1C0A1
𜂢
1C0A2
𜂣
1C0A3
𜂤
1C0A4
𜂥
1C0A5
𜂦
1C0A6
𜂧
1C0A7
𜂨
1C0A8
𜂩
1C0A9
𜂪
1C0AA
𜂫
1C0AB
𜂬
1C0AC
𜂭
1C0AD
𜂮
1C0AE
𜂯
1C0AF
A0
B0
𜂰
1C0B0
𜂱
1C0B1
𜂲
1C0B2
𜂳
1C0B3
𜂴
1C0B4
𜂵
1C0B5
𜂶
1C0B6
𜂷
1C0B7
𜂸
1C0B8
𜂹
1C0B9
𜂺
1C0BA
𜂻
1C0BB
𜂼
1C0BC
𜂽
1C0BD
𜂾
1C0BE
𜂿
1C0BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]