International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B582

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𵂀
35080
𵂁
35081
𵂂
35082
𵂃
35083
𵂄
35084
𵂅
35085
𵂆
35086
𵂇
35087
𵂈
35088
𵂉
35089
𵂊
3508A
𵂋
3508B
𵂌
3508C
𵂍
3508D
𵂎
3508E
𵂏
3508F
80
90
𵂐
35090
𵂑
35091
𵂒
35092
𵂓
35093
𵂔
35094
𵂕
35095
𵂖
35096
𵂗
35097
𵂘
35098
𵂙
35099
𵂚
3509A
𵂛
3509B
𵂜
3509C
𵂝
3509D
𵂞
3509E
𵂟
3509F
90
A0
𵂠
350A0
𵂡
350A1
𵂢
350A2
𵂣
350A3
𵂤
350A4
𵂥
350A5
𵂦
350A6
𵂧
350A7
𵂨
350A8
𵂩
350A9
𵂪
350AA
𵂫
350AB
𵂬
350AC
𵂭
350AD
𵂮
350AE
𵂯
350AF
A0
B0
𵂰
350B0
𵂱
350B1
𵂲
350B2
𵂳
350B3
𵂴
350B4
𵂵
350B5
𵂶
350B6
𵂷
350B7
𵂸
350B8
𵂹
350B9
𵂺
350BA
𵂻
350BB
𵂼
350BC
𵂽
350BD
𵂾
350BE
𵂿
350BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]