International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18588

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񅈀
45200
񅈁
45201
񅈂
45202
񅈃
45203
񅈄
45204
񅈅
45205
񅈆
45206
񅈇
45207
񅈈
45208
񅈉
45209
񅈊
4520A
񅈋
4520B
񅈌
4520C
񅈍
4520D
񅈎
4520E
񅈏
4520F
80
90
񅈐
45210
񅈑
45211
񅈒
45212
񅈓
45213
񅈔
45214
񅈕
45215
񅈖
45216
񅈗
45217
񅈘
45218
񅈙
45219
񅈚
4521A
񅈛
4521B
񅈜
4521C
񅈝
4521D
񅈞
4521E
񅈟
4521F
90
A0
񅈠
45220
񅈡
45221
񅈢
45222
񅈣
45223
񅈤
45224
񅈥
45225
񅈦
45226
񅈧
45227
񅈨
45228
񅈩
45229
񅈪
4522A
񅈫
4522B
񅈬
4522C
񅈭
4522D
񅈮
4522E
񅈯
4522F
A0
B0
񅈰
45230
񅈱
45231
񅈲
45232
񅈳
45233
񅈴
45234
񅈵
45235
񅈶
45236
񅈷
45237
񅈸
45238
񅈹
45239
񅈺
4523A
񅈻
4523B
񅈼
4523C
񅈽
4523D
񅈾
4523E
񅈿
4523F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]