International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
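
As a rough illustration of how an alias list like this can be used, the sketch below resolves any listed alias to the internal converter name. The table is copied by hand from the listing above rather than read from ICU, the names `UTF8_ALIASES` and `canonical_name` are hypothetical, and only case is folded; ICU's own alias matching may be more lenient than this.

```python
from typing import Optional

# Hypothetical alias table, copied from the listing above (not queried from ICU).
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption: aliases are compared case-insensitively and nothing else is folded.
ALIAS_TO_CANONICAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(alias: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(alias.strip().lower())


print(canonical_name("CP1208"))         # UTF-8
print(canonical_name("windows-65001"))  # UTF-8
print(canonical_name("latin-1"))        # None
```

A flat dictionary is enough here because every alias in the listing maps to the same converter; a table covering several converters would map each alias to its own internal name.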


Codepage Layout

Currently showing the codepage starting with the bytes F0 B9 89 (row and column labels give the final byte, in hexadecimal)

      00        01        02        03        04        05        06        07        08        09        0A        0B        0C        0D        0E        0F
00
10
20
30
40
50
60
70
80    𹉀 39240  𹉁 39241  𹉂 39242  𹉃 39243  𹉄 39244  𹉅 39245  𹉆 39246  𹉇 39247  𹉈 39248  𹉉 39249  𹉊 3924A  𹉋 3924B  𹉌 3924C  𹉍 3924D  𹉎 3924E  𹉏 3924F
90    𹉐 39250  𹉑 39251  𹉒 39252  𹉓 39253  𹉔 39254  𹉕 39255  𹉖 39256  𹉗 39257  𹉘 39258  𹉙 39259  𹉚 3925A  𹉛 3925B  𹉜 3925C  𹉝 3925D  𹉞 3925E  𹉟 3925F
A0    𹉠 39260  𹉡 39261  𹉢 39262  𹉣 39263  𹉤 39264  𹉥 39265  𹉦 39266  𹉧 39267  𹉨 39268  𹉩 39269  𹉪 3926A  𹉫 3926B  𹉬 3926C  𹉭 3926D  𹉮 3926E  𹉯 3926F
B0    𹉰 39270  𹉱 39271  𹉲 39272  𹉳 39273  𹉴 39274  𹉵 39275  𹉶 39276  𹉷 39277  𹉸 39278  𹉹 39279  𹉺 3927A  𹉻 3927B  𹉼 3927C  𹉽 3927D  𹉾 3927E  𹉿 3927F
C0
D0
E0
F0
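
The mapping in the layout above can be reproduced with a short sketch: append a final byte to the lead bytes F0 B9 89 and decode the four-byte sequence as UTF-8. The helper names `decode_cell` and `manual_decode` are illustrative, and Python's built-in codec stands in for the ICU converter here.

```python
LEAD = bytes([0xF0, 0xB9, 0x89])  # lead bytes of the page shown above


def decode_cell(final_byte: int) -> int:
    """Decode LEAD + final_byte as UTF-8 and return the resulting code point."""
    return ord((LEAD + bytes([final_byte])).decode("utf-8"))


def manual_decode(b0: int, b1: int, b2: int, b3: int) -> int:
    """Assemble a four-byte UTF-8 sequence by hand (3 + 6 + 6 + 6 payload bits)."""
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


def is_populated(final_byte: int) -> bool:
    """True if LEAD + final_byte is well-formed UTF-8, i.e. the cell is not empty."""
    try:
        decode_cell(final_byte)
        return True
    except UnicodeDecodeError:
        return False


for final in (0x80, 0x8F, 0xBF):
    print(f"{final:02X} -> U+{decode_cell(final):04X}")

# Only continuation bytes 80..BF complete the sequence.
print([f"{b:02X}" for b in (0x7F, 0x80, 0xBF, 0xC0) if is_populated(b)])
```

Rows 00 to 70 and C0 to F0 stay empty because a byte outside 80..BF is not a UTF-8 continuation byte, so it cannot complete the sequence.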

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
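
One way to read the byte counts above, assuming a UChar is a single UTF-16 code unit, is to divide a character's UTF-8 length by its UTF-16 length. The sketch below does that with Python's built-in codecs (not ICU) and also checks that the listed substitution bytes are the UTF-8 form of U+FFFD. The helper name `bytes_per_uchar` is illustrative.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# ASCII, Latin-1 supplement, a 3-byte BMP character, a supplementary character.
for ch in ("A", "\u00e9", "\u20ac", "\U00039240"):
    print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)}")

# The substitution character listed above is U+FFFD encoded as UTF-8.
SUBSTITUTION = "\ufffd".encode("utf-8")
print(SUBSTITUTION)

# Python's "replace" error handler likewise emits U+FFFD for malformed input.
print(b"abc\xffdef".decode("utf-8", errors="replace"))
```

Under this reading the ratios are 1.0, 2.0, 3.0 and 2.0: a supplementary character takes four UTF-8 bytes but also two UTF-16 code units, so it never exceeds the three-byte maximum per UChar.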

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
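
The set above excludes only the surrogate range. A minimal sketch of that membership test follows, cross-checked against Python's own UTF-8 encoder (not ICU), which refuses lone surrogates. The function names are illustrative.

```python
def is_representable(code_point: int) -> bool:
    """True if the code point is in U+0000..U+10FFFF and is not a surrogate."""
    if not 0 <= code_point <= 0x10FFFF:
        return False
    return not 0xD800 <= code_point <= 0xDFFF


def encodes_cleanly(code_point: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary cases on either side of the surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x39240, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)} {encodes_cleanly(cp)}")
```

The two checks agree on every boundary case shown: U+D800 and U+DFFF are rejected, and everything else is accepted.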