International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
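
As a rough illustration of how the alias list above can be used, the following sketch maps any listed alias back to the internal converter name. The helper `resolve_alias` and the lookup table are hypothetical and are not part of ICU; the sketch assumes simple case-insensitive matching only. ICU's real lookup (for example through `ucnv_open` in ICU4C) applies its own name-matching rules, which this sketch does not reproduce.

```python
from typing import Optional

# Internal converter name and aliases, copied from the alias list above.
CANONICAL_NAME = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Hypothetical lookup table: lower-cased alias -> internal converter name.
_LOOKUP = {alias.lower(): CANONICAL_NAME for alias in ALIASES}


def resolve_alias(name: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown.

    Assumption: matching is case-insensitive and ignores surrounding
    whitespace; nothing else is normalized.
    """
    return _LOOKUP.get(name.strip().lower())


if __name__ == "__main__":
    print(resolve_alias("CP1208"))
    print(resolve_alias("windows-65001"))
    print(resolve_alias("latin-1"))
```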


Codepage Layout

Currently showing the codepage for byte sequences starting with the bytes F2 86 8D

Rows give the high nibble and columns the low nibble of the final byte. Each cell is the Unicode code point (hexadecimal) for that byte sequence.

Byte  00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70 (no mappings)
80    86340 86341 86342 86343 86344 86345 86346 86347 86348 86349 8634A 8634B 8634C 8634D 8634E 8634F
90    86350 86351 86352 86353 86354 86355 86356 86357 86358 86359 8635A 8635B 8635C 8635D 8635E 8635F
A0    86360 86361 86362 86363 86364 86365 86366 86367 86368 86369 8636A 8636B 8636C 8636D 8636E 8636F
B0    86370 86371 86372 86373 86374 86375 86376 86377 86378 86379 8637A 8637B 8637C 8637D 8637E 8637F
C0-F0 (no mappings)
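
The layout above pairs each final byte from 80 to BF, following the bytes F2 86 8D, with a code point from U+86340 to U+8637F. The sketch below shows the bit arithmetic behind that mapping for four-byte UTF-8 sequences. The function name `decode_four_byte` is an illustrative assumption, not an ICU API, and the sketch handles only the four-byte form.

```python
def decode_four_byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a Unicode code point.

    Layout: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx (21 payload bits).
    """
    if len(seq) != 4:
        raise ValueError("expected exactly 4 bytes")
    b0, b1, b2, b3 = seq
    if b0 & 0xF8 != 0xF0:
        raise ValueError("first byte is not a 4-byte lead byte")
    for b in (b1, b2, b3):
        if b & 0xC0 != 0x80:
            raise ValueError("expected a continuation byte in 80..BF")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    cp = ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)
    # Reject overlong forms and values beyond the Unicode range.
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("code point outside the 4-byte range")
    return cp


if __name__ == "__main__":
    for last in (0x80, 0x8F, 0xBF):
        seq = bytes([0xF2, 0x86, 0x8D, last])
        print(seq.hex().upper(), format(decode_four_byte(seq), "X"))
```

A final byte below 80 or above BF is not a continuation byte, which is why the rows 00 through 70 and C0 through F0 are empty.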

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
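
The byte counts above are stated per UChar, that is, per 16-bit UTF-16 code unit. A minimal sketch, using only Python's built-in codecs rather than ICU, shows why the maximum is 3 even though UTF-8 sequences can be 4 bytes long: a 4-byte sequence always corresponds to a surrogate pair, which is two UChars. It also shows that the listed substitution bytes are the UTF-8 form of U+FFFD. The helper name `bytes_per_uchar` is an assumption made for illustration.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    # Each UTF-16 code unit (a UChar in ICU terms) occupies 2 bytes.
    uchar_count = len(ch.encode("utf-16-le")) // 2
    return utf8_len / uchar_count


if __name__ == "__main__":
    print(bytes_per_uchar("A"))            # ASCII: 1 byte, 1 UChar
    print(bytes_per_uchar("\u20ac"))       # BMP: 3 bytes, 1 UChar
    print(bytes_per_uchar("\U00086340"))   # supplementary: 4 bytes, 2 UChars
    # The substitution character listed above is U+FFFD in UTF-8.
    print("\ufffd".encode("utf-8"))
```

Python's own decoder substitutes U+FFFD when called with `errors="replace"`; this is analogous to, but not the same mechanism as, ICU's substitution handling.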

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
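
The set above says that every code point except the surrogate range U+D800 to U+DFFF is representable. A small sketch of that membership test follows, cross-checked against Python's strict UTF-8 encoder, which also rejects lone surrogates. Both function names are hypothetical and chosen for illustration.

```python
def is_representable(cp: int) -> bool:
    """True if the code point is in [^\\uD800-\\uDFFF] within U+0000..U+10FFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_in_utf8(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    for cp in (0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x86340, 0x10FFFF):
        print(format(cp, "04X"), is_representable(cp), encodes_in_utf8(cp))
```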