International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38C8D

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󌍀
CC340
󌍁
CC341
󌍂
CC342
󌍃
CC343
󌍄
CC344
󌍅
CC345
󌍆
CC346
󌍇
CC347
󌍈
CC348
󌍉
CC349
󌍊
CC34A
󌍋
CC34B
󌍌
CC34C
󌍍
CC34D
󌍎
CC34E
󌍏
CC34F
80
90
󌍐
CC350
󌍑
CC351
󌍒
CC352
󌍓
CC353
󌍔
CC354
󌍕
CC355
󌍖
CC356
󌍗
CC357
󌍘
CC358
󌍙
CC359
󌍚
CC35A
󌍛
CC35B
󌍜
CC35C
󌍝
CC35D
󌍞
CC35E
󌍟
CC35F
90
A0
󌍠
CC360
󌍡
CC361
󌍢
CC362
󌍣
CC363
󌍤
CC364
󌍥
CC365
󌍦
CC366
󌍧
CC367
󌍨
CC368
󌍩
CC369
󌍪
CC36A
󌍫
CC36B
󌍬
CC36C
󌍭
CC36D
󌍮
CC36E
󌍯
CC36F
A0
B0
󌍰
CC370
󌍱
CC371
󌍲
CC372
󌍳
CC373
󌍴
CC374
󌍵
CC375
󌍶
CC376
󌍷
CC377
󌍸
CC378
󌍹
CC379
󌍺
CC37A
󌍻
CC37B
󌍼
CC37C
󌍽
CC37D
󌍾
CC37E
󌍿
CC37F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]