International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
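
The alias list above can be treated as a lookup table from any alias to the internal converter name. The following is a minimal sketch in Python, not ICU's implementation: the names `UTF8_ALIASES`, `normalize`, and `canonical_name` are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for illustration.

```python
# Aliases copied from the list above. The lookup scheme is an assumption.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical loose matching: ignore case, '-', '_' and spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every alias maps to the single internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name, or None for an unknown alias."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


print(canonical_name("IBM_1208"))       # UTF-8
print(canonical_name("windows-65001"))  # UTF-8
print(canonical_name("latin1"))         # None
```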


Codepage Layout

Currently showing the codepage starting with the bytes F0A38C

Rows and columns give the final byte (row label plus column label). Each row ends with the range of code points (hex) it covers, in column order.

       00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F
00-70  (empty)
80     𣌀 𣌁 𣌂 𣌃 𣌄 𣌅 𣌆 𣌇 𣌈 𣌉 𣌊 𣌋 𣌌 𣌍 𣌎 𣌏   23300-2330F
90     𣌐 𣌑 𣌒 𣌓 𣌔 𣌕 𣌖 𣌗 𣌘 𣌙 𣌚 𣌛 𣌜 𣌝 𣌞 𣌟   23310-2331F
A0     𣌠 𣌡 𣌢 𣌣 𣌤 𣌥 𣌦 𣌧 𣌨 𣌩 𣌪 𣌫 𣌬 𣌭 𣌮 𣌯   23320-2332F
B0     𣌰 𣌱 𣌲 𣌳 𣌴 𣌵 𣌶 𣌷 𣌸 𣌹 𣌺 𣌻 𣌼 𣌽 𣌾 𣌿   23330-2333F
C0-F0  (empty)
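
The mapping from the lead bytes F0A38C plus a final byte to the code points shown in the layout can be reproduced with the standard UTF-8 bit arithmetic. The sketch below is illustrative only: `decode_4byte` is a hypothetical helper that performs no validation, and the cross-check uses Python's built-in codec rather than ICU.

```python
def decode_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine a 4-byte UTF-8 sequence into a code point (no validation)."""
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each trail byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


lead = (0xF0, 0xA3, 0x8C)

# Trail bytes run from 80 to BF, which is why only rows 80-B0 are populated.
first = decode_4byte(*lead, 0x80)
last = decode_4byte(*lead, 0xBF)
print(f"{first:X}..{last:X}")  # 23300..2333F

# Cross-check against Python's built-in UTF-8 codec.
assert bytes([*lead, 0x80]).decode("utf-8") == chr(0x23300)
```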

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
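
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though a UTF-8 sequence can be 4 bytes long: a 4-byte sequence corresponds to two UTF-16 code units. A small sketch using Python's standard codecs (not ICU) illustrates this, along with the substitution bytes being the UTF-8 form of U+FFFD. The helper name `bytes_per_utf16_unit` is hypothetical.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


print(bytes_per_utf16_unit("A"))           # 1.0  (the minimum)
print(bytes_per_utf16_unit("\uFFFD"))      # 3.0  (the maximum)
print(bytes_per_utf16_unit("\U00023300"))  # 2.0  (4 bytes over 2 code units)

# The substitution character bytes are U+FFFD encoded as UTF-8.
print("\uFFFD".encode("utf-8"))            # b'\xef\xbf\xbd'
```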

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
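
The set above says that every Unicode code point except the surrogate range U+D800 to U+DFFF is representable. As a hedged illustration using Python's built-in UTF-8 codec (which applies the same restriction, but is not ICU), the sketch below checks membership and counts the representable code points. The helper names `representable` and `python_can_encode` are hypothetical.

```python
def representable(cp: int) -> bool:
    """True for any code point outside the surrogate range D800..DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """Check the same property empirically with Python's UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# 0x110000 code points in total, minus 0x800 surrogates.
count = sum(representable(cp) for cp in range(0x110000))
print(count)                       # 1112064

print(python_can_encode(0x23300))  # True
print(python_can_encode(0xD800))   # False
```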