International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A08C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򠌀
A0300
򠌁
A0301
򠌂
A0302
򠌃
A0303
򠌄
A0304
򠌅
A0305
򠌆
A0306
򠌇
A0307
򠌈
A0308
򠌉
A0309
򠌊
A030A
򠌋
A030B
򠌌
A030C
򠌍
A030D
򠌎
A030E
򠌏
A030F
80
90
򠌐
A0310
򠌑
A0311
򠌒
A0312
򠌓
A0313
򠌔
A0314
򠌕
A0315
򠌖
A0316
򠌗
A0317
򠌘
A0318
򠌙
A0319
򠌚
A031A
򠌛
A031B
򠌜
A031C
򠌝
A031D
򠌞
A031E
򠌟
A031F
90
A0
򠌠
A0320
򠌡
A0321
򠌢
A0322
򠌣
A0323
򠌤
A0324
򠌥
A0325
򠌦
A0326
򠌧
A0327
򠌨
A0328
򠌩
A0329
򠌪
A032A
򠌫
A032B
򠌬
A032C
򠌭
A032D
򠌮
A032E
򠌯
A032F
A0
B0
򠌰
A0330
򠌱
A0331
򠌲
A0332
򠌳
A0333
򠌴
A0334
򠌵
A0335
򠌶
A0336
򠌷
A0337
򠌸
A0338
򠌹
A0339
򠌺
A033A
򠌻
A033B
򠌼
A033C
򠌽
A033D
򠌾
A033E
򠌿
A033F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]