International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1908C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񐌀
50300
񐌁
50301
񐌂
50302
񐌃
50303
񐌄
50304
񐌅
50305
񐌆
50306
񐌇
50307
񐌈
50308
񐌉
50309
񐌊
5030A
񐌋
5030B
񐌌
5030C
񐌍
5030D
񐌎
5030E
񐌏
5030F
80
90
񐌐
50310
񐌑
50311
񐌒
50312
񐌓
50313
񐌔
50314
񐌕
50315
񐌖
50316
񐌗
50317
񐌘
50318
񐌙
50319
񐌚
5031A
񐌛
5031B
񐌜
5031C
񐌝
5031D
񐌞
5031E
񐌟
5031F
90
A0
񐌠
50320
񐌡
50321
񐌢
50322
񐌣
50323
񐌤
50324
񐌥
50325
񐌦
50326
񐌧
50327
񐌨
50328
񐌩
50329
񐌪
5032A
񐌫
5032B
񐌬
5032C
񐌭
5032D
񐌮
5032E
񐌯
5032F
A0
B0
񐌰
50330
񐌱
50331
񐌲
50332
񐌳
50333
񐌴
50334
񐌵
50335
񐌶
50336
񐌷
50337
񐌸
50338
񐌹
50339
񐌺
5033A
񐌻
5033B
񐌼
5033C
񐌽
5033D
񐌾
5033E
񐌿
5033F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]