International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A88A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󨊀
E8280
󨊁
E8281
󨊂
E8282
󨊃
E8283
󨊄
E8284
󨊅
E8285
󨊆
E8286
󨊇
E8287
󨊈
E8288
󨊉
E8289
󨊊
E828A
󨊋
E828B
󨊌
E828C
󨊍
E828D
󨊎
E828E
󨊏
E828F
80
90
󨊐
E8290
󨊑
E8291
󨊒
E8292
󨊓
E8293
󨊔
E8294
󨊕
E8295
󨊖
E8296
󨊗
E8297
󨊘
E8298
󨊙
E8299
󨊚
E829A
󨊛
E829B
󨊜
E829C
󨊝
E829D
󨊞
E829E
󨊟
E829F
90
A0
󨊠
E82A0
󨊡
E82A1
󨊢
E82A2
󨊣
E82A3
󨊤
E82A4
󨊥
E82A5
󨊦
E82A6
󨊧
E82A7
󨊨
E82A8
󨊩
E82A9
󨊪
E82AA
󨊫
E82AB
󨊬
E82AC
󨊭
E82AD
󨊮
E82AE
󨊯
E82AF
A0
B0
󨊰
E82B0
󨊱
E82B1
󨊲
E82B2
󨊳
E82B3
󨊴
E82B4
󨊵
E82B5
󨊶
E82B6
󨊷
E82B7
󨊸
E82B8
󨊹
E82B9
󨊺
E82BA
󨊻
E82BB
󨊼
E82BC
󨊽
E82BD
󨊾
E82BE
󨊿
E82BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]