International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F384AF

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󄯀
C4BC0
󄯁
C4BC1
󄯂
C4BC2
󄯃
C4BC3
󄯄
C4BC4
󄯅
C4BC5
󄯆
C4BC6
󄯇
C4BC7
󄯈
C4BC8
󄯉
C4BC9
󄯊
C4BCA
󄯋
C4BCB
󄯌
C4BCC
󄯍
C4BCD
󄯎
C4BCE
󄯏
C4BCF
80
90
󄯐
C4BD0
󄯑
C4BD1
󄯒
C4BD2
󄯓
C4BD3
󄯔
C4BD4
󄯕
C4BD5
󄯖
C4BD6
󄯗
C4BD7
󄯘
C4BD8
󄯙
C4BD9
󄯚
C4BDA
󄯛
C4BDB
󄯜
C4BDC
󄯝
C4BDD
󄯞
C4BDE
󄯟
C4BDF
90
A0
󄯠
C4BE0
󄯡
C4BE1
󄯢
C4BE2
󄯣
C4BE3
󄯤
C4BE4
󄯥
C4BE5
󄯦
C4BE6
󄯧
C4BE7
󄯨
C4BE8
󄯩
C4BE9
󄯪
C4BEA
󄯫
C4BEB
󄯬
C4BEC
󄯭
C4BED
󄯮
C4BEE
󄯯
C4BEF
A0
B0
󄯰
C4BF0
󄯱
C4BF1
󄯲
C4BF2
󄯳
C4BF3
󄯴
C4BF4
󄯵
C4BF5
󄯶
C4BF6
󄯷
C4BF7
󄯸
C4BF8
󄯹
C4BF9
󄯺
C4BFA
󄯻
C4BFB
󄯼
C4BFC
󄯽
C4BFD
󄯾
C4BFE
󄯿
C4BFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]