International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A0BF

Final byte   Code points (one per column, 0 through F)
00 to 7F     (none)
80 to 8F     U+E0FC0 to U+E0FCF
90 to 9F     U+E0FD0 to U+E0FDF
A0 to AF     U+E0FE0 to U+E0FEF
B0 to BF     U+E0FF0 to U+E0FFF
C0 to FF     (none)
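
The relationship between the lead bytes F3A0BF and the code points U+E0FC0 to U+E0FFF follows from the standard UTF-8 bit layout. The sketch below is only an illustration and is not taken from the ICU sources. The helper name `utf8_encode` is hypothetical. It encodes a code point by hand and compares the result with Python's built-in codec.

```python
def utf8_encode(cp: int) -> bytes:
    """Encode one Unicode scalar value to UTF-8 by hand (illustrative helper)."""
    if not 0 <= cp <= 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError(f"not a Unicode scalar value: {cp:#x}")
    if cp < 0x80:          # 1 byte: 0xxxxxxx
        return bytes([cp])
    if cp < 0x800:         # 2 bytes: 110xxxxx 10xxxxxx
        return bytes([0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)])
    if cp < 0x10000:       # 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    # 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return bytes([0xF0 | (cp >> 18),
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


first = utf8_encode(0xE0FC0)
last = utf8_encode(0xE0FFF)
print(first.hex().upper())  # F3A0BF80
print(last.hex().upper())   # F3A0BFBF

# Cross-check the hand-written encoder against the built-in codec.
assert first == chr(0xE0FC0).encode("utf-8")
```

All 64 code points in the populated rows share the three-byte prefix F3 A0 BF and differ only in the final byte, which runs from 80 to BF.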

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
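
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. The following sketch uses only Python's standard codecs, not ICU, and the helper name `bytes_per_uchar` is hypothetical. It shows why the maximum is 3: a supplementary character takes 4 UTF-8 bytes but spans 2 UChars. It also decodes the substitution bytes.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count (illustrative helper)."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_count = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / uchar_count


samples = ["A", "\u00E9", "\u20AC", "\U000E0FC0"]
for ch in samples:
    print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")

# The substitution character bytes decode to U+FFFD REPLACEMENT CHARACTER.
substitution = bytes.fromhex("EFBFBD").decode("utf-8")
print(f"U+{ord(substitution):04X}")  # U+FFFD
```

For these samples the ratio is 1 for ASCII, 2 for U+00E9, 3 for U+20AC, and 2 for U+E0FC0, so it stays within the stated range of 1 to 3.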

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
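
The set expression above denotes every code point except the surrogate range U+D800 to U+DFFF. As a rough illustration, the sketch below tests membership in that set and checks the result against Python's built-in UTF-8 codec. It does not use ICU's own `UnicodeSet`, and the helper names `is_representable` and `codec_can_encode` are hypothetical.

```python
def is_representable(cp: int) -> bool:
    """True if cp is in [^\\uD800-\\uDFFF], i.e. any code point that is not a surrogate."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def codec_can_encode(cp: int) -> bool:
    """Ask Python's strict UTF-8 codec whether it accepts this code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xE0FC0, 0x10FFFF):
    print(f"U+{cp:04X}: set={is_representable(cp)} codec={codec_can_encode(cp)}")
```

For each sampled code point the two checks agree: only the surrogates U+D800 and U+DFFF are rejected.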