International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
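
As a rough illustration of how the alias list above might be used, the following sketch checks whether a given name refers to this converter. It does not call ICU; the alias list is copied from this page, and the loose matching rule (ignore case, hyphens, underscores and spaces) is an assumption modeled on how ICU is generally documented to compare converter names. The names `UTF8_ALIASES`, `normalize` and `is_utf8_alias` are hypothetical.

```python
# Aliases copied from the list on this page.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces.

    Assumption: this approximates loose converter-name matching;
    it is not ICU's actual comparison routine.
    """
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Precompute the normalized forms once for fast membership tests.
_NORMALIZED = {normalize(alias) for alias in UTF8_ALIASES}


def is_utf8_alias(name: str) -> bool:
    """Return True if `name` loosely matches one of the listed aliases."""
    return normalize(name) in _NORMALIZED


print(is_utf8_alias("utf8"))          # True
print(is_utf8_alias("IBM_1208"))      # True
print(is_utf8_alias("windows-1252"))  # False
```

In real code, the ICU alias API would be the authoritative source; a hard-coded list like this one only mirrors what the page shows.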


Codepage Layout

Currently showing the codepage for byte sequences starting with the bytes F3 B0 82. The row label plus the column label gives the final byte of the sequence.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00
10
20
30
40
50
60
70
80  F0080  F0081  F0082  F0083  F0084  F0085  F0086  F0087  F0088  F0089  F008A  F008B  F008C  F008D  F008E  F008F
90  F0090  F0091  F0092  F0093  F0094  F0095  F0096  F0097  F0098  F0099  F009A  F009B  F009C  F009D  F009E  F009F
A0  F00A0  F00A1  F00A2  F00A3  F00A4  F00A5  F00A6  F00A7  F00A8  F00A9  F00AA  F00AB  F00AC  F00AD  F00AE  F00AF
B0  F00B0  F00B1  F00B2  F00B3  F00B4  F00B5  F00B6  F00B7  F00B8  F00B9  F00BA  F00BB  F00BC  F00BD  F00BE  F00BF
C0
D0
E0
F0
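
The code points in the layout follow from the standard UTF-8 bit layout for four-byte sequences. The sketch below reassembles them by hand for the lead bytes F3 B0 82 and cross-checks the result against Python's built-in decoder. The names `LEAD` and `decode_four_byte` are made up for this illustration.

```python
# Lead bytes shown on this page; the fourth byte ranges over 0x80..0xBF.
LEAD = bytes([0xF3, 0xB0, 0x82])


def decode_four_byte(seq: bytes) -> int:
    """Assemble a code point from a 4-byte UTF-8 sequence.

    Layout: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    (no validation; assumes a well-formed sequence).
    """
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


trail_bytes = range(0x80, 0xC0)
code_points = [decode_four_byte(LEAD + bytes([t])) for t in trail_bytes]

# Cross-check the manual decoding against the built-in UTF-8 codec.
for t, cp in zip(trail_bytes, code_points):
    assert ord((LEAD + bytes([t])).decode("utf-8")) == cp

print(f"{code_points[0]:X} .. {code_points[-1]:X}")  # F0080 .. F00BF
```

Only trail bytes 80 through BF are valid continuation bytes, which is why every other row of the layout is empty.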

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
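
The byte counts above are per UChar, which this sketch takes to mean one UTF-16 code unit (an assumption about ICU's terminology that matches the figures shown). Under that reading, a supplementary character takes four UTF-8 bytes but spans two UChars, so the per-UChar maximum stays at 3. The sample characters and the helper `utf16_units` are chosen only for illustration.

```python
def utf16_units(ch: str) -> int:
    """Number of UTF-16 code units needed for a single character."""
    return len(ch.encode("utf-16-le")) // 2


# One sample per UTF-8 length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U000F0080"]

ratios = []
for ch in samples:
    n_bytes = len(ch.encode("utf-8"))
    units = utf16_units(ch)
    ratios.append(n_bytes / units)
    print(f"U+{ord(ch):04X}: {n_bytes} byte(s) / {units} UChar(s)")

print(min(ratios), max(ratios))  # 1.0 3.0

# The substitution character listed above is U+FFFD encoded in UTF-8.
substitution = "\ufffd".encode("utf-8")
print(substitution)  # b'\xef\xbf\xbd'
```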

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
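
The set above reads as "every code point except the surrogate range D800 through DFFF". As a small check, the sketch below compares that rule with what Python's strict UTF-8 encoder accepts. It relies on Python's codec rather than ICU, and the function names are hypothetical.

```python
def is_representable(cp: int) -> bool:
    """True if the code point lies outside the surrogate range D800..DFFF."""
    return not (0xD800 <= cp <= 0xDFFF)


def encodes_strictly(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe the boundaries of the excluded range and both ends of Unicode.
probes = (0x0000, 0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF)
for cp in probes:
    assert is_representable(cp) == encodes_strictly(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```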