International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48B82

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􋂀
10B080
􋂁
10B081
􋂂
10B082
􋂃
10B083
􋂄
10B084
􋂅
10B085
􋂆
10B086
􋂇
10B087
􋂈
10B088
􋂉
10B089
􋂊
10B08A
􋂋
10B08B
􋂌
10B08C
􋂍
10B08D
􋂎
10B08E
􋂏
10B08F
80
90
􋂐
10B090
􋂑
10B091
􋂒
10B092
􋂓
10B093
􋂔
10B094
􋂕
10B095
􋂖
10B096
􋂗
10B097
􋂘
10B098
􋂙
10B099
􋂚
10B09A
􋂛
10B09B
􋂜
10B09C
􋂝
10B09D
􋂞
10B09E
􋂟
10B09F
90
A0
􋂠
10B0A0
􋂡
10B0A1
􋂢
10B0A2
􋂣
10B0A3
􋂤
10B0A4
􋂥
10B0A5
􋂦
10B0A6
􋂧
10B0A7
􋂨
10B0A8
􋂩
10B0A9
􋂪
10B0AA
􋂫
10B0AB
􋂬
10B0AC
􋂭
10B0AD
􋂮
10B0AE
􋂯
10B0AF
A0
B0
􋂰
10B0B0
􋂱
10B0B1
􋂲
10B0B2
􋂳
10B0B3
􋂴
10B0B4
􋂵
10B0B5
􋂶
10B0B6
􋂷
10B0B7
􋂸
10B0B8
􋂹
10B0B9
􋂺
10B0BA
􋂻
10B0BB
􋂼
10B0BC
􋂽
10B0BD
􋂾
10B0BE
􋂿
10B0BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]