International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F29786

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򗆀
97180
򗆁
97181
򗆂
97182
򗆃
97183
򗆄
97184
򗆅
97185
򗆆
97186
򗆇
97187
򗆈
97188
򗆉
97189
򗆊
9718A
򗆋
9718B
򗆌
9718C
򗆍
9718D
򗆎
9718E
򗆏
9718F
80
90
򗆐
97190
򗆑
97191
򗆒
97192
򗆓
97193
򗆔
97194
򗆕
97195
򗆖
97196
򗆗
97197
򗆘
97198
򗆙
97199
򗆚
9719A
򗆛
9719B
򗆜
9719C
򗆝
9719D
򗆞
9719E
򗆟
9719F
90
A0
򗆠
971A0
򗆡
971A1
򗆢
971A2
򗆣
971A3
򗆤
971A4
򗆥
971A5
򗆦
971A6
򗆧
971A7
򗆨
971A8
򗆩
971A9
򗆪
971AA
򗆫
971AB
򗆬
971AC
򗆭
971AD
򗆮
971AE
򗆯
971AF
A0
B0
򗆰
971B0
򗆱
971B1
򗆲
971B2
򗆳
971B3
򗆴
971B4
򗆵
971B5
򗆶
971B6
򗆷
971B7
򗆸
971B8
򗆹
971B9
򗆺
971BA
򗆻
971BB
򗆼
971BC
򗆽
971BD
򗆾
971BE
򗆿
971BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]