International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A989

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򩉀
A9240
򩉁
A9241
򩉂
A9242
򩉃
A9243
򩉄
A9244
򩉅
A9245
򩉆
A9246
򩉇
A9247
򩉈
A9248
򩉉
A9249
򩉊
A924A
򩉋
A924B
򩉌
A924C
򩉍
A924D
򩉎
A924E
򩉏
A924F
80
90
򩉐
A9250
򩉑
A9251
򩉒
A9252
򩉓
A9253
򩉔
A9254
򩉕
A9255
򩉖
A9256
򩉗
A9257
򩉘
A9258
򩉙
A9259
򩉚
A925A
򩉛
A925B
򩉜
A925C
򩉝
A925D
򩉞
A925E
򩉟
A925F
90
A0
򩉠
A9260
򩉡
A9261
򩉢
A9262
򩉣
A9263
򩉤
A9264
򩉥
A9265
򩉦
A9266
򩉧
A9267
򩉨
A9268
򩉩
A9269
򩉪
A926A
򩉫
A926B
򩉬
A926C
򩉭
A926D
򩉮
A926E
򩉯
A926F
A0
B0
򩉰
A9270
򩉱
A9271
򩉲
A9272
򩉳
A9273
򩉴
A9274
򩉵
A9275
򩉶
A9276
򩉷
A9277
򩉸
A9278
򩉹
A9279
򩉺
A927A
򩉻
A927B
򩉼
A927C
򩉽
A927D
򩉾
A927E
򩉿
A927F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]