International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
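
The alias list above can be read as a many-to-one mapping from alias to internal converter name. The following is a minimal sketch of such a lookup, not ICU's own API: the helper names `normalize` and `resolve` are hypothetical, and the loose matching rule (case-insensitive, ignoring hyphens, underscores, and spaces) is an assumption made for illustration.

```python
# Aliases copied from the list above; the lookup helpers are hypothetical.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Lowercase the name and drop '-', '_' and spaces (assumed matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every alias maps to the single internal converter name shown on this page.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))


if __name__ == "__main__":
    for candidate in ("IBM-1208", "windows_65001", "latin1"):
        print(candidate, "->", resolve(candidate))
```

Under these assumptions, any spelling of a listed alias resolves to `UTF-8`, and a name that is not in the list resolves to `None`.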


Codepage Layout

Currently showing the codepage starting with the bytes F28994

     00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80   򉔀     򉔁     򉔂     򉔃     򉔄     򉔅     򉔆     򉔇     򉔈     򉔉     򉔊     򉔋     򉔌     򉔍     򉔎     򉔏
     89500 89501 89502 89503 89504 89505 89506 89507 89508 89509 8950A 8950B 8950C 8950D 8950E 8950F
90   򉔐     򉔑     򉔒     򉔓     򉔔     򉔕     򉔖     򉔗     򉔘     򉔙     򉔚     򉔛     򉔜     򉔝     򉔞     򉔟
     89510 89511 89512 89513 89514 89515 89516 89517 89518 89519 8951A 8951B 8951C 8951D 8951E 8951F
A0   򉔠     򉔡     򉔢     򉔣     򉔤     򉔥     򉔦     򉔧     򉔨     򉔩     򉔪     򉔫     򉔬     򉔭     򉔮     򉔯
     89520 89521 89522 89523 89524 89525 89526 89527 89528 89529 8952A 8952B 8952C 8952D 8952E 8952F
B0   򉔰     򉔱     򉔲     򉔳     򉔴     򉔵     򉔶     򉔷     򉔸     򉔹     򉔺     򉔻     򉔼     򉔽     򉔾     򉔿
     89530 89531 89532 89533 89534 89535 89536 89537 89538 89539 8953A 8953B 8953C 8953D 8953E 8953F
C0
D0
E0
F0
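
The populated rows follow from the UTF-8 bit layout: with the lead bytes F2 89 94 fixed, each trail byte from 80 to BF contributes the low six bits of the code point. The sketch below reproduces that arithmetic; the function names are illustrative and the helper performs no validation.

```python
def decode_four_byte(b0, b1, b2, b3):
    """Combine a 4-byte UTF-8 sequence into a code point (no validation)."""
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte 11110xxx
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


# The three leading bytes named in "Currently showing ... F28994".
LEAD = (0xF2, 0x89, 0x94)


def page_code_points(lead=LEAD):
    """Code points for every valid trail byte 0x80..0xBF under the given lead."""
    return [decode_four_byte(*lead, trail) for trail in range(0x80, 0xC0)]


if __name__ == "__main__":
    cps = page_code_points()
    print(f"{cps[0]:X} .. {cps[-1]:X} ({len(cps)} cells)")
    # Cross-check the hand-rolled arithmetic against Python's own decoder.
    assert bytes([*LEAD, 0x80]).decode("utf-8") == chr(cps[0])
```

Rows 00 to 70 and C0 to F0 stay empty because a UTF-8 continuation byte must lie in the range 80 to BF.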

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
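
Several of these properties can be checked with Python's built-in codec rather than ICU. The sketch below assumes that a UChar is one 16-bit UTF-16 code unit, so a supplementary character counts as two UChars; the helper name `bytes_per_uchar` is hypothetical.

```python
# U+FFFD REPLACEMENT CHARACTER, encoded in UTF-8.
SUBSTITUTION = "\ufffd".encode("utf-8")


def bytes_per_uchar(ch):
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


def ascii_compatible():
    """True if every byte 0x20..0x7E encodes to itself in UTF-8."""
    return all(chr(c).encode("utf-8") == bytes([c]) for c in range(0x20, 0x7F))


if __name__ == "__main__":
    print(SUBSTITUTION)
    # One sample per UTF-8 length: 1, 2, 3, and 4 bytes.
    for sample in ("A", "\u00e9", "\u20ac", "\U00089500"):
        print(f"U+{ord(sample):04X}", bytes_per_uchar(sample))
    print(ascii_compatible())
```

Read this way, a 4-byte sequence spreads over two UTF-16 code units, which is consistent with a per-UChar maximum of 3 rather than 4.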

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
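
The set pattern above excludes only the surrogate range. As a rough cross-check outside ICU, the sketch below compares that rule with the behavior of Python's strict UTF-8 encoder; both function names are illustrative.

```python
def is_representable(cp):
    """Membership test for [^\\uD800-\\uDFFF] over the Unicode code space."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_as_utf8(cp):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Boundary samples around the surrogate block plus the ends of the range.
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x89500, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp), encodes_as_utf8(cp))
```

For every sampled code point the two checks agree: surrogates are rejected and all other code points, including unassigned ones such as U+89500, are accepted.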