International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
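
As a rough illustration of how such an alias table is used, the sketch below maps any listed alias back to the internal converter name. The helper `internal_name` and its case-insensitive matching are assumptions made for this example; they are not ICU's API, and ICU's own name matching may be more lenient than this.

```python
from typing import Optional

# Aliases copied from the table above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Hypothetical lookup table: lower-cased alias -> internal converter name.
_ALIAS_TO_NAME = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def internal_name(alias: str) -> Optional[str]:
    """Return the internal converter name for a listed alias, else None.

    Matching is case-insensitive only; this is a simplification.
    """
    return _ALIAS_TO_NAME.get(alias.lower())


if __name__ == "__main__":
    for name in ("IBM-1208", "cp1208", "latin-1"):
        print(name, "->", internal_name(name))
```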


Codepage Layout

Currently showing the codepage starting with the bytes F1B294

Each cell shows the character and its code point (hex) for the final byte given by row + column.
Rows 00 to 70 and C0 to F0 are empty.

      00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
80    񲔀 72500  񲔁 72501  񲔂 72502  񲔃 72503  񲔄 72504  񲔅 72505  񲔆 72506  񲔇 72507  񲔈 72508  񲔉 72509  񲔊 7250A  񲔋 7250B  񲔌 7250C  񲔍 7250D  񲔎 7250E  񲔏 7250F
90    񲔐 72510  񲔑 72511  񲔒 72512  񲔓 72513  񲔔 72514  񲔕 72515  񲔖 72516  񲔗 72517  񲔘 72518  񲔙 72519  񲔚 7251A  񲔛 7251B  񲔜 7251C  񲔝 7251D  񲔞 7251E  񲔟 7251F
A0    񲔠 72520  񲔡 72521  񲔢 72522  񲔣 72523  񲔤 72524  񲔥 72525  񲔦 72526  񲔧 72527  񲔨 72528  񲔩 72529  񲔪 7252A  񲔫 7252B  񲔬 7252C  񲔭 7252D  񲔮 7252E  񲔯 7252F
B0    񲔰 72530  񲔱 72531  񲔲 72532  񲔳 72533  񲔴 72534  񲔵 72535  񲔶 72536  񲔷 72537  񲔸 72538  񲔹 72539  񲔺 7253A  񲔻 7253B  񲔼 7253C  񲔽 7253D  񲔾 7253E  񲔿 7253F
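
The layout above can be reproduced by appending each possible final byte to the lead bytes F1 B2 94 and decoding the result. The sketch below does this with Python's built-in UTF-8 codec rather than ICU, and also shows the bit arithmetic for a four-byte sequence; the function names are made up for this example.

```python
LEAD = bytes.fromhex("F1B294")  # lead bytes shown by the Converter Explorer


def decode_cell(trail: int) -> int:
    """Decode LEAD + one trailing byte and return the code point."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def manual_decode(seq: bytes) -> int:
    """Decode a four-byte UTF-8 sequence by hand (no validation)."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def is_valid_trail(trail: int) -> bool:
    """True if LEAD + trail is a well-formed UTF-8 sequence."""
    try:
        decode_cell(trail)
        return True
    except UnicodeDecodeError:
        return False


if __name__ == "__main__":
    # Print the four populated rows (80, 90, A0, B0) as code points.
    for row in range(0x80, 0xC0, 0x10):
        cells = [f"{decode_cell(row + col):05X}" for col in range(16)]
        print(f"{row:02X}", " ".join(cells))
```

Only final bytes 80 through BF decode successfully, which matches the empty rows 00 to 70 and C0 to F0 in the grid.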

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
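
Several of these properties can be checked with a short sketch. It assumes that "UChar" means a 16-bit UTF-16 code unit, as in ICU, and uses Python's built-in codecs instead of ICU; the helper name is hypothetical.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for one character."""
    n_bytes = len(ch.encode("utf-8"))
    n_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return n_bytes / n_units


# The substitution character bytes decode to U+FFFD REPLACEMENT CHARACTER.
SUBSTITUTION = b"\xEF\xBF\xBD".decode("utf-8")

# Printable ASCII round-trips unchanged, byte for byte.
PRINTABLE_ASCII = bytes(range(0x20, 0x7F))

if __name__ == "__main__":
    print(bytes_per_uchar("A"))           # ASCII: 1 byte, 1 code unit
    print(bytes_per_uchar("\uFFFF"))      # top of the BMP: 3 bytes, 1 code unit
    print(bytes_per_uchar("\U00072500"))  # supplementary: 4 bytes, 2 code units
    print(f"U+{ord(SUBSTITUTION):04X}")
    print(PRINTABLE_ASCII.decode("utf-8").encode("utf-8") == PRINTABLE_ASCII)
```

A supplementary character takes four UTF-8 bytes but occupies two UTF-16 code units, so the per-UChar maximum of 3 comes from the upper part of the Basic Multilingual Plane.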

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
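
The set above covers every code point except the surrogates U+D800 to U+DFFF. A minimal sketch of that rule, using Python's strict UTF-8 encoder as a stand-in for the ICU converter (the function name is an assumption for this example):

```python
MAX_CODE_POINT = 0x10FFFF


def is_representable(cp: int) -> bool:
    """True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(cp).encode("utf-8")  # strict mode rejects lone surrogates
        return True
    except UnicodeEncodeError:
        return False


# Size of [^\uD800-\uDFFF]: all code points minus the 2048 surrogates.
REPRESENTABLE_COUNT = (MAX_CODE_POINT + 1) - (0xDFFF - 0xD800 + 1)

if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x72500):
        print(f"U+{cp:04X}", is_representable(cp))
    print(REPRESENTABLE_COUNT)
```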