International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38394

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󃔀
C3500
󃔁
C3501
󃔂
C3502
󃔃
C3503
󃔄
C3504
󃔅
C3505
󃔆
C3506
󃔇
C3507
󃔈
C3508
󃔉
C3509
󃔊
C350A
󃔋
C350B
󃔌
C350C
󃔍
C350D
󃔎
C350E
󃔏
C350F
80
90
󃔐
C3510
󃔑
C3511
󃔒
C3512
󃔓
C3513
󃔔
C3514
󃔕
C3515
󃔖
C3516
󃔗
C3517
󃔘
C3518
󃔙
C3519
󃔚
C351A
󃔛
C351B
󃔜
C351C
󃔝
C351D
󃔞
C351E
󃔟
C351F
90
A0
󃔠
C3520
󃔡
C3521
󃔢
C3522
󃔣
C3523
󃔤
C3524
󃔥
C3525
󃔦
C3526
󃔧
C3527
󃔨
C3528
󃔩
C3529
󃔪
C352A
󃔫
C352B
󃔬
C352C
󃔭
C352D
󃔮
C352E
󃔯
C352F
A0
B0
󃔰
C3530
󃔱
C3531
󃔲
C3532
󃔳
C3533
󃔴
C3534
󃔵
C3535
󃔶
C3536
󃔷
C3537
󃔸
C3538
󃔹
C3539
󃔺
C353A
󃔻
C353B
󃔼
C353C
󃔽
C353D
󃔾
C353E
󃔿
C353F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]