International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3839D

Rows give the high nibble and columns the low nibble of the final byte; each cell shows the resulting code point in hexadecimal. Rows 00 through 70 and C0 through F0 are empty.

     00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80   C3740  C3741  C3742  C3743  C3744  C3745  C3746  C3747  C3748  C3749  C374A  C374B  C374C  C374D  C374E  C374F
90   C3750  C3751  C3752  C3753  C3754  C3755  C3756  C3757  C3758  C3759  C375A  C375B  C375C  C375D  C375E  C375F
A0   C3760  C3761  C3762  C3763  C3764  C3765  C3766  C3767  C3768  C3769  C376A  C376B  C376C  C376D  C376E  C376F
B0   C3770  C3771  C3772  C3773  C3774  C3775  C3776  C3777  C3778  C3779  C377A  C377B  C377C  C377D  C377E  C377F
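
As a rough illustration of how the cells in this layout follow from the lead bytes F3839D, the sketch below appends one trail byte and decodes the four-byte sequence. It uses only Python's built-in UTF-8 codec, not ICU; the helper names `cell_code_point` and `manual_code_point` are made up for this example.

```python
# Lead bytes taken from the "Currently showing" line of this page.
LEAD = bytes.fromhex("F3839D")


def cell_code_point(trail: int) -> int:
    """Decode LEAD plus one trail byte and return the code point."""
    if not 0x80 <= trail <= 0xBF:
        # Only 0x80..0xBF are valid UTF-8 trail bytes, which is why
        # rows 00-70 and C0-F0 of the grid are empty.
        raise ValueError("UTF-8 trail bytes must be in 0x80..0xBF")
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def manual_code_point(trail: int) -> int:
    """Same computation with explicit bit arithmetic (four-byte form)."""
    b0, b1, b2 = LEAD
    return (
        ((b0 & 0x07) << 18)      # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)    # 6 payload bits from each trail byte
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )


print(f"{cell_code_point(0x80):X}")  # C3740 (row 80, column 00)
print(f"{cell_code_point(0xBF):X}")  # C377F (row B0, column 0F)
```

Both helpers agree for every trail byte from 80 to BF, which matches the four populated rows of the grid.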

Information About This Converter

Type of converter:                        UCNV_UTF8
Minimum number of bytes per UChar:        1
Maximum number of bytes per UChar:        3
Substitution character:                   \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?          TRUE
Is ASCII [\u0020-\u007E] ambiguous?       FALSE
Contains ambiguous aliases?               FALSE
Always generates Unicode NFC?             UNKNOWN
Contains BiDi characters?                 TRUE
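
A few of these values can be checked by hand. The sketch below uses Python's built-in codecs rather than ICU, and assumes that a UChar is one 16-bit UTF-16 code unit, as in ICU4C; the helper name `utf8_bytes_per_utf16_unit` is hypothetical.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One ASCII, one two-byte, one three-byte, and one supplementary character.
samples = ["A", "\u00E9", "\u20AC", "\U000C3740"]
print([utf8_bytes_per_utf16_unit(c) for c in samples])  # [1.0, 2.0, 3.0, 2.0]

# The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
print("\uFFFD".encode("utf-8") == b"\xEF\xBF\xBD")  # True

# Printable ASCII bytes decode to the same characters under UTF-8.
ascii_bytes = bytes(range(0x20, 0x7F))
print(ascii_bytes.decode("utf-8") == ascii_bytes.decode("ascii"))  # True
```

A supplementary character takes four UTF-8 bytes but two UTF-16 code units, so the per-unit ratio peaks at 3 for characters in the range U+0800 to U+FFFF.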

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
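
The set above excludes only the surrogate range. As a minimal sketch, the check below expresses that set as a predicate and compares it with what Python's own UTF-8 encoder accepts; this is not ICU behavior, and both function names are made up for this example.

```python
def representable_in_utf8(cp: int) -> bool:
    """True if cp is a code point outside the surrogate range D800..DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates under the default "strict" handler.
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xC3740, 0x10FFFF):
    print(f"U+{cp:04X}", representable_in_utf8(cp), python_can_encode(cp))
```

For each sample the two columns agree: only U+D800 and U+DFFF are rejected.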