International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8

All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

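Currently showing the part of the codepage whose byte sequences start with the bytes F1 82 9D. The row and column labels together give the final byte of each sequence; each cell shows the character and its Unicode code point in hexadecimal.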

        00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80       񂝀     񂝁     񂝂     񂝃     񂝄     񂝅     񂝆     񂝇     񂝈     񂝉     񂝊     񂝋     񂝌     񂝍     񂝎     񂝏
     42740 42741 42742 42743 42744 42745 42746 42747 42748 42749 4274A 4274B 4274C 4274D 4274E 4274F
90       񂝐     񂝑     񂝒     񂝓     񂝔     񂝕     񂝖     񂝗     񂝘     񂝙     񂝚     񂝛     񂝜     񂝝     񂝞     񂝟
     42750 42751 42752 42753 42754 42755 42756 42757 42758 42759 4275A 4275B 4275C 4275D 4275E 4275F
A0       񂝠     񂝡     񂝢     񂝣     񂝤     񂝥     񂝦     񂝧     񂝨     񂝩     񂝪     񂝫     񂝬     񂝭     񂝮     񂝯
     42760 42761 42762 42763 42764 42765 42766 42767 42768 42769 4276A 4276B 4276C 4276D 4276E 4276F
B0       񂝰     񂝱     񂝲     񂝳     񂝴     񂝵     񂝶     񂝷     񂝸     񂝹     񂝺     񂝻     񂝼     񂝽     񂝾     񂝿
     42770 42771 42772 42773 42774 42775 42776 42777 42778 42779 4277A 4277B 4277C 4277D 4277E 4277F

Rows 00-70 and C0-F0 contain no characters.
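
The layout above can be reproduced with a few lines of code. The following is a minimal sketch in Python, using the standard-library UTF-8 codec rather than ICU itself; the names `LEAD`, `decode_manually`, and `layout_rows` are hypothetical and chosen only for illustration. It appends each possible final byte 80..BF to the lead bytes F1 82 9D and decodes the result. Only the rows 80 through B0 are populated because UTF-8 continuation bytes always fall in the range 80..BF.

```python
# Sketch only: rebuilds the populated rows of the layout with Python's
# built-in UTF-8 codec. This is not how the ICU demo itself is implemented.
LEAD = bytes([0xF1, 0x82, 0x9D])  # lead bytes quoted in the section above


def decode_manually(seq):
    """Decode one 4-byte UTF-8 sequence by bit arithmetic (no validation)."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the first byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def layout_rows(lead=LEAD):
    """Map each row label (0x80, 0x90, 0xA0, 0xB0) to its 16 code points."""
    rows = {}
    for row in range(0x80, 0xC0, 0x10):
        cells = []
        for col in range(0x10):
            seq = lead + bytes([row + col])
            cp = ord(seq.decode("utf-8"))
            # The codec and the hand-written arithmetic should agree.
            assert cp == decode_manually(seq)
            cells.append(cp)
        rows[row] = cells
    return rows


if __name__ == "__main__":
    for row, cells in layout_rows().items():
        print(f"{row:02X}", " ".join(f"{cp:05X}" for cp in cells))
```

Running the script prints one line per populated row, with the row label followed by sixteen code points in the same order as the table.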

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
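
The byte counts and the substitution character listed above can be checked with a short sketch. It assumes that "UChar" means a 16-bit UTF-16 code unit, as in ICU4C, and it uses Python's standard codecs instead of ICU. The names `bytes_per_uchar`, `BOUNDARY_CODE_POINTS`, and `SUBSTITUTION` are hypothetical. A supplementary character takes four UTF-8 bytes but also two UTF-16 code units, which is why the maximum per code unit is 3 rather than 4.

```python
# Sketch only: checks the figures above with Python's codecs, not with ICU.
# Assumption: "UChar" is one 16-bit UTF-16 code unit.

def bytes_per_uchar(cp):
    """UTF-8 bytes divided by UTF-16 code units for one code point."""
    ch = chr(cp)
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# First and last code point of each UTF-8 length class (surrogates excluded).
BOUNDARY_CODE_POINTS = [
    0x0000, 0x007F,      # 1 byte,  1 code unit
    0x0080, 0x07FF,      # 2 bytes, 1 code unit
    0x0800, 0xFFFF,      # 3 bytes, 1 code unit
    0x10000, 0x10FFFF,   # 4 bytes, 2 code units
]

# U+FFFD REPLACEMENT CHARACTER, encoded as UTF-8.
SUBSTITUTION = "\uFFFD".encode("utf-8")

if __name__ == "__main__":
    ratios = [bytes_per_uchar(cp) for cp in BOUNDARY_CODE_POINTS]
    print("min bytes per UChar:", min(ratios))
    print("max bytes per UChar:", max(ratios))
    print("substitution bytes:", SUBSTITUTION.hex(" ").upper())
```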

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
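
The set above says that every code point except the surrogates U+D800..U+DFFF can be represented. A minimal sketch of that rule follows, again using Python's standard library as a stand-in for the ICU converter; `REPRESENTABLE`, `in_set`, and `is_representable` are hypothetical names. It compares the set pattern, used as a regular expression, with what the strict UTF-8 encoder actually accepts.

```python
import re

# Sketch only: the set pattern from above, written as a Python regex.
REPRESENTABLE = re.compile(r"[^\uD800-\uDFFF]")


def in_set(cp):
    """True if the code point matches the set [^\\uD800-\\uDFFF]."""
    return REPRESENTABLE.fullmatch(chr(cp)) is not None


def is_representable(cp):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
    except UnicodeEncodeError:
        return False  # lone surrogates are rejected
    return True


if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x42740, 0x10FFFF):
        print(f"U+{cp:04X}", in_set(cp), is_representable(cp))
```

For each sampled code point, the two checks are expected to agree: the surrogate boundaries are rejected and their immediate neighbours are accepted.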