International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
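
As a rough illustration of how an alias list like the one above can be used, the following Python sketch maps any listed alias back to the internal converter name. The helper names (`normalize`, `resolve`) are hypothetical, and the matching rule (ignore case, '-', '_' and spaces) is a simplified assumption, not ICU's actual lookup code.

```python
# Aliases copied from the list above; the lookup helpers are hypothetical.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (assumed, simplified rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): CANONICAL for alias in ALIASES}


def resolve(name: str):
    """Return the internal converter name, or None for an unknown alias."""
    return ALIAS_TABLE.get(normalize(name))


print(resolve("IBM_1208"))   # an alias written with different case and separator
print(resolve("latin-1"))    # not in the list
```

Normalizing both the stored aliases and the query keeps the lookup to one dictionary access, at the cost of treating names that differ only in separators as identical.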


Codepage Layout

Currently showing the codepage starting with the bytes F39DA0

Rows give the high nibble and columns the low nibble of the final byte; each cell shows the Unicode code point (hex) for that byte sequence.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80  DD800 DD801 DD802 DD803 DD804 DD805 DD806 DD807 DD808 DD809 DD80A DD80B DD80C DD80D DD80E DD80F
90  DD810 DD811 DD812 DD813 DD814 DD815 DD816 DD817 DD818 DD819 DD81A DD81B DD81C DD81D DD81E DD81F
A0  DD820 DD821 DD822 DD823 DD824 DD825 DD826 DD827 DD828 DD829 DD82A DD82B DD82C DD82D DD82E DD82F
B0  DD830 DD831 DD832 DD833 DD834 DD835 DD836 DD837 DD838 DD839 DD83A DD83B DD83C DD83D DD83E DD83F
C0-F0  (empty)
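
The layout above can be reproduced with a few lines of Python. This is a minimal sketch that uses Python's built-in UTF-8 codec rather than ICU, and the function names are illustrative only. It appends one trail byte to the lead bytes F3 9D A0 and checks the result against the standard 4-byte UTF-8 bit layout.

```python
LEAD = bytes.fromhex("F39DA0")  # lead bytes from the layout caption


def code_point(trail: int) -> int:
    """Decode LEAD + one trail byte with the codec; return the code point."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def manual_code_point(trail: int) -> int:
    """Same value computed from the 4-byte UTF-8 bit layout, without the codec."""
    b0, b1, b2 = LEAD
    return (
        ((b0 & 0x07) << 18)      # 3 payload bits from the first byte
        | ((b1 & 0x3F) << 12)    # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )


for trail in (0x80, 0x8F, 0xB0, 0xBF):
    print(f"{trail:02X} -> U+{code_point(trail):05X}")
```

Only trail bytes 80 through BF are valid continuation bytes, which is why the other rows of the grid are empty.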

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
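
The byte counts above are per UChar, ICU's UTF-16 code unit, not per character. The following Python sketch (standard codecs only, with a hypothetical helper name) shows why the maximum is 3: a supplementary character takes 4 UTF-8 bytes but also 2 UTF-16 code units.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# One sample per UTF-8 length: ASCII, 2-byte, 3-byte, and a 4-byte character.
samples = ["A", "\u00e9", "\u20ac", "\U000DD800"]
for ch in samples:
    print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")

# The substitution character listed above is the UTF-8 form of U+FFFD.
assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"
```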

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
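
The set above excludes only the surrogate code points. As a small check, assuming Python's strict UTF-8 codec as a stand-in for the converter (the function names are illustrative), membership in the set can be compared with what the codec actually accepts:

```python
def in_set(cp: int) -> bool:
    """True if cp is not a surrogate, i.e. lies outside U+D800..U+DFFF."""
    return not (0xD800 <= cp <= 0xDFFF)


def encodes(cp: int) -> bool:
    """True if Python's strict UTF-8 codec accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Compare both predicates around the surrogate range and at the extremes.
probe = list(range(0xD7F0, 0xE010)) + [0x0000, 0x0041, 0xFFFF, 0xDD800, 0x10FFFF]
mismatches = [cp for cp in probe if in_set(cp) != encodes(cp)]
print("mismatches:", mismatches)
```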