International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B3A2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󳢀
F3880
󳢁
F3881
󳢂
F3882
󳢃
F3883
󳢄
F3884
󳢅
F3885
󳢆
F3886
󳢇
F3887
󳢈
F3888
󳢉
F3889
󳢊
F388A
󳢋
F388B
󳢌
F388C
󳢍
F388D
󳢎
F388E
󳢏
F388F
80
90
󳢐
F3890
󳢑
F3891
󳢒
F3892
󳢓
F3893
󳢔
F3894
󳢕
F3895
󳢖
F3896
󳢗
F3897
󳢘
F3898
󳢙
F3899
󳢚
F389A
󳢛
F389B
󳢜
F389C
󳢝
F389D
󳢞
F389E
󳢟
F389F
90
A0
󳢠
F38A0
󳢡
F38A1
󳢢
F38A2
󳢣
F38A3
󳢤
F38A4
󳢥
F38A5
󳢦
F38A6
󳢧
F38A7
󳢨
F38A8
󳢩
F38A9
󳢪
F38AA
󳢫
F38AB
󳢬
F38AC
󳢭
F38AD
󳢮
F38AE
󳢯
F38AF
A0
B0
󳢰
F38B0
󳢱
F38B1
󳢲
F38B2
󳢳
F38B3
󳢴
F38B4
󳢵
F38B5
󳢶
F38B6
󳢷
F38B7
󳢸
F38B8
󳢹
F38B9
󳢺
F38BA
󳢻
F38BB
󳢼
F38BC
󳢽
F38BD
󳢾
F38BE
󳢿
F38BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]