International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F382A2

Each cell gives the Unicode code point (hex) selected by the final byte: the row is the high nibble and the column is the low nibble. Rows 00 through 70 and C0 through F0 contain no mappings.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  C2880 C2881 C2882 C2883 C2884 C2885 C2886 C2887 C2888 C2889 C288A C288B C288C C288D C288E C288F
90  C2890 C2891 C2892 C2893 C2894 C2895 C2896 C2897 C2898 C2899 C289A C289B C289C C289D C289E C289F
A0  C28A0 C28A1 C28A2 C28A3 C28A4 C28A5 C28A6 C28A7 C28A8 C28A9 C28AA C28AB C28AC C28AD C28AE C28AF
B0  C28B0 C28B1 C28B2 C28B3 C28B4 C28B5 C28B6 C28B7 C28B8 C28B9 C28BA C28BB C28BC C28BD C28BE C28BF
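
The mappings in this layout follow from the standard UTF-8 bit layout for four-byte sequences. The sketch below is illustrative Python, not ICU code; the helper name `decode_utf8_4byte` is hypothetical. It assembles the code point from the lead bytes F3 82 A2 plus a final byte and cross-checks the result against Python's built-in UTF-8 codec.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Assemble a code point from a four-byte UTF-8 sequence.

    Only the bit patterns are checked here; overlong forms and
    out-of-range values are not rejected (illustrative sketch only).
    """
    if b0 & 0xF8 != 0xF0:
        raise ValueError("not a four-byte lead byte")
    for trail in (b1, b2, b3):
        if trail & 0xC0 != 0x80:
            raise ValueError("not a continuation byte")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


PREFIX = (0xF3, 0x82, 0xA2)  # the lead bytes shown on this page

first = decode_utf8_4byte(*PREFIX, 0x80)
last = decode_utf8_4byte(*PREFIX, 0xBF)
print(f"{first:X} {last:X}")  # C2880 C28BF

# Cross-check against Python's own UTF-8 decoder.
assert bytes([*PREFIX, 0x80]).decode("utf-8") == chr(first)
```

Final bytes outside 80..BF are not valid continuation bytes, which is why the other rows of the layout are empty.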

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
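
Two of these properties can be checked by hand. The sketch below uses Python's built-in codecs rather than ICU, so it illustrates the encoding itself and not ICU's converter behavior; the helper name `bytes_per_uchar` is hypothetical. It assumes a UChar is one UTF-16 code unit, which is how ICU defines it.

```python
# The substitution bytes are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.
SUBSTITUTION = b"\xEF\xBF\xBD"
assert SUBSTITUTION.decode("utf-8") == "\uFFFD"


def bytes_per_uchar(ch):
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 sequence length: 1, 2, 3, and 4 bytes.
samples = ["A", "\u00E9", "\u20AC", "\U000C2880"]
ratios = [bytes_per_uchar(ch) for ch in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]
```

A four-byte sequence encodes a supplementary code point, which occupies two UChars, so it contributes only two bytes per UChar. The maximum of 3 comes from three-byte sequences for code points in the Basic Multilingual Plane.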

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
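
The set pattern above reads as "every code point except the surrogate range U+D800..U+DFFF". As a rough illustration (Python's strict UTF-8 codec, not ICU; both helper names are hypothetical), the sketch below compares that rule with what the codec actually accepts at the range boundaries.

```python
def is_representable(cp):
    """Mirror of the set [^\\uD800-\\uDFFF] over the Unicode code space."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def strict_encode_ok(cp):
    """True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates: they are not Unicode scalar values.
        return False


# Boundary code points around the surrogate block and the ends of the code space.
boundaries = [0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
for cp in boundaries:
    assert is_representable(cp) == strict_encode_ok(cp)
    print(f"U+{cp:04X} representable: {is_representable(cp)}")
```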