International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IBM IANA
UTF-8 ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F387A2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󇢀
C7880
󇢁
C7881
󇢂
C7882
󇢃
C7883
󇢄
C7884
󇢅
C7885
󇢆
C7886
󇢇
C7887
󇢈
C7888
󇢉
C7889
󇢊
C788A
󇢋
C788B
󇢌
C788C
󇢍
C788D
󇢎
C788E
󇢏
C788F
80
90
󇢐
C7890
󇢑
C7891
󇢒
C7892
󇢓
C7893
󇢔
C7894
󇢕
C7895
󇢖
C7896
󇢗
C7897
󇢘
C7898
󇢙
C7899
󇢚
C789A
󇢛
C789B
󇢜
C789C
󇢝
C789D
󇢞
C789E
󇢟
C789F
90
A0
󇢠
C78A0
󇢡
C78A1
󇢢
C78A2
󇢣
C78A3
󇢤
C78A4
󇢥
C78A5
󇢦
C78A6
󇢧
C78A7
󇢨
C78A8
󇢩
C78A9
󇢪
C78AA
󇢫
C78AB
󇢬
C78AC
󇢭
C78AD
󇢮
C78AE
󇢯
C78AF
A0
B0
󇢰
C78B0
󇢱
C78B1
󇢲
C78B2
󇢳
C78B3
󇢴
C78B4
󇢵
C78B5
󇢶
C78B6
󇢷
C78B7
󇢸
C78B8
󇢹
C78B9
󇢺
C78BA
󇢻
C78BB
󇢼
C78BC
󇢽
C78BD
󇢾
C78BE
󇢿
C78BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]