International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
WINDOWS
UTF-8 windows-65001
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F19CA2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񜢀
5C880
񜢁
5C881
񜢂
5C882
񜢃
5C883
񜢄
5C884
񜢅
5C885
񜢆
5C886
񜢇
5C887
񜢈
5C888
񜢉
5C889
񜢊
5C88A
񜢋
5C88B
񜢌
5C88C
񜢍
5C88D
񜢎
5C88E
񜢏
5C88F
80
90
񜢐
5C890
񜢑
5C891
񜢒
5C892
񜢓
5C893
񜢔
5C894
񜢕
5C895
񜢖
5C896
񜢗
5C897
񜢘
5C898
񜢙
5C899
񜢚
5C89A
񜢛
5C89B
񜢜
5C89C
񜢝
5C89D
񜢞
5C89E
񜢟
5C89F
90
A0
񜢠
5C8A0
񜢡
5C8A1
񜢢
5C8A2
񜢣
5C8A3
񜢤
5C8A4
񜢥
5C8A5
񜢦
5C8A6
񜢧
5C8A7
񜢨
5C8A8
񜢩
5C8A9
񜢪
5C8AA
񜢫
5C8AB
񜢬
5C8AC
񜢭
5C8AD
񜢮
5C8AE
񜢯
5C8AF
A0
B0
񜢰
5C8B0
񜢱
5C8B1
񜢲
5C8B2
񜢳
5C8B3
񜢴
5C8B4
񜢵
5C8B5
񜢶
5C8B6
񜢷
5C8B7
񜢸
5C8B8
񜢹
5C8B9
񜢺
5C8BA
񜢻
5C8BB
񜢼
5C8BC
񜢽
5C8BD
񜢾
5C8BE
񜢿
5C8BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]