International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48C9E

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􌞀
10C780
􌞁
10C781
􌞂
10C782
􌞃
10C783
􌞄
10C784
􌞅
10C785
􌞆
10C786
􌞇
10C787
􌞈
10C788
􌞉
10C789
􌞊
10C78A
􌞋
10C78B
􌞌
10C78C
􌞍
10C78D
􌞎
10C78E
􌞏
10C78F
80
90
􌞐
10C790
􌞑
10C791
􌞒
10C792
􌞓
10C793
􌞔
10C794
􌞕
10C795
􌞖
10C796
􌞗
10C797
􌞘
10C798
􌞙
10C799
􌞚
10C79A
􌞛
10C79B
􌞜
10C79C
􌞝
10C79D
􌞞
10C79E
􌞟
10C79F
90
A0
􌞠
10C7A0
􌞡
10C7A1
􌞢
10C7A2
􌞣
10C7A3
􌞤
10C7A4
􌞥
10C7A5
􌞦
10C7A6
􌞧
10C7A7
􌞨
10C7A8
􌞩
10C7A9
􌞪
10C7AA
􌞫
10C7AB
􌞬
10C7AC
􌞭
10C7AD
􌞮
10C7AE
􌞯
10C7AF
A0
B0
􌞰
10C7B0
􌞱
10C7B1
􌞲
10C7B2
􌞳
10C7B3
􌞴
10C7B4
􌞵
10C7B5
􌞶
10C7B6
􌞷
10C7B7
􌞸
10C7B8
􌞹
10C7B9
􌞺
10C7BA
􌞻
10C7BB
􌞼
10C7BC
􌞽
10C7BD
􌞾
10C7BE
􌞿
10C7BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]