International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28E9E

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򎞀
8E780
򎞁
8E781
򎞂
8E782
򎞃
8E783
򎞄
8E784
򎞅
8E785
򎞆
8E786
򎞇
8E787
򎞈
8E788
򎞉
8E789
򎞊
8E78A
򎞋
8E78B
򎞌
8E78C
򎞍
8E78D
򎞎
8E78E
򎞏
8E78F
80
90
򎞐
8E790
򎞑
8E791
򎞒
8E792
򎞓
8E793
򎞔
8E794
򎞕
8E795
򎞖
8E796
򎞗
8E797
򎞘
8E798
򎞙
8E799
򎞚
8E79A
򎞛
8E79B
򎞜
8E79C
򎞝
8E79D
򎞞
8E79E
򎞟
8E79F
90
A0
򎞠
8E7A0
򎞡
8E7A1
򎞢
8E7A2
򎞣
8E7A3
򎞤
8E7A4
򎞥
8E7A5
򎞦
8E7A6
򎞧
8E7A7
򎞨
8E7A8
򎞩
8E7A9
򎞪
8E7AA
򎞫
8E7AB
򎞬
8E7AC
򎞭
8E7AD
򎞮
8E7AE
򎞯
8E7AF
A0
B0
򎞰
8E7B0
򎞱
8E7B1
򎞲
8E7B2
򎞳
8E7B3
򎞴
8E7B4
򎞵
8E7B5
򎞶
8E7B6
򎞷
8E7B7
򎞸
8E7B8
򎞹
8E7B9
򎞺
8E7BA
򎞻
8E7BB
򎞼
8E7BC
򎞽
8E7BD
򎞾
8E7BE
򎞿
8E7BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]