International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F289A0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򉠀
89800
򉠁
89801
򉠂
89802
򉠃
89803
򉠄
89804
򉠅
89805
򉠆
89806
򉠇
89807
򉠈
89808
򉠉
89809
򉠊
8980A
򉠋
8980B
򉠌
8980C
򉠍
8980D
򉠎
8980E
򉠏
8980F
80
90
򉠐
89810
򉠑
89811
򉠒
89812
򉠓
89813
򉠔
89814
򉠕
89815
򉠖
89816
򉠗
89817
򉠘
89818
򉠙
89819
򉠚
8981A
򉠛
8981B
򉠜
8981C
򉠝
8981D
򉠞
8981E
򉠟
8981F
90
A0
򉠠
89820
򉠡
89821
򉠢
89822
򉠣
89823
򉠤
89824
򉠥
89825
򉠦
89826
򉠧
89827
򉠨
89828
򉠩
89829
򉠪
8982A
򉠫
8982B
򉠬
8982C
򉠭
8982D
򉠮
8982E
򉠯
8982F
A0
B0
򉠰
89830
򉠱
89831
򉠲
89832
򉠳
89833
򉠴
89834
򉠵
89835
򉠶
89836
򉠷
89837
򉠸
89838
򉠹
89839
򉠺
8983A
򉠻
8983B
򉠼
8983C
򉠽
8983D
򉠾
8983E
򉠿
8983F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]