International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2 9C A0

Each cell gives the Unicode code point (hex) selected by the fourth byte, whose high nibble is the row and low nibble is the column. Rows 00 through 70 and C0 through F0 are empty.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  9C800 9C801 9C802 9C803 9C804 9C805 9C806 9C807 9C808 9C809 9C80A 9C80B 9C80C 9C80D 9C80E 9C80F
90  9C810 9C811 9C812 9C813 9C814 9C815 9C816 9C817 9C818 9C819 9C81A 9C81B 9C81C 9C81D 9C81E 9C81F
A0  9C820 9C821 9C822 9C823 9C824 9C825 9C826 9C827 9C828 9C829 9C82A 9C82B 9C82C 9C82D 9C82E 9C82F
B0  9C830 9C831 9C832 9C833 9C834 9C835 9C836 9C837 9C838 9C839 9C83A 9C83B 9C83C 9C83D 9C83E 9C83F
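
The lead bytes F2 9C A0 fix the upper bits of the code point, and the fourth byte (80 through BF) supplies the low six bits, which is why this page spans 9C800 through 9C83F. The following is a minimal Python sketch of that bit arithmetic, not ICU's own decoder; the helper name `decode_four_byte` is hypothetical, and it checks only the byte shapes rather than full UTF-8 well-formedness.

```python
def decode_four_byte(b0, b1, b2, b3):
    """Decode one 4-byte UTF-8 sequence into a code point.

    Hypothetical helper: validates only the lead/continuation bit
    patterns, not overlong forms or the U+10FFFF upper limit.
    """
    if b0 & 0xF8 != 0xF0:
        raise ValueError("not a 4-byte lead byte")
    for b in (b1, b2, b3):
        if b & 0xC0 != 0x80:
            raise ValueError("not a continuation byte")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


first = decode_four_byte(0xF2, 0x9C, 0xA0, 0x80)
last = decode_four_byte(0xF2, 0x9C, 0xA0, 0xBF)
print(f"{first:X} {last:X}")  # prints "9C800 9C83F"
```

Fourth-byte values outside 80 through BF are rejected by the helper, which matches the empty rows in the layout.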

Information About This Converter
  Type of converter: UCNV_UTF8
  Minimum number of bytes per UChar: 1
  Maximum number of bytes per UChar: 3
  Substitution character: \xEF\xBF\xBD
  Is ASCII [\x20-\x7E] compatible? TRUE
  Is ASCII [\u0020-\u007E] ambiguous? FALSE
  Contains ambiguous aliases? FALSE
  Always generates Unicode NFC? UNKNOWN
  Contains BiDi characters? TRUE
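
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, so a 4-byte sequence for a supplementary character is spread over two UChars. The sketch below uses Python's built-in codecs (not ICU) to illustrate this under that assumption; `bytes_per_uchar` and the sample characters are chosen here for illustration.

```python
def bytes_per_uchar(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len, utf16_units


# Illustrative samples: ASCII, Latin-1 range, rest of the BMP, supplementary.
samples = ["A", "\u00E9", "\u20AC", "\U0009C800"]
for ch in samples:
    n8, n16 = bytes_per_uchar(ch)
    print(f"U+{ord(ch):04X}: {n8} byte(s) for {n16} UChar(s)")

# The listed substitution bytes decode to U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")
```

In this sample the highest ratio is 3 bytes for a single UChar (U+20AC), while the supplementary character needs 4 bytes for 2 UChars, that is, 2 bytes per UChar.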

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
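
The set notation says that every code point except the surrogates D800 through DFFF is representable. Python's strict `utf-8` codec applies the same exclusion, so it can serve as an independent check; this is a sketch using Python rather than ICU, and `is_representable` is a hypothetical helper name.

```python
def is_representable(cp):
    """True if Python's strict UTF-8 codec can encode this code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Scan the whole code space U+0000..U+10FFFF for code points that fail.
excluded = [cp for cp in range(0x110000) if not is_representable(cp)]
print(f"{excluded[0]:04X}-{excluded[-1]:04X}, {len(excluded)} code points")
# prints "D800-DFFF, 2048 code points"
```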