International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28CA0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򌠀
8C800
򌠁
8C801
򌠂
8C802
򌠃
8C803
򌠄
8C804
򌠅
8C805
򌠆
8C806
򌠇
8C807
򌠈
8C808
򌠉
8C809
򌠊
8C80A
򌠋
8C80B
򌠌
8C80C
򌠍
8C80D
򌠎
8C80E
򌠏
8C80F
80
90
򌠐
8C810
򌠑
8C811
򌠒
8C812
򌠓
8C813
򌠔
8C814
򌠕
8C815
򌠖
8C816
򌠗
8C817
򌠘
8C818
򌠙
8C819
򌠚
8C81A
򌠛
8C81B
򌠜
8C81C
򌠝
8C81D
򌠞
8C81E
򌠟
8C81F
90
A0
򌠠
8C820
򌠡
8C821
򌠢
8C822
򌠣
8C823
򌠤
8C824
򌠥
8C825
򌠦
8C826
򌠧
8C827
򌠨
8C828
򌠩
8C829
򌠪
8C82A
򌠫
8C82B
򌠬
8C82C
򌠭
8C82D
򌠮
8C82E
򌠯
8C82F
A0
B0
򌠰
8C830
򌠱
8C831
򌠲
8C832
򌠳
8C833
򌠴
8C834
򌠵
8C835
򌠶
8C836
򌠷
8C837
򌠸
8C838
򌠹
8C839
򌠺
8C83A
򌠻
8C83B
򌠼
8C83C
򌠽
8C83D
򌠾
8C83E
򌠿
8C83F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]