International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38DA0

Rows give the high nibble and columns the low nibble of the final byte. Each cell lists the Unicode code point (hexadecimal) for that byte sequence. Rows 00-70 and C0-F0 are empty.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  CD800 CD801 CD802 CD803 CD804 CD805 CD806 CD807 CD808 CD809 CD80A CD80B CD80C CD80D CD80E CD80F
90  CD810 CD811 CD812 CD813 CD814 CD815 CD816 CD817 CD818 CD819 CD81A CD81B CD81C CD81D CD81E CD81F
A0  CD820 CD821 CD822 CD823 CD824 CD825 CD826 CD827 CD828 CD829 CD82A CD82B CD82C CD82D CD82E CD82F
B0  CD830 CD831 CD832 CD833 CD834 CD835 CD836 CD837 CD838 CD839 CD83A CD83B CD83C CD83D CD83E CD83F
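The mapping from the lead bytes F38DA0 to the code points in the layout can be checked independently of ICU. The following is a minimal sketch that uses Python's built-in UTF-8 codec rather than the ICU converter; the helper name `code_points_for_prefix` is hypothetical and only illustrates the idea.

```python
# Lead bytes taken from the page heading; the final byte is varied over 00..FF.
PREFIX = bytes.fromhex("F38DA0")


def code_points_for_prefix(prefix: bytes) -> dict:
    """Map each final byte that completes a well-formed sequence to its code point.

    Hypothetical helper: it relies on Python's strict UTF-8 decoder, not on ICU.
    """
    table = {}
    for trail in range(0x100):
        seq = prefix + bytes([trail])
        try:
            text = seq.decode("utf-8")
        except UnicodeDecodeError:
            continue  # ill-formed sequence, so the cell stays empty
        if len(text) == 1:
            table[trail] = ord(text)
    return table


table = code_points_for_prefix(PREFIX)
for trail in sorted(table):
    print(f"{trail:02X} -> U+{table[trail]:05X}")
```

Only final bytes in the continuation range 80-BF complete the four-byte sequence, which is consistent with the empty rows in the layout.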

Information About This Converter

Type of converter                          UCNV_UTF8
Minimum number of bytes per UChar          1
Maximum number of bytes per UChar          3
Substitution character                     \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?           TRUE
Is ASCII [\u0020-\u007E] ambiguous?        FALSE
Contains ambiguous aliases?                FALSE
Always generates Unicode NFC?              UNKNOWN
Contains BiDi characters?                  TRUE
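Several of these properties can be sanity-checked without ICU. The sketch below assumes Python's built-in codecs as a stand-in for the ICU converter and treats a UChar as one UTF-16 code unit, as ICU does; the function `utf8_bytes_per_uchar` is a hypothetical name introduced for illustration.

```python
# The substitution bytes from the table decode to U+FFFD REPLACEMENT CHARACTER.
SUBSTITUTION = b"\xEF\xBF\xBD"
substitution_char = SUBSTITUTION.decode("utf-8")

# Printable ASCII bytes 20..7E decode to the same code points under UTF-8.
ascii_bytes = bytes(range(0x20, 0x7F))
ascii_compatible = ascii_bytes.decode("utf-8") == ascii_bytes.decode("ascii")


def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count (hypothetical helper)."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


samples = ["A", "\u00E9", "\u20AC", "\U000CD800"]
ratios = [utf8_bytes_per_uchar(ch) for ch in samples]
print(f"U+{ord(substitution_char):04X}", ascii_compatible, ratios)
```

A supplementary character such as U+CD800 takes four UTF-8 bytes but two UTF-16 code units, so the per-UChar figure stays at or below 3.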

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
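The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough illustration, assuming Python's strict UTF-8 encoder behaves like the ICU converter on this point, the hypothetical helper `is_representable` below tests individual code points against that rule.

```python
def is_representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as well-formed UTF-8.

    Hypothetical helper built on Python's strict encoder, not on ICU.
    """
    try:
        chr(code_point).encode("utf-8")
    except UnicodeEncodeError:
        return False  # lone surrogates are rejected by the strict encoder
    return True


# Boundary cases around the excluded range, plus the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp))
```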