International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name    All Aliases
UTF-8                      UTF-8
                           ibm-1208
                           ibm-1209
                           ibm-5304
                           ibm-5305
                           ibm-13496
                           ibm-13497
                           ibm-17592
                           ibm-17593
                           windows-65001
                           cp1208
                           x-UTF_8J
                           unicode-1-1-utf-8
                           unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2 B2 A0

Each cell gives the code point (hexadecimal) of the character shown at that position. The row label plus the column label is the final byte. Rows 00-70 and C0-F0 contain no entries.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  B2800 B2801 B2802 B2803 B2804 B2805 B2806 B2807 B2808 B2809 B280A B280B B280C B280D B280E B280F
90  B2810 B2811 B2812 B2813 B2814 B2815 B2816 B2817 B2818 B2819 B281A B281B B281C B281D B281E B281F
A0  B2820 B2821 B2822 B2823 B2824 B2825 B2826 B2827 B2828 B2829 B282A B282B B282C B282D B282E B282F
B0  B2830 B2831 B2832 B2833 B2834 B2835 B2836 B2837 B2838 B2839 B283A B283B B283C B283D B283E B283F
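
The code points in the layout above follow from the standard UTF-8 bit layout for four-byte sequences. The following is a minimal Python sketch of that arithmetic, not the demo's own implementation; the function name `decode_four_byte` is hypothetical and chosen only for illustration.

```python
def decode_four_byte(lead, b2, b3, b4):
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point.

    The lead byte (11110xxx) contributes 3 bits; each trail byte
    (10xxxxxx) contributes 6 bits. No validation is performed here.
    """
    return ((lead & 0x07) << 18) | ((b2 & 0x3F) << 12) | ((b3 & 0x3F) << 6) | (b4 & 0x3F)


# The three-byte prefix shown in the codepage layout.
prefix = (0xF2, 0xB2, 0xA0)

# First and last cells of the populated rows (final byte 0x80 and 0xBF).
first = decode_four_byte(*prefix, 0x80)
last = decode_four_byte(*prefix, 0xBF)

# Cross-check against Python's built-in UTF-8 decoder.
builtin = ord(bytes([*prefix, 0x80]).decode("utf-8"))

print(f"{first:X} {last:X} {builtin:X}")
```

Only final bytes in the range 80-BF are valid trail bytes, which is consistent with the other rows of the layout being empty.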

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
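
The minimum and maximum above are counted per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. The sketch below, in Python rather than ICU, illustrates why the maximum is 3 even though some characters need four UTF-8 bytes: those characters also occupy two UChars. The helper name `utf8_bytes_per_uchar` is hypothetical.

```python
def utf8_bytes_per_uchar(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_count = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len, uchar_count


ascii_case = utf8_bytes_per_uchar("A")            # 1 byte, 1 UChar
bmp_case = utf8_bytes_per_uchar("\u20ac")         # 3 bytes, 1 UChar
supp_case = utf8_bytes_per_uchar(chr(0xB2800))    # 4 bytes, 2 UChars

# The substitution character bytes decode to U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")

for label, (n_bytes, n_uchars) in [("ASCII", ascii_case),
                                   ("BMP", bmp_case),
                                   ("supplementary", supp_case)]:
    print(label, n_bytes / n_uchars)
```

A supplementary character therefore averages 2 bytes per UChar, so the worst case remains a 3-byte BMP character.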

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
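
The set above covers every code point except the surrogate range U+D800 to U+DFFF. As a rough illustration, Python's strict UTF-8 codec draws the same boundary; this is a sketch using Python's codec, not ICU's converter, and the helper name `is_representable` is hypothetical.

```python
def is_representable(code_point):
    """Return True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates, which UTF-8 does not encode.
        return False


# Probe the edges of the surrogate range and a few other code points.
probes = [0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xB2800, 0x10FFFF]
results = {cp: is_representable(cp) for cp in probes}

for cp, ok in results.items():
    print(f"U+{cp:04X}: {ok}")
```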