International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name   All Aliases
UTF-8                     UTF-8
                          ibm-1208
                          ibm-1209
                          ibm-5304
                          ibm-5305
                          ibm-13496
                          ibm-13497
                          ibm-17592
                          ibm-17593
                          windows-65001
                          cp1208
                          x-UTF_8J
                          unicode-1-1-utf-8
                          unicode-2-0-utf-8
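
The alias list can be read as a many-to-one mapping onto the internal converter name. The following is a minimal sketch of such a lookup, not ICU's own implementation: the names `UTF8_ALIASES`, `normalize` and `resolve_converter` are hypothetical, and the matching rule (ignore case, hyphens, underscores and spaces) is an assumed simplification.

```python
from typing import Optional

# Aliases copied from the list above; everything else here is illustrative.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every normalized alias points at the single internal converter name.
_ALIAS_TO_INTERNAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve_converter(name: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown."""
    return _ALIAS_TO_INTERNAL.get(normalize(name))


print(resolve_converter("IBM-1208"))
print(resolve_converter("latin-1"))
```

Normalizing both the stored aliases and the query keeps the table small while still accepting spellings such as `utf_8` or `CP1208`.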


Codepage Layout

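Currently showing the characters whose UTF-8 encoding starts with the bytes F1 B5 A5. The row label plus the column label gives the final byte of each four-byte sequence.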

Each cell gives the Unicode code point (hex) reached by that final byte. Rows without entries have no character under this prefix.

        00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70
80    75940 75941 75942 75943 75944 75945 75946 75947 75948 75949 7594A 7594B 7594C 7594D 7594E 7594F
90    75950 75951 75952 75953 75954 75955 75956 75957 75958 75959 7595A 7595B 7595C 7595D 7595E 7595F
A0    75960 75961 75962 75963 75964 75965 75966 75967 75968 75969 7596A 7596B 7596C 7596D 7596E 7596F
B0    75970 75971 75972 75973 75974 75975 75976 75977 75978 75979 7597A 7597B 7597C 7597D 7597E 7597F
C0-F0
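
The layout above follows directly from the bit structure of a four-byte UTF-8 sequence. As a small sketch (the helper names `decode_4byte` and `code_point_for` are hypothetical), the code points in the grid can be recomputed from the prefix F1 B5 A5 and a final byte in the range 80..BF:

```python
PREFIX = (0xF1, 0xB5, 0xA5)  # lead bytes shown in the layout above


def decode_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point.

    The lead byte carries 3 payload bits; each continuation byte carries 6.
    """
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


def code_point_for(final_byte: int) -> int:
    """Code point for PREFIX followed by the given final byte."""
    if not 0x80 <= final_byte <= 0xBF:
        # Only continuation bytes are valid here, which is why the
        # other rows of the grid are empty.
        raise ValueError("final byte must be in the range 80..BF")
    return decode_4byte(*PREFIX, final_byte)


first = code_point_for(0x80)
last = code_point_for(0xBF)
print(f"{first:X}..{last:X}")  # 75940..7597F

# Cross-check against Python's built-in UTF-8 decoder.
assert bytes([*PREFIX, 0x80]).decode("utf-8") == chr(first)
```

Only final bytes 80 through BF are continuation bytes, so the prefix reaches exactly 64 code points.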

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
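
The minimum and maximum of 1 and 3 bytes make sense once a UChar is read as a single UTF-16 code unit: a supplementary character takes four UTF-8 bytes but also two UChars. The sketch below checks that reading with Python's own codecs rather than ICU; the helper name `bytes_per_uchar` is hypothetical.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


samples = ["A", "\u00e9", "\u20ac", chr(0x75940)]
ratios = [bytes_per_uchar(ch) for ch in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The listed substitution bytes are the UTF-8 encoding of U+FFFD.
assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"

# Python's decoder uses the same character when asked to replace bad input;
# this is Python's behaviour, shown only as an analogy.
print(b"\xff".decode("utf-8", errors="replace") == "\ufffd")
```

The four-byte case yields 2 bytes per code unit, so the largest ratio comes from three-byte characters in the Basic Multilingual Plane.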

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
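
The set above says that every code point except the surrogate range U+D800..U+DFFF is representable. A minimal sketch of that membership test, compared against Python's strict UTF-8 encoder (the function names are hypothetical, and the upper bound U+10FFFF is the Unicode code space limit rather than something stated in the set notation):

```python
def is_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not 0xD800 <= cp <= 0xDFFF


def python_can_encode(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x75940, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp), python_can_encode(cp))
```

Both checks agree on the sample points, including the boundaries on either side of the surrogate block.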