International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0A4A2

       00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F   Code points
00-70  (no characters)
80     𤢀 𤢁 𤢂 𤢃 𤢄 𤢅 𤢆 𤢇 𤢈 𤢉 𤢊 𤢋 𤢌 𤢍 𤢎 𤢏   24880..2488F
90     𤢐 𤢑 𤢒 𤢓 𤢔 𤢕 𤢖 𤢗 𤢘 𤢙 𤢚 𤢛 𤢜 𤢝 𤢞 𤢟   24890..2489F
A0     𤢠 𤢡 𤢢 𤢣 𤢤 𤢥 𤢦 𤢧 𤢨 𤢩 𤢪 𤢫 𤢬 𤢭 𤢮 𤢯   248A0..248AF
B0     𤢰 𤢱 𤢲 𤢳 𤢴 𤢵 𤢶 𤢷 𤢸 𤢹 𤢺 𤢻 𤢼 𤢽 𤢾 𤢿   248B0..248BF
C0-F0  (no characters)
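The mapping from the byte prefix F0A4A2 to the code points in the layout can be checked by hand with the standard four-byte UTF-8 bit layout (11110xxx 10xxxxxx 10xxxxxx 10xxxxxx). The following is a minimal Python sketch, not ICU code; the helper name `decode_utf8_4byte` is hypothetical and exists only for this illustration.

```python
def decode_utf8_4byte(data: bytes) -> int:
    """Decode one four-byte UTF-8 sequence into a code point (illustrative helper)."""
    if len(data) != 4:
        raise ValueError("expected exactly four bytes")
    lead, t1, t2, t3 = data
    if lead & 0xF8 != 0xF0:
        raise ValueError("lead byte is not of the form 11110xxx")
    for trail in (t1, t2, t3):
        if trail & 0xC0 != 0x80:
            raise ValueError("trail byte is not of the form 10xxxxxx")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    cp = ((lead & 0x07) << 18) | ((t1 & 0x3F) << 12) | ((t2 & 0x3F) << 6) | (t3 & 0x3F)
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("overlong or out-of-range sequence")
    return cp


prefix = bytes.fromhex("F0A4A2")
first = decode_utf8_4byte(prefix + b"\x80")  # lowest valid final byte
last = decode_utf8_4byte(prefix + b"\xBF")   # highest valid final byte
print(f"U+{first:04X}..U+{last:04X}")        # prints U+24880..U+248BF
```

Only final bytes 80 through BF are valid trail bytes, which is why the rows 00-70 and C0-F0 of the layout are empty.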

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
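The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, so a four-byte UTF-8 sequence counts as two bytes for each of its two UChars and the maximum stays at 3. The sketch below illustrates this with Python's built-in codecs rather than ICU; the helper `bytes_per_utf16_unit` and the sample characters are assumptions chosen for the example. It also shows that the substitution bytes \xEF\xBF\xBD are the UTF-8 form of U+FFFD.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count (illustrative helper)."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# U+0041, U+00E9, U+20AC, and U+24880 (the first character of the layout above)
samples = ["A", "\u00e9", "\u20ac", "\U00024880"]
ratios = [bytes_per_utf16_unit(c) for c in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The substitution bytes decode to U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")  # U+FFFD

# Python's decoder uses the same character when asked to replace invalid input.
replaced = b"abc\xFFdef".decode("utf-8", errors="replace")
```

Python's replacement behavior is only analogous to ICU's substitution callback; it is shown here because both use U+FFFD.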


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
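The pattern [^\uD800-\uDFFF] reads as "every code point except the surrogate range". A minimal Python sketch of that membership test follows; the function name `utf8_representable` is hypothetical, and the check against Python's strict UTF-8 codec is an illustration rather than a statement about ICU's internals.

```python
def utf8_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range D800..DFFF."""
    return 0 <= cp <= 0x10FFFF and not 0xD800 <= cp <= 0xDFFF


# Python's strict UTF-8 encoder also refuses a lone surrogate.
try:
    "\ud800".encode("utf-8")
    strict_rejects_surrogate = False
except UnicodeEncodeError:
    strict_rejects_surrogate = True

# 0x110000 code points in total, minus 0x800 surrogates.
count = sum(1 for cp in range(0x110000) if utf8_representable(cp))
print(count)  # 1112064
```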