International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18E96

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񎖀
4E580
񎖁
4E581
񎖂
4E582
񎖃
4E583
񎖄
4E584
񎖅
4E585
񎖆
4E586
񎖇
4E587
񎖈
4E588
񎖉
4E589
񎖊
4E58A
񎖋
4E58B
񎖌
4E58C
񎖍
4E58D
񎖎
4E58E
񎖏
4E58F
80
90
񎖐
4E590
񎖑
4E591
񎖒
4E592
񎖓
4E593
񎖔
4E594
񎖕
4E595
񎖖
4E596
񎖗
4E597
񎖘
4E598
񎖙
4E599
񎖚
4E59A
񎖛
4E59B
񎖜
4E59C
񎖝
4E59D
񎖞
4E59E
񎖟
4E59F
90
A0
񎖠
4E5A0
񎖡
4E5A1
񎖢
4E5A2
񎖣
4E5A3
񎖤
4E5A4
񎖥
4E5A5
񎖦
4E5A6
񎖧
4E5A7
񎖨
4E5A8
񎖩
4E5A9
񎖪
4E5AA
񎖫
4E5AB
񎖬
4E5AC
񎖭
4E5AD
񎖮
4E5AE
񎖯
4E5AF
A0
B0
񎖰
4E5B0
񎖱
4E5B1
񎖲
4E5B2
񎖳
4E5B3
񎖴
4E5B4
񎖵
4E5B5
񎖶
4E5B6
񎖷
4E5B7
񎖸
4E5B8
񎖹
4E5B9
񎖺
4E5BA
񎖻
4E5BB
񎖼
4E5BC
񎖽
4E5BD
񎖾
4E5BE
񎖿
4E5BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]