International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage for byte sequences starting with the bytes F3 84 96

Rows give the high nibble of the final byte, columns the low nibble. Each cell lists the Unicode code point (hexadecimal) for that byte sequence; empty rows have no mapping.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  C4580 C4581 C4582 C4583 C4584 C4585 C4586 C4587 C4588 C4589 C458A C458B C458C C458D C458E C458F
90  C4590 C4591 C4592 C4593 C4594 C4595 C4596 C4597 C4598 C4599 C459A C459B C459C C459D C459E C459F
A0  C45A0 C45A1 C45A2 C45A3 C45A4 C45A5 C45A6 C45A7 C45A8 C45A9 C45AA C45AB C45AC C45AD C45AE C45AF
B0  C45B0 C45B1 C45B2 C45B3 C45B4 C45B5 C45B6 C45B7 C45B8 C45B9 C45BA C45BB C45BC C45BD C45BE C45BF
C0
D0
E0
F0
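
The relationship between the bytes F3 84 96 and the code points C4580 to C45BF follows from the standard four-byte UTF-8 bit layout. The following is a minimal sketch in Python, not part of the demo itself; the helper name `decode_four_byte` is hypothetical and it checks only the bit patterns, not every UTF-8 validity rule.

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Decode one 4-byte UTF-8 sequence by hand (bit-pattern checks only)."""
    if b0 & 0xF8 != 0xF0 or any(b & 0xC0 != 0x80 for b in (b1, b2, b3)):
        raise ValueError("not a 4-byte UTF-8 sequence")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)

# First three bytes from the page; the final byte selects the table cell.
first = decode_four_byte(0xF3, 0x84, 0x96, 0x80)
last = decode_four_byte(0xF3, 0x84, 0x96, 0xBF)
print(f"{first:X} {last:X}")  # C4580 C45BF

# Cross-check against Python's built-in UTF-8 decoder.
assert bytes([0xF3, 0x84, 0x96, 0x80]).decode("utf-8") == chr(first)
```

Only final bytes in the range 80 to BF are valid continuation bytes, which is consistent with rows 00 to 70 and C0 to F0 being empty.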

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
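
One way to read the "bytes per UChar" figures is as UTF-8 bytes per 16-bit UTF-16 code unit, since a UChar in ICU is a UTF-16 code unit. Under that reading, a supplementary character takes 4 bytes but also 2 UChars, so the ratio never exceeds 3. The sketch below illustrates this in Python; the helper name `bytes_per_uchar` and the sample characters are assumptions chosen for illustration, not something the demo provides.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units

# ASCII, Latin-1 supplement, BMP symbol, supplementary-plane code point.
samples = ["A", "\u00e9", "\u20ac", "\U000C4580"]
ratios = [bytes_per_uchar(ch) for ch in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The substitution bytes listed above decode to U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")  # U+FFFD
```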

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
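
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough illustration using Python's strict UTF-8 encoder rather than ICU, the sketch below probes the boundaries of that range; the helper name `utf8_representable` is hypothetical.

```python
def utf8_representable(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Lone surrogates are rejected by the strict encoder.
        return False

boundaries = [0xD7FF, 0xD800, 0xDFFF, 0xE000]
results = [utf8_representable(cp) for cp in boundaries]
print(results)  # [True, False, False, True]
```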