International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B880

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𸀀
38000
𸀁
38001
𸀂
38002
𸀃
38003
𸀄
38004
𸀅
38005
𸀆
38006
𸀇
38007
𸀈
38008
𸀉
38009
𸀊
3800A
𸀋
3800B
𸀌
3800C
𸀍
3800D
𸀎
3800E
𸀏
3800F
80
90
𸀐
38010
𸀑
38011
𸀒
38012
𸀓
38013
𸀔
38014
𸀕
38015
𸀖
38016
𸀗
38017
𸀘
38018
𸀙
38019
𸀚
3801A
𸀛
3801B
𸀜
3801C
𸀝
3801D
𸀞
3801E
𸀟
3801F
90
A0
𸀠
38020
𸀡
38021
𸀢
38022
𸀣
38023
𸀤
38024
𸀥
38025
𸀦
38026
𸀧
38027
𸀨
38028
𸀩
38029
𸀪
3802A
𸀫
3802B
𸀬
3802C
𸀭
3802D
𸀮
3802E
𸀯
3802F
A0
B0
𸀰
38030
𸀱
38031
𸀲
38032
𸀳
38033
𸀴
38034
𸀵
38035
𸀶
38036
𸀷
38037
𸀸
38038
𸀹
38039
𸀺
3803A
𸀻
3803B
𸀼
3803C
𸀽
3803D
𸀾
3803E
𸀿
3803F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]