International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F399B2

Rows give the high nibble of the final byte and columns give the low nibble. Each cell shows the resulting Unicode code point in hexadecimal.

          00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00..70  (empty)
80      D9C80 D9C81 D9C82 D9C83 D9C84 D9C85 D9C86 D9C87 D9C88 D9C89 D9C8A D9C8B D9C8C D9C8D D9C8E D9C8F
90      D9C90 D9C91 D9C92 D9C93 D9C94 D9C95 D9C96 D9C97 D9C98 D9C99 D9C9A D9C9B D9C9C D9C9D D9C9E D9C9F
A0      D9CA0 D9CA1 D9CA2 D9CA3 D9CA4 D9CA5 D9CA6 D9CA7 D9CA8 D9CA9 D9CAA D9CAB D9CAC D9CAD D9CAE D9CAF
B0      D9CB0 D9CB1 D9CB2 D9CB3 D9CB4 D9CB5 D9CB6 D9CB7 D9CB8 D9CB9 D9CBA D9CBB D9CBC D9CBD D9CBE D9CBF
C0..F0  (empty)
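
The mapping from the byte prefix F3 99 B2 to the code points D9C80 through D9CBF follows from the standard bit layout of a 4-byte UTF-8 sequence. The sketch below is illustrative only: the helper name `decode_utf8_4byte` is hypothetical and is not part of ICU, and it checks only the shape of the sequence, not overlong forms or range limits.

```python
def decode_utf8_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point.

    Hypothetical helper for illustration. It verifies the lead byte
    pattern 11110xxx and the trail byte pattern 10xxxxxx only.
    """
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("not a 4-byte UTF-8 sequence")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("invalid trail byte")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


prefix = bytes([0xF3, 0x99, 0xB2])
first = decode_utf8_4byte(prefix + bytes([0x80]))
last = decode_utf8_4byte(prefix + bytes([0xBF]))
print(f"{first:X}..{last:X}")  # D9C80..D9CBF
```

Only final bytes 80 through BF are valid trail bytes, which is why the other rows of the layout are empty.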

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
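
The maximum of 3 bytes per UChar can look surprising, since UTF-8 sequences run up to 4 bytes. A UChar in ICU is a 16-bit UTF-16 code unit, and a character that needs 4 UTF-8 bytes occupies two UChars, so it averages 2 bytes per UChar. The following sketch checks this with Python's built-in codecs rather than ICU itself; the function name `bytes_per_utf16_unit` is an assumption made for illustration.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """Return UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# ASCII, Latin-1 Supplement, BMP symbol, supplementary-plane code point.
samples = ["A", "\u00e9", "\u20ac", "\U000D9C80"]
ratios = [bytes_per_utf16_unit(ch) for ch in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
substitution = "\ufffd".encode("utf-8")
print(substitution.hex().upper())  # EFBFBD
```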

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
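
The set above excludes only the surrogate range D800 through DFFF, which UTF-8 cannot represent. A minimal sketch of that rule, again using Python's built-in UTF-8 codec as a stand-in for the ICU converter (the function names are hypothetical):

```python
def is_representable(cp: int) -> bool:
    """Mirror the set [^\\uD800-\\uDFFF] over the Unicode code point range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_in_utf8(cp: int) -> bool:
    """Ask Python's strict UTF-8 encoder whether it accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values on both sides of the surrogate range, plus the extremes.
boundaries = [0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xD9C80, 0x10FFFF]
for cp in boundaries:
    print(f"U+{cp:04X}", is_representable(cp), encodes_in_utf8(cp))
```

For every boundary value the set membership test and the encoder agree.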