International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A9B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󩲀
E9C80
󩲁
E9C81
󩲂
E9C82
󩲃
E9C83
󩲄
E9C84
󩲅
E9C85
󩲆
E9C86
󩲇
E9C87
󩲈
E9C88
󩲉
E9C89
󩲊
E9C8A
󩲋
E9C8B
󩲌
E9C8C
󩲍
E9C8D
󩲎
E9C8E
󩲏
E9C8F
80
90
󩲐
E9C90
󩲑
E9C91
󩲒
E9C92
󩲓
E9C93
󩲔
E9C94
󩲕
E9C95
󩲖
E9C96
󩲗
E9C97
󩲘
E9C98
󩲙
E9C99
󩲚
E9C9A
󩲛
E9C9B
󩲜
E9C9C
󩲝
E9C9D
󩲞
E9C9E
󩲟
E9C9F
90
A0
󩲠
E9CA0
󩲡
E9CA1
󩲢
E9CA2
󩲣
E9CA3
󩲤
E9CA4
󩲥
E9CA5
󩲦
E9CA6
󩲧
E9CA7
󩲨
E9CA8
󩲩
E9CA9
󩲪
E9CAA
󩲫
E9CAB
󩲬
E9CAC
󩲭
E9CAD
󩲮
E9CAE
󩲯
E9CAF
A0
B0
󩲰
E9CB0
󩲱
E9CB1
󩲲
E9CB2
󩲳
E9CB3
󩲴
E9CB4
󩲵
E9CB5
󩲶
E9CB6
󩲷
E9CB7
󩲸
E9CB8
󩲹
E9CB9
󩲺
E9CBA
󩲻
E9CBB
󩲼
E9CBC
󩲽
E9CBD
󩲾
E9CBE
󩲿
E9CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]