International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
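
The alias list above can be turned into a small lookup table. The following is a minimal sketch, not ICU's own alias matcher: the helper names `normalize` and `canonical_name` are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption made for this example.

```python
# Aliases copied from the table above; all of them map to the converter "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical loose matching: lowercase and drop '-', '_' and spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
LOOKUP = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return LOOKUP.get(normalize(name))


if __name__ == "__main__":
    for candidate in ("IBM-1208", "windows-65001", "utf8", "latin1"):
        print(candidate, "->", canonical_name(candidate))
```

Any name not in the table, such as `latin1`, yields `None` in this sketch.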


Codepage Layout

Currently showing the codepage starting with the bytes F29C80

Rows and columns give the high and low nibble of the final byte; each cell shows the Unicode code point (hex) for the byte sequence F2 9C 80 followed by that byte.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00  (empty)
10  (empty)
20  (empty)
30  (empty)
40  (empty)
50  (empty)
60  (empty)
70  (empty)
80  9C000 9C001 9C002 9C003 9C004 9C005 9C006 9C007 9C008 9C009 9C00A 9C00B 9C00C 9C00D 9C00E 9C00F
90  9C010 9C011 9C012 9C013 9C014 9C015 9C016 9C017 9C018 9C019 9C01A 9C01B 9C01C 9C01D 9C01E 9C01F
A0  9C020 9C021 9C022 9C023 9C024 9C025 9C026 9C027 9C028 9C029 9C02A 9C02B 9C02C 9C02D 9C02E 9C02F
B0  9C030 9C031 9C032 9C033 9C034 9C035 9C036 9C037 9C038 9C039 9C03A 9C03B 9C03C 9C03D 9C03E 9C03F
C0  (empty)
D0  (empty)
E0  (empty)
F0  (empty)
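
The mapping in the layout above follows from the UTF-8 bit layout for four-byte sequences. As a rough illustration, the sketch below decodes the lead bytes F2 9C 80 plus one trail byte in two ways: with Python's built-in UTF-8 codec, and with explicit bit arithmetic. The function names `cell_code_point` and `decode_four_bytes` are made up for this example and are not part of ICU.

```python
LEAD = bytes.fromhex("F29C80")  # the three leading bytes shown on this page


def cell_code_point(trail: int) -> int:
    """Decode LEAD + one trail byte using Python's built-in UTF-8 codec."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def decode_four_bytes(b0: int, b1: int, b2: int, b3: int) -> int:
    """Manual decoding of a 4-byte sequence: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx."""
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


if __name__ == "__main__":
    # Only trail bytes 0x80..0xBF are valid continuation bytes,
    # which is why only rows 80 through B0 of the layout are populated.
    for trail in (0x80, 0x8F, 0xB0, 0xBF):
        via_codec = cell_code_point(trail)
        via_bits = decode_four_bytes(0xF2, 0x9C, 0x80, trail)
        print(f"F2 9C 80 {trail:02X} -> U+{via_codec:X} (bit arithmetic: U+{via_bits:X})")
```

Both approaches should agree for every populated cell, from U+9C000 at trail byte 80 to U+9C03F at trail byte BF.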

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
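
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. A four-byte UTF-8 sequence corresponds to two UTF-16 code units, so the maximum per unit is 3, not 4. The sketch below checks this with Python's standard codecs. The helper `bytes_per_utf16_unit` is a hypothetical name chosen for this example, and the sample characters are arbitrary.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count of one character divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 sequence length (1, 2, 3 and 4 bytes).
SAMPLES = ["A", "\u00E9", "\u20AC", "\U0009C000"]

# The substitution bytes listed above decode to U+FFFD REPLACEMENT CHARACTER.
SUBSTITUTION = b"\xEF\xBF\xBD".decode("utf-8")

if __name__ == "__main__":
    for ch in SAMPLES:
        print(f"U+{ord(ch):04X}: {bytes_per_utf16_unit(ch)} bytes per UTF-16 unit")
    print(f"substitution character: U+{ord(SUBSTITUTION):04X}")
    # ASCII compatibility: printable ASCII encodes to the identical byte values.
    ascii_text = "".join(chr(c) for c in range(0x20, 0x7F))
    print(ascii_text.encode("utf-8") == ascii_text.encode("ascii"))
```

Python's own decoder happens to use the same replacement character when invalid input is decoded with `errors="replace"`.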


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
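
The set above excludes only the surrogate range U+D800 to U+DFFF, which is reserved for UTF-16 and has no valid UTF-8 encoding. A minimal sketch of that membership test, compared against the behavior of Python's strict UTF-8 encoder, follows. The function names `is_representable` and `encodes_as_utf8` are illustrative only.

```python
def is_representable(code_point: int) -> bool:
    """Membership test for [^\\uD800-\\uDFFF] within the Unicode range."""
    return 0 <= code_point <= 0x10FFFF and not (0xD800 <= code_point <= 0xDFFF)


def encodes_as_utf8(code_point: int) -> bool:
    """Check whether Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Lone surrogates are rejected by the strict UTF-8 codec.
        return False


if __name__ == "__main__":
    # Probe the boundaries of the surrogate block and the ends of the code space.
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x9C000, 0x10FFFF):
        print(f"U+{cp:04X}: set={is_representable(cp)} codec={encodes_as_utf8(cp)}")
```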