International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38B80

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󋀀
CB000
󋀁
CB001
󋀂
CB002
󋀃
CB003
󋀄
CB004
󋀅
CB005
󋀆
CB006
󋀇
CB007
󋀈
CB008
󋀉
CB009
󋀊
CB00A
󋀋
CB00B
󋀌
CB00C
󋀍
CB00D
󋀎
CB00E
󋀏
CB00F
80
90
󋀐
CB010
󋀑
CB011
󋀒
CB012
󋀓
CB013
󋀔
CB014
󋀕
CB015
󋀖
CB016
󋀗
CB017
󋀘
CB018
󋀙
CB019
󋀚
CB01A
󋀛
CB01B
󋀜
CB01C
󋀝
CB01D
󋀞
CB01E
󋀟
CB01F
90
A0
󋀠
CB020
󋀡
CB021
󋀢
CB022
󋀣
CB023
󋀤
CB024
󋀥
CB025
󋀦
CB026
󋀧
CB027
󋀨
CB028
󋀩
CB029
󋀪
CB02A
󋀫
CB02B
󋀬
CB02C
󋀭
CB02D
󋀮
CB02E
󋀯
CB02F
A0
B0
󋀰
CB030
󋀱
CB031
󋀲
CB032
󋀳
CB033
󋀴
CB034
󋀵
CB035
󋀶
CB036
󋀷
CB037
󋀸
CB038
󋀹
CB039
󋀺
CB03A
󋀻
CB03B
󋀼
CB03C
󋀽
CB03D
󋀾
CB03E
󋀿
CB03F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]