International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18E80

Rows give the high nibble of the final byte and columns give its low nibble. Each cell shows the Unicode code point (hex) mapped by that byte sequence.

          00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     4E000 4E001 4E002 4E003 4E004 4E005 4E006 4E007 4E008 4E009 4E00A 4E00B 4E00C 4E00D 4E00E 4E00F
90     4E010 4E011 4E012 4E013 4E014 4E015 4E016 4E017 4E018 4E019 4E01A 4E01B 4E01C 4E01D 4E01E 4E01F
A0     4E020 4E021 4E022 4E023 4E024 4E025 4E026 4E027 4E028 4E029 4E02A 4E02B 4E02C 4E02D 4E02E 4E02F
B0     4E030 4E031 4E032 4E033 4E034 4E035 4E036 4E037 4E038 4E039 4E03A 4E03B 4E03C 4E03D 4E03E 4E03F
C0-F0  (empty)
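
The layout above can be reproduced outside the demo. The following is a minimal sketch using Python's built-in UTF-8 codec rather than ICU itself; the helper name `utf8_row` is hypothetical and not part of any ICU API. It appends each possible final byte to the prefix F1 8E 80 and prints the code point that the resulting four-byte sequence decodes to.

```python
from typing import List


def utf8_row(prefix: bytes, row_start: int) -> List[str]:
    """Decode the 16 sequences prefix + one final byte in row_start..row_start+15."""
    return [
        (prefix + bytes([row_start + col])).decode("utf-8")
        for col in range(16)
    ]


# The three leading bytes shown by the demo page.
prefix = bytes.fromhex("F18E80")

# Only final bytes 0x80-0xBF are valid continuation bytes, which is why
# the other rows of the layout are empty.
for row_start in (0x80, 0x90, 0xA0, 0xB0):
    cells = utf8_row(prefix, row_start)
    print(f"{row_start:02X}: " + " ".join(f"{ord(c):05X}" for c in cells))
```

A final byte outside 80-BF (for example 41) is not a continuation byte, so decoding that sequence raises `UnicodeDecodeError` in Python.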

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
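
The byte counts above are stated per UChar, which in ICU is a 16-bit UTF-16 code unit, not a whole code point. The sketch below uses Python's standard codecs (not ICU) to illustrate one reading of those figures, on the assumption that this is why the maximum is 3 rather than 4: a four-byte UTF-8 sequence corresponds to two UTF-16 code units. The helper name `utf8_bytes_per_utf16_unit` is hypothetical.

```python
def utf8_bytes_per_utf16_unit(text: str) -> float:
    """Ratio of UTF-8 bytes to UTF-16 code units for the given text."""
    utf8_bytes = len(text.encode("utf-8"))
    utf16_units = len(text.encode("utf-16-le")) // 2
    return utf8_bytes / utf16_units


samples = {
    "U+0041 (ASCII)": "A",
    "U+20AC (BMP)": "\u20ac",
    "U+4E000 (supplementary)": "\U0004E000",
}
for label, ch in samples.items():
    print(label, len(ch.encode("utf-8")), utf8_bytes_per_utf16_unit(ch))

# The substitution character listed above is U+FFFD encoded in UTF-8.
substitution = "\ufffd".encode("utf-8")
print(substitution.hex().upper())
```

Python's decoder uses the same replacement character when asked to substitute: `b"\xff".decode("utf-8", errors="replace")` yields U+FFFD.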

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
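
The set above reads as every code point except the surrogate range U+D800 to U+DFFF. As a rough check, the following sketch tests the boundaries with Python's strict UTF-8 codec, which is assumed here to agree with the ICU converter on this point; the function name `representable_in_utf8` is hypothetical.

```python
def representable_in_utf8(code_point: int) -> bool:
    """Return True if the code point can be encoded by a strict UTF-8 encoder."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's encoder rejects lone surrogates (U+D800-U+DFFF).
        return False


# Probe both edges of the excluded range plus a few other values.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x4E000, 0x10FFFF):
    print(f"U+{cp:04X}", representable_in_utf8(cp))
```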