International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
WINDOWS
UTF-8 windows-65001
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F190A9

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񐩀
50A40
񐩁
50A41
񐩂
50A42
񐩃
50A43
񐩄
50A44
񐩅
50A45
񐩆
50A46
񐩇
50A47
񐩈
50A48
񐩉
50A49
񐩊
50A4A
񐩋
50A4B
񐩌
50A4C
񐩍
50A4D
񐩎
50A4E
񐩏
50A4F
80
90
񐩐
50A50
񐩑
50A51
񐩒
50A52
񐩓
50A53
񐩔
50A54
񐩕
50A55
񐩖
50A56
񐩗
50A57
񐩘
50A58
񐩙
50A59
񐩚
50A5A
񐩛
50A5B
񐩜
50A5C
񐩝
50A5D
񐩞
50A5E
񐩟
50A5F
90
A0
񐩠
50A60
񐩡
50A61
񐩢
50A62
񐩣
50A63
񐩤
50A64
񐩥
50A65
񐩦
50A66
񐩧
50A67
񐩨
50A68
񐩩
50A69
񐩪
50A6A
񐩫
50A6B
񐩬
50A6C
񐩭
50A6D
񐩮
50A6E
񐩯
50A6F
A0
B0
񐩰
50A70
񐩱
50A71
񐩲
50A72
񐩳
50A73
񐩴
50A74
񐩵
50A75
񐩶
50A76
񐩷
50A77
񐩸
50A78
񐩹
50A79
񐩺
50A7A
񐩻
50A7B
񐩼
50A7C
񐩽
50A7D
񐩾
50A7E
񐩿
50A7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]