International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
IBM: ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593
IANA: UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F384BD

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  C4F40 C4F41 C4F42 C4F43 C4F44 C4F45 C4F46 C4F47 C4F48 C4F49 C4F4A C4F4B C4F4C C4F4D C4F4E C4F4F
90  C4F50 C4F51 C4F52 C4F53 C4F54 C4F55 C4F56 C4F57 C4F58 C4F59 C4F5A C4F5B C4F5C C4F5D C4F5E C4F5F
A0  C4F60 C4F61 C4F62 C4F63 C4F64 C4F65 C4F66 C4F67 C4F68 C4F69 C4F6A C4F6B C4F6C C4F6D C4F6E C4F6F
B0  C4F70 C4F71 C4F72 C4F73 C4F74 C4F75 C4F76 C4F77 C4F78 C4F79 C4F7A C4F7B C4F7C C4F7D C4F7E C4F7F

Rows 00 through 70 and C0 through F0 are empty.
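
The relationship between the lead bytes F384BD and the code points in the layout can be checked by hand. The following is a minimal sketch in Python, not ICU code; the helper name `decode_utf8_4byte` is hypothetical and assumes a well-formed four-byte sequence. It combines the payload bits of the three lead bytes with a final byte and cross-checks the result against Python's built-in UTF-8 decoder.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Combine the payload bits of a well-formed 4-byte UTF-8 sequence.

    Hypothetical helper for illustration: the first byte carries 3 payload
    bits and each continuation byte carries 6. No validation is performed.
    """
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


prefix = bytes.fromhex("F384BD")  # lead bytes shown on this page

# Continuation bytes range over 0x80..0xBF, which matches rows 80 through B0.
first = decode_utf8_4byte(*prefix, 0x80)
last = decode_utf8_4byte(*prefix, 0xBF)
print(f"{first:X} {last:X}")  # prints "C4F40 C4F7F"

# Cross-check against Python's own decoder.
assert ord((prefix + b"\x80").decode("utf-8")) == first
assert ord((prefix + b"\xBF").decode("utf-8")) == last
```

Only final bytes 80 through BF are valid continuation bytes, which is consistent with the other rows of the layout being empty.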

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
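
The byte counts and the substitution character listed above can be illustrated with a short sketch. It uses Python's built-in codecs rather than ICU, and the helper `bytes_per_uchar` is a hypothetical name. A UChar is a 16-bit UTF-16 code unit, so a supplementary character occupies two UChars and four UTF-8 bytes, which stays within the three-bytes-per-UChar maximum.

```python
# Sketch using Python's built-in codecs, not ICU itself.
SUBSTITUTION = b"\xEF\xBF\xBD"  # value listed in the converter information


def bytes_per_uchar(text):
    """Return (UTF-8 byte count, UTF-16 code unit count) for a string."""
    utf8_len = len(text.encode("utf-8"))
    utf16_units = len(text.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


for sample in ("A", "\u00E9", "\u20AC", "\U000C4F40"):
    n_bytes, n_units = bytes_per_uchar(sample)
    print(f"U+{ord(sample):04X}: {n_bytes} byte(s) for {n_units} UChar(s)")

# The substitution bytes are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.
print(SUBSTITUTION.decode("utf-8") == "\uFFFD")  # prints True
```

Python's own decoder also emits U+FFFD for malformed input when called with `errors="replace"`; whether that matches ICU's behavior in every case is not claimed here.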


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
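
The set above appears to be written in UnicodeSet pattern notation: every code point except the surrogate range U+D800 through U+DFFF. A minimal sketch of the same rule, assuming Python's strict UTF-8 codec as a stand-in for the ICU converter (the function names are hypothetical):

```python
def is_representable(cp):
    """Mirror the set [^\\uD800-\\uDFFF]: any code point that is not a surrogate."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_as_utf8(cp):
    """Check the same property empirically with Python's strict UTF-8 encoder."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe the boundaries of the surrogate range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xC4F40, 0x10FFFF):
    assert is_representable(cp) == encodes_as_utf8(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```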