International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IBM IANA
UTF-8 ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F381B6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󁶀
C1D80
󁶁
C1D81
󁶂
C1D82
󁶃
C1D83
󁶄
C1D84
󁶅
C1D85
󁶆
C1D86
󁶇
C1D87
󁶈
C1D88
󁶉
C1D89
󁶊
C1D8A
󁶋
C1D8B
󁶌
C1D8C
󁶍
C1D8D
󁶎
C1D8E
󁶏
C1D8F
80
90
󁶐
C1D90
󁶑
C1D91
󁶒
C1D92
󁶓
C1D93
󁶔
C1D94
󁶕
C1D95
󁶖
C1D96
󁶗
C1D97
󁶘
C1D98
󁶙
C1D99
󁶚
C1D9A
󁶛
C1D9B
󁶜
C1D9C
󁶝
C1D9D
󁶞
C1D9E
󁶟
C1D9F
90
A0
󁶠
C1DA0
󁶡
C1DA1
󁶢
C1DA2
󁶣
C1DA3
󁶤
C1DA4
󁶥
C1DA5
󁶦
C1DA6
󁶧
C1DA7
󁶨
C1DA8
󁶩
C1DA9
󁶪
C1DAA
󁶫
C1DAB
󁶬
C1DAC
󁶭
C1DAD
󁶮
C1DAE
󁶯
C1DAF
A0
B0
󁶰
C1DB0
󁶱
C1DB1
󁶲
C1DB2
󁶳
C1DB3
󁶴
C1DB4
󁶵
C1DB5
󁶶
C1DB6
󁶷
C1DB7
󁶸
C1DB8
󁶹
C1DB9
󁶺
C1DBA
󁶻
C1DBB
󁶼
C1DBC
󁶽
C1DBD
󁶾
C1DBE
󁶿
C1DBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]