International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IBM IANA
UTF-8 ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F388B4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󈴀
C8D00
󈴁
C8D01
󈴂
C8D02
󈴃
C8D03
󈴄
C8D04
󈴅
C8D05
󈴆
C8D06
󈴇
C8D07
󈴈
C8D08
󈴉
C8D09
󈴊
C8D0A
󈴋
C8D0B
󈴌
C8D0C
󈴍
C8D0D
󈴎
C8D0E
󈴏
C8D0F
80
90
󈴐
C8D10
󈴑
C8D11
󈴒
C8D12
󈴓
C8D13
󈴔
C8D14
󈴕
C8D15
󈴖
C8D16
󈴗
C8D17
󈴘
C8D18
󈴙
C8D19
󈴚
C8D1A
󈴛
C8D1B
󈴜
C8D1C
󈴝
C8D1D
󈴞
C8D1E
󈴟
C8D1F
90
A0
󈴠
C8D20
󈴡
C8D21
󈴢
C8D22
󈴣
C8D23
󈴤
C8D24
󈴥
C8D25
󈴦
C8D26
󈴧
C8D27
󈴨
C8D28
󈴩
C8D29
󈴪
C8D2A
󈴫
C8D2B
󈴬
C8D2C
󈴭
C8D2D
󈴮
C8D2E
󈴯
C8D2F
A0
B0
󈴰
C8D30
󈴱
C8D31
󈴲
C8D32
󈴳
C8D33
󈴴
C8D34
󈴵
C8D35
󈴶
C8D36
󈴷
C8D37
󈴸
C8D38
󈴹
C8D39
󈴺
C8D3A
󈴻
C8D3B
󈴼
C8D3C
󈴽
C8D3D
󈴾
C8D3E
󈴿
C8D3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]