International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F189B6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񉶀
49D80
񉶁
49D81
񉶂
49D82
񉶃
49D83
񉶄
49D84
񉶅
49D85
񉶆
49D86
񉶇
49D87
񉶈
49D88
񉶉
49D89
񉶊
49D8A
񉶋
49D8B
񉶌
49D8C
񉶍
49D8D
񉶎
49D8E
񉶏
49D8F
80
90
񉶐
49D90
񉶑
49D91
񉶒
49D92
񉶓
49D93
񉶔
49D94
񉶕
49D95
񉶖
49D96
񉶗
49D97
񉶘
49D98
񉶙
49D99
񉶚
49D9A
񉶛
49D9B
񉶜
49D9C
񉶝
49D9D
񉶞
49D9E
񉶟
49D9F
90
A0
񉶠
49DA0
񉶡
49DA1
񉶢
49DA2
񉶣
49DA3
񉶤
49DA4
񉶥
49DA5
񉶦
49DA6
񉶧
49DA7
񉶨
49DA8
񉶩
49DA9
񉶪
49DAA
񉶫
49DAB
񉶬
49DAC
񉶭
49DAD
񉶮
49DAE
񉶯
49DAF
A0
B0
񉶰
49DB0
񉶱
49DB1
񉶲
49DB2
񉶳
49DB3
񉶴
49DB4
񉶵
49DB5
񉶶
49DB6
񉶷
49DB7
񉶸
49DB8
񉶹
49DB9
񉶺
49DBA
񉶻
49DBB
񉶼
49DBC
񉶽
49DBD
񉶾
49DBE
񉶿
49DBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]