International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38493

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󄓀
C44C0
󄓁
C44C1
󄓂
C44C2
󄓃
C44C3
󄓄
C44C4
󄓅
C44C5
󄓆
C44C6
󄓇
C44C7
󄓈
C44C8
󄓉
C44C9
󄓊
C44CA
󄓋
C44CB
󄓌
C44CC
󄓍
C44CD
󄓎
C44CE
󄓏
C44CF
80
90
󄓐
C44D0
󄓑
C44D1
󄓒
C44D2
󄓓
C44D3
󄓔
C44D4
󄓕
C44D5
󄓖
C44D6
󄓗
C44D7
󄓘
C44D8
󄓙
C44D9
󄓚
C44DA
󄓛
C44DB
󄓜
C44DC
󄓝
C44DD
󄓞
C44DE
󄓟
C44DF
90
A0
󄓠
C44E0
󄓡
C44E1
󄓢
C44E2
󄓣
C44E3
󄓤
C44E4
󄓥
C44E5
󄓦
C44E6
󄓧
C44E7
󄓨
C44E8
󄓩
C44E9
󄓪
C44EA
󄓫
C44EB
󄓬
C44EC
󄓭
C44ED
󄓮
C44EE
󄓯
C44EF
A0
B0
󄓰
C44F0
󄓱
C44F1
󄓲
C44F2
󄓳
C44F3
󄓴
C44F4
󄓵
C44F5
󄓶
C44F6
󄓷
C44F7
󄓸
C44F8
󄓹
C44F9
󄓺
C44FA
󄓻
C44FB
󄓼
C44FC
󄓽
C44FD
󄓾
C44FE
󄓿
C44FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]