International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38093

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󀓀
C04C0
󀓁
C04C1
󀓂
C04C2
󀓃
C04C3
󀓄
C04C4
󀓅
C04C5
󀓆
C04C6
󀓇
C04C7
󀓈
C04C8
󀓉
C04C9
󀓊
C04CA
󀓋
C04CB
󀓌
C04CC
󀓍
C04CD
󀓎
C04CE
󀓏
C04CF
80
90
󀓐
C04D0
󀓑
C04D1
󀓒
C04D2
󀓓
C04D3
󀓔
C04D4
󀓕
C04D5
󀓖
C04D6
󀓗
C04D7
󀓘
C04D8
󀓙
C04D9
󀓚
C04DA
󀓛
C04DB
󀓜
C04DC
󀓝
C04DD
󀓞
C04DE
󀓟
C04DF
90
A0
󀓠
C04E0
󀓡
C04E1
󀓢
C04E2
󀓣
C04E3
󀓤
C04E4
󀓥
C04E5
󀓦
C04E6
󀓧
C04E7
󀓨
C04E8
󀓩
C04E9
󀓪
C04EA
󀓫
C04EB
󀓬
C04EC
󀓭
C04ED
󀓮
C04EE
󀓯
C04EF
A0
B0
󀓰
C04F0
󀓱
C04F1
󀓲
C04F2
󀓳
C04F3
󀓴
C04F4
󀓵
C04F5
󀓶
C04F6
󀓷
C04F7
󀓸
C04F8
󀓹
C04F9
󀓺
C04FA
󀓻
C04FB
󀓼
C04FC
󀓽
C04FD
󀓾
C04FE
󀓿
C04FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]