International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3849F

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󄟀
C47C0
󄟁
C47C1
󄟂
C47C2
󄟃
C47C3
󄟄
C47C4
󄟅
C47C5
󄟆
C47C6
󄟇
C47C7
󄟈
C47C8
󄟉
C47C9
󄟊
C47CA
󄟋
C47CB
󄟌
C47CC
󄟍
C47CD
󄟎
C47CE
󄟏
C47CF
80
90
󄟐
C47D0
󄟑
C47D1
󄟒
C47D2
󄟓
C47D3
󄟔
C47D4
󄟕
C47D5
󄟖
C47D6
󄟗
C47D7
󄟘
C47D8
󄟙
C47D9
󄟚
C47DA
󄟛
C47DB
󄟜
C47DC
󄟝
C47DD
󄟞
C47DE
󄟟
C47DF
90
A0
󄟠
C47E0
󄟡
C47E1
󄟢
C47E2
󄟣
C47E3
󄟤
C47E4
󄟥
C47E5
󄟦
C47E6
󄟧
C47E7
󄟨
C47E8
󄟩
C47E9
󄟪
C47EA
󄟫
C47EB
󄟬
C47EC
󄟭
C47ED
󄟮
C47EE
󄟯
C47EF
A0
B0
󄟰
C47F0
󄟱
C47F1
󄟲
C47F2
󄟳
C47F3
󄟴
C47F4
󄟵
C47F5
󄟶
C47F6
󄟷
C47F7
󄟸
C47F8
󄟹
C47F9
󄟺
C47FA
󄟻
C47FB
󄟼
C47FC
󄟽
C47FD
󄟾
C47FE
󄟿
C47FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]