International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28997

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򉗀
895C0
򉗁
895C1
򉗂
895C2
򉗃
895C3
򉗄
895C4
򉗅
895C5
򉗆
895C6
򉗇
895C7
򉗈
895C8
򉗉
895C9
򉗊
895CA
򉗋
895CB
򉗌
895CC
򉗍
895CD
򉗎
895CE
򉗏
895CF
80
90
򉗐
895D0
򉗑
895D1
򉗒
895D2
򉗓
895D3
򉗔
895D4
򉗕
895D5
򉗖
895D6
򉗗
895D7
򉗘
895D8
򉗙
895D9
򉗚
895DA
򉗛
895DB
򉗜
895DC
򉗝
895DD
򉗞
895DE
򉗟
895DF
90
A0
򉗠
895E0
򉗡
895E1
򉗢
895E2
򉗣
895E3
򉗤
895E4
򉗥
895E5
򉗦
895E6
򉗧
895E7
򉗨
895E8
򉗩
895E9
򉗪
895EA
򉗫
895EB
򉗬
895EC
򉗭
895ED
򉗮
895EE
򉗯
895EF
A0
B0
򉗰
895F0
򉗱
895F1
򉗲
895F2
򉗳
895F3
򉗴
895F4
򉗵
895F5
򉗶
895F6
򉗷
895F7
򉗸
895F8
򉗹
895F9
򉗺
895FA
򉗻
895FB
򉗼
895FC
򉗽
895FD
򉗾
895FE
򉗿
895FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]